tblastn extend beyond homology region
1
0
Entering edit mode
4.8 years ago

Hi,

I am performing a very simple task. I got a protein sequence, with conserved domains and I am mapping this protein and domains to a genome. I found the analogous domain. However I would like to get 100 amino acids downstream and upstream the domain. Because its a tblastn I am a bit stuck on how to do that properly to get everything on the same reading frame.

Any suggestion?

Thanks !

blast • 649 views
ADD COMMENT
0
Entering edit mode
4.8 years ago

Are you trying to find the orthologous sequence from your protein in a genome ? If so, you can just extract your tblastn region with samtools faidx (like 100000bp upstream + 100000bp downstream) and then use exonerate to find the CDS of your gene on this genome, which will give you the exon sequences and also the intron borders.

ADD COMMENT

Login before adding your answer.

Traffic: 1663 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6