Genome sequence from protein
0
0
Entering edit mode
6.3 years ago

I've done tblast of a certain protein and got a list of hits. Is there a way to programmatically extract genomic regions for my hits?

I'm using biopython mostly

sequence alignment genome • 1.1k views
ADD COMMENT
0
Entering edit mode

Are you interested in extracting a specific region of the subjects or entire fasta sequence? What sort of databases are you using? Could you post your current code?

ADD REPLY
0
Entering edit mode

I am interested in a specific region, the database is ncbi nr

here is is the code

from Bio.Blast.Applications import NcbiblastxCommandline
tblastn='/uge_mnt/home/tim_ivanov/soft/blast/ncbi-blast-2.7.1+/bin/tblastn'
nt_db='/uge_mnt/home/tim_ivanov/DATA/fasta/nt'
blastx_cline = NcbiblastxCommandline(cmd=tblastn, query="mrp_NM_001316390.faa", db=nt_db, outfmt=5, out="opuntia.xml")
print blastx_cline
ADD REPLY

Login before adding your answer.

Traffic: 1940 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6