BLAST search against the genome
0
0
Entering edit mode
10.1 years ago

Dear all,

I apologize for asking a very basic question.

Using local BLAST searches against a genome, I am trying to retrieve full length homologous sequences of a gene of interest. Ultimately, I am interested in knowing the copy number of these genes and would also like to retrieve pseudogenes, if any. However, BLAST search (tblastn; BLAST 2.2) against the downloaded genomes (e.g., M. martensii: http://www.ncbi.nlm.nih.gov/Traces/wgs/?val=AYEL01#contigs) is retrieving several partial hits. I wonder if it is because I am BLAST searching against a collection of contigs and not against an assembled scaffold. Do I need to assemble these contigs into scaffolds before BLAST searching or are these files available elsewhere? Alternately, is there any tool that would be beneficial in this process?

Thank you very much in advance,

Regards,

Kartik

blast • 4.3k views
ADD COMMENT
0
Entering edit mode

If the gene doesn't exist in a full length form in the fasta file, then blast can't return it...

ADD REPLY
0
Entering edit mode

Hey. Thanks for that quick reply. Exactly why I wonder if I need to assemble these transcripts into a scaffold before BLAST searching against it.

ADD REPLY
0
Entering edit mode

If you're only getting partial matches then the answer is yes, you'll need to assemble things further :)

ADD REPLY
0
Entering edit mode

Of course, this all assumes that the gene of interest can even align in its full length against the actual full genome, were it to exist. If that ends up not being the case, then there's nothing you can do but work with the partial hits.

ADD REPLY

Login before adding your answer.

Traffic: 1484 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6