Question

comparing two genomes

0

Entering edit mode

9.8 years ago

dr_mm_sh • 0

I am trying to find low similarity DNA sequences from two genomes of very closely related fungal species. Blast+ alignment of concatenated genomic sequences showed hits with identities around 70~80%. However, when I retrieved these DNA sequences of the both query and subject and made blastn search against genomes of each other, I was able to see hits with 100% identities.

What do you recommend to improve the accuracy of alignment results?

Command used to produce the alignment was:

$ blastn -db db_file -query input_file.txt -out output.txt -max_target_seqs 100 -outfmt 7

blast alignment • 2.4k views

ADD COMMENT • link updated 9.8 years ago by Sishuo Wang ▴ 230 • written 9.8 years ago by dr_mm_sh • 0

score 0 · Answer 1 · 2015-10-15

Hello,

The blastn command actually executes the megablast algorithm by default. The megablast algorithm is very fast but finds fewer results than the blastn algorithm.

If you want to execute the blastn algorithm, you should try the following :

blastn -task blastn -db db_file -query input_file.txt -out output.txt -max_target_seqs 100 -outfmt 7

This post talks about this "-task" option.

score 0 · Answer 2 · 2015-10-15

0

Entering edit mode

9.8 years ago

Sishuo Wang ▴ 230

discontinuous megablast might also be helpful or you can change '-word_size' (=11 be default).

ADD COMMENT • link 9.8 years ago by Sishuo Wang ▴ 230