Number of Blast hits for multiple sequence fasta
1
0
Entering edit mode
9.4 years ago
mosquitoes • 0

Using command line blast v. 2.2.31+ the neither of the following commands will return only 2 subjects per query:

blastn -query multiple_sequences.fa -db database -task blastn -dust no -outfmt "7 std" -max_target_seqs 2 -out outputfile
blastn -query multiple_sequences.fa -db database -task blastn -dust no -outfmt "7 std" -num_alignments 2 -out outputfile

There are a few posts about this, but none related to submitting with multiple sequences in the fasta. Any ideas why these commands do not work for only outputting two subject sequences per query?

blast • 2.8k views
ADD COMMENT
0
Entering edit mode

Look for this command; I think it is.

-max-hsps 2
ADD REPLY
0
Entering edit mode
9.4 years ago
Lesley Sitter ▴ 610

Hi,
Just found this link;
http://seqanswers.com/forums/archive/index.php/t-27735.html

Seems it has something to do with HSP? Although i can't find any additional information about this om the NCBI site itself.
Also maybe you would want to try outfmt 6, it does not have any comment lines which makes it easier to parse (then you can just create a small script or awk argument that only takes the first hit)

ADD COMMENT
2
Entering edit mode

As far as I know, max_targets looks at the number of matched sequences, not the number of actually produced hits. Meaning, if you have multiple hits to the same sequence (say an entire chromosome) this still only counts as a single hit on the target.

ADD REPLY

Login before adding your answer.

Traffic: 2644 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6