Need 'max matches in a query range'
0
0
Entering edit mode
8.2 years ago

The ncbi online blast tool has an option of limiting the 'max matches in a query range' so that one section of your sequence doesn't fully consume all of the blast hits, leaving other portions of the segment uncharacterized by the blast. However, I can't seem to figure out a way to be NCBIWWW.qblast to have a similar result.

The only work-around I have found is to ramp up the hitlist_size (to 10,000) & filter through the blast results based on where they hit my query. While doable, I would feel more confident in the results if I could limit the 'max matches in a query range' while also keeping the file sizes down.

Anyone know of an option in NCBIWWW.qblast that allows this?

blast • 2.5k views
ADD COMMENT
0
Entering edit mode

I don't know NCBIWWW.qblast but I am assuming it runs NCBI BLAST+. In this case, I think what you're looking for is the BLAST+ -culling_limit option.

ADD REPLY
0
Entering edit mode

NCBIWWW.qblast is a biopython module that allows you to BLAST over the web (which I am hoping to do since I am using the BLAST DB refseq_protein). But it's looking like I may need to sacrifice speed for confidence in results & use BLAST+ w/ culling_limit. Thanks for your suggestion!

http://biopython.org/DIST/docs/api/Bio.Blast.NCBIWWW-module.html

ADD REPLY

Login before adding your answer.

Traffic: 2563 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6