How to prevent blastp from aligning small parts of the query sequence and scoring them highly?
7.2 years ago
hodayabeer ▴ 10

Hi, I BLASTed the yeast proteome against a large database of Archaea. Something seems wrong with the results: too many proteins come back with a very high percent identity. I suspect that BLAST finds a good alignment for a small part of the query sequence and scores that, so the hit looks good even though most of the query sequence does not match. How can I prevent BLAST from reporting these short alignments? Would setting the window_size parameter help, or normalizing the bit score by the query length? Do HSPs have something to do with this? Thanks so much!

blast alignment sequence • 2.1k views

Can you give an example?

You can filter for results with a lower E-value, or with higher coverage or identity (using the tabular output format).


But I want to get only alignments that cover the full query sequence.


If you are getting the BLAST output in tabular format, you can have it display the alignment length and then parse the output for your preferred alignment length.
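
To make the lengths available for parsing, the tabular output has to be asked for them explicitly, since the default `-outfmt 6` columns include the alignment length but not the query or subject length. A rough sketch of building such a command, assuming BLAST+ is installed; the file names, the database name and the helper `build_blastp_command` are placeholders, not anything from the original run:

```python
import shlex

# Tabular columns to request. qlen and slen (query / subject length) are
# not part of the default -outfmt 6 column set, so they are listed explicitly.
FIELDS = "qseqid sseqid pident length qstart qend sstart send evalue bitscore qlen slen"


def build_blastp_command(query, db, out):
    """Return the blastp argument list (hypothetical helper, placeholder paths)."""
    return [
        "blastp",
        "-query", query,
        "-db", db,
        "-out", out,
        "-outfmt", f"6 {FIELDS}",
    ]


cmd = build_blastp_command("yeast_proteome.faa", "archaea_db", "hits.tsv")
# Print a shell-safe command line; alternatively pass cmd to subprocess.run(cmd, check=True).
print(shlex.join(cmd))
```

With `qlen` and `slen` in every row, the alignment span can be compared against the full sequence lengths without going back to the FASTA files.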


How can I extract only the results where the alignment length is equal or close to the query/subject length?


It's called glocal mode (global-local), unfortunately not implemented in BLAST.


How can I extract only the results where the alignment length is equal or close to the query/subject length?


Output the results in tabular format (-outfmt 6) and then filter them with awk, Excel, R, etc.
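
As an illustration of that filtering step, here is a minimal Python sketch. It assumes the search was run with `-outfmt "6 qseqid sseqid pident length qstart qend sstart send evalue bitscore qlen slen"` so that each row carries the query and subject lengths; the function name `filter_by_coverage` and the 0.8 cutoff are arbitrary choices for the example, not BLAST defaults.

```python
import csv

# Column order assumed to match the -outfmt string given in the text above.
COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "qstart", "qend",
    "sstart", "send", "evalue", "bitscore", "qlen", "slen",
]


def filter_by_coverage(handle, min_qcov=0.8, min_scov=0.0):
    """Yield hits whose aligned span covers enough of the query (and subject).

    Coverage is computed per HSP (one output row), not summed over all
    HSPs of the same query/subject pair.
    """
    reader = csv.DictReader(handle, fieldnames=COLUMNS, delimiter="\t")
    for row in reader:
        # Fraction of the query and of the subject spanned by this HSP.
        qcov = (abs(int(row["qend"]) - int(row["qstart"])) + 1) / int(row["qlen"])
        scov = (abs(int(row["send"]) - int(row["sstart"])) + 1) / int(row["slen"])
        if qcov >= min_qcov and scov >= min_scov:
            yield row["qseqid"], row["sseqid"], float(row["pident"]), qcov, scov


if __name__ == "__main__":
    import sys

    # Usage: python filter_hits.py < hits.tsv   (hits.tsv is a placeholder name)
    for qseqid, sseqid, pident, qcov, scov in filter_by_coverage(sys.stdin):
        print(qseqid, sseqid, pident, f"{qcov:.2f}", f"{scov:.2f}", sep="\t")
```

Because each row is one HSP, a query that matches in several separate pieces will be rejected here even if the pieces together span most of it, which is usually what you want when looking for near full-length alignments. If your BLAST+ version supports the `-qcov_hsp_perc` option, a similar per-HSP query coverage cutoff can be applied during the search itself.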
