How can I get 100 percent identity and coverage when I run blastp?
4.4 years ago
Obex33 ▴ 30

Hi Biostars,

I would like to keep only hits with 100 percent identity and 100 percent coverage in blastp. This is my command:

blastp -task blastp-short -query infile.fa  -db database.fa -out outfile.txt -outfmt 6

Thank you

alignment • 1.3k views
4.4 years ago
hugo.avila ▴ 530

There may be an easier way to do it, but here is how I do it:

Replace the last argument of your command (-outfmt 6) with this:

-outfmt '6 qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore qcovs qcovhsp'

Then, with a text-processing tool such as awk, filter by pident (column 3) and qcovs (column 13):

awk '$3 == 100 && $13 == 100 {print}' outfile.txt > filtered_outfile.txt
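
A possibly stricter variant, as a sketch only: pident is reported per HSP, while qcovs is the query coverage summed over all HSPs for a subject, so a partial HSP can pass the filter above when other HSPs cover the rest of the query. If you want a single alignment that spans the whole query, filtering on qcovhsp (column 14 in the format above) may be closer to what you need. The function name filter_perfect_hsps is hypothetical, and the column numbers assume the exact -outfmt string shown above.

```shell
# Keep only HSPs with 100% identity (column 3: pident) that also
# cover the whole query on their own (column 14: qcovhsp).
# Assumes tab-separated output produced with the 14-column -outfmt string above.
filter_perfect_hsps() {
    awk -F '\t' '$3 == 100 && $14 == 100' "$1"
}
```

Usage: filter_perfect_hsps outfile.txt > filtered_outfile.txt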

Be aware that a query can have multiple hits; choose the ones with the best (lowest) evalue and (highest) bitscore.
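
One way to pick a single hit per query could look like the sketch below. It assumes the 14-column tab-separated format above (bitscore in column 12, evalue in column 11) and a sort that supports -g for general numeric values such as 1e-50 (GNU sort does). The function name best_hit_per_query is hypothetical.

```shell
# For each query (column 1), keep the hit with the highest bitscore (column 12);
# ties are broken by the lowest evalue (column 11).
# -g is used so that values in scientific notation (e.g. 2e-05) sort correctly.
best_hit_per_query() {
    LC_ALL=C sort -t "$(printf '\t')" -k1,1 -k12,12gr -k11,11g "$1" |
        awk -F '\t' '!seen[$1]++'   # print only the first line seen per query
}
```

Usage: best_hit_per_query filtered_outfile.txt > best_hits.txt (best_hits.txt is just an example file name). If several subjects tie exactly on both values, only one of them is kept, so check such cases by hand if they matter.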

Thanks @hugo.avila, this works perfectly.
