Hello people
I'm trying to run blast (blast-2.2.26) on a fasta file. But I get this error message:
[blastall] WARNING: MaulwurfLeber_H21F7XH01DFTZ0_rank=0133853_x=1293_: Could not calculate ungapped Karlin-Altschul parameters due to an invalid query sequence or its translation. Please verify the query sequence(s) and/or filtering options
I call blast like this:
blastall -p blastp -i MaulwurfLeber_prots.fasta -d Pool_new_unclustered -o Contigs_prots_vs_New_unclustered.tab -a 8 -m 8 -e 0.001
The reads which causes the error are the following:
>MaulwurfLeber_H21F7XH01DFTZ0_rank=0133853_x=1293_0_y=1550_0_length=353_-gene_1
VEIGEVVVFGEVETVVGEAVEVEAGEVVEVEVGEVEVGEVVVGEVVVV
>MaulwurfLeber_H21F7XH01DFTZ0_rank=0133853_x=1293_0_y=1550_0_length=353_-gene_2
VRWWSVRWWSFEEVKVVVGEVEVVVGEAVEVEISEVEVGEWSR
The fasta file was created with hmmsearch called like this:
hmmsearch --tblout Contigs_prots_vs_PFAMa.tab --cpu 8 -o Contigs_prots_vs_PFAMa.out --noali Pfam-A.hmm MaulwurfLeber_prots.fasta
My calls are based on this scripts from VirSorter.
https://github.com/simroux/VirSorter
I'm running the script on a cluster.
Even though this should not be happening I wonder if blast is not liking the = and - characters in the fasta header. Can you try replacing them with an "_" and see if that helps.
You are not using the latest blast so if possible upgrade.