Hi all, I downloaded an 88 MB genome from NCBI, built a BLAST database for the organism, and tried to use tblastn to find a protein homolog. I got the error below and am looking for ways to overcome it. I have done this before and it worked fine, but not this time. I checked the line mentioned in the error and it looked fine. I assumed there might be a problem with my download, so I repeated the download twice and still got the same error.
Command
tblastn -db genome -query protein.txt -out file2.txt
Error
Error: NCBI C++ Exception: T0 "/private/tmp/blast-20170511-49816-1rw4ib1/ncbi-blast-2.6.0+-src/c++/src/objtools/readers/fasta.cpp", line 2428: Error: CFastaReader: Near line 1, there's a line that doesn't look like plausible data, but it's not marked as defline or comment. (m_Pos = 1)
I think the error is more likely triggered by your input FASTA file. Does that one look OK?
Yes, the FASTA looks fine to me. Is there any way to check it other than visually scanning through?
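One way to check without eyeballing the file is a small script that reads it line by line and reports anything that does not look like FASTA. The sketch below is only an illustration: the function name `check_fasta` and the set of characters it accepts are my own assumptions, not anything BLAST itself uses. It flags gzip-compressed input, non-ASCII bytes (for example a byte-order mark or rich-text formatting left by a word processor), sequence data that appears before the first `>` line, and unexpected characters in sequence lines.

```python
import string

# Assumption: sequence lines contain only letters, '*' (stop) and '-' (gap).
ALLOWED = set(string.ascii_letters + "*-")


def check_fasta(path, max_problems=10):
    """Return a list of (line_number, message) tuples for suspicious lines."""
    problems = []
    with open(path, "rb") as handle:
        # gzip files start with the magic bytes 0x1f 0x8b.
        if handle.read(2) == b"\x1f\x8b":
            return [(1, "file is gzip-compressed; decompress it first")]
        handle.seek(0)

        seen_defline = False
        for number, raw in enumerate(handle, start=1):
            if len(problems) >= max_problems:
                break
            try:
                line = raw.decode("ascii")
            except UnicodeDecodeError:
                problems.append((number, "non-ASCII bytes (BOM, rich text, or binary data?)"))
                continue
            line = line.rstrip("\r\n")
            if not line.strip():
                continue  # ignore blank lines
            if line.startswith(">"):
                seen_defline = True
            elif not seen_defline:
                problems.append((number, "sequence data before any '>' defline"))
            elif not set(line) <= ALLOWED:
                bad = sorted(set(line) - ALLOWED)
                problems.append((number, f"unexpected characters: {bad!r}"))
    return problems
```

Calling `check_fasta("protein.txt")` returns an empty list if nothing suspicious was found, and a list of line numbers with messages otherwise. Since the error mentions line 1, it is probably worth running it on the query file first and then on the genome FASTA that the database was built from.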