Hey so for the past two days I have been trying to install and execute a stand alone blast named ncbi-blast-2.2.30+
on a centos os system. I managed to download a nr ref sequence from ncbi ftp using the command wget ftp://ftp.ncbi.nlm.nih.gov/blast/db/FASTA/nr.gz
and extracted the file using the command tar -xvpf nr.gz
. I got an nr file of 33 gb size but when I try to format the file using the command
./makeblastdb -in /home/Desktop/ncbi-blast-2.2.30+/db/nr -dbtype 'nucl' -input_type 'fasta' -out /home/Desktop/ncbi-blast-2.2.30+/output
I get the error showing
BLAST Database creation error: FASTA-Reader: No residues given
Can anyone give any suggestions on the nature of the problem and how I can solve it?
You need a
z
in your tar command for gzip files. Use-xvzf
. The file is probably not unzipped correctly. but also, the nr database is protein ("Non-Redundant peptides") so you will be creating a protein database. Think about whether that is what you want and why. I believe that if you ran the makeblastdb correctly it would tell you that you are mixing up protein and nucleotides. If you want the whole nucleotide database it is called 'nt ("Non-Translated nucleotides). I can't remember if they are the official acronyms or whether it's just what my brain uses to keep them the correct way around.