Hello guys,
I downloaded the NCBI files and created my local db. Everything seems to go well, but running the blastn -db /mnt/NTFS/NGS-DBs/NCBI-RefSeq/ViralSeq_2021-12-14 -query otus.fasta -evalue 1e-3 -word_size 11 -outfmt "6 staxid staxids"
I obtain N/A and 0, respectively.
Here is the complete procedure I executed:
##Create the NCBI RefSeq viruses db
#Download sequences
genome_updater.sh -d "refseq" -g "viral" -c "all" -f "genomic.fna.gz" -o "all_virus_genomes" -t 4
#Create single fasta file containing all fasta sequences from *.fna.gz files
zcat *.fna.gz > viralseq_2021-12-14.fna
#Create BLASTN database
makeblastdb -in viralseq_20212-12-14.fna -dbtype nucl -title ViralSeq -input_type fasta -out ViralSeq_2021-12-14
Please, can someone help me to understand where is my mistake? How can I solve the problem? Thank you
Hello. Thank you very much for your help. I looked for the right file to download, but the NCBI repository looks like a jungle ... Please, can you suggest me which file to download? Thank you.
All needed info should be explained on this page: https://www.ncbi.nlm.nih.gov/sites/books/NBK569841/
Hello, as suggested by lieven.sterck I built the new db with the following command (I hope the nucl_gb.accession2taxid is the right file for mapping):
Then I repeated my blastn command, but nothing changed:
I also checked if, for example, the "AC_000019.1" is contained in the mapping file (nucl_gb.accession2taxid):
It is so.
Where is my mistake?
Please.
Thank you
Use
101010
button to formatcode
. First option (that you are using) is for quoting text.Downloaded taxonomy files either need to be in the same folder as your index or in folder designated with
$BLASTDB
variable. This works with version 5 (-blastdb_version 5
) of blast database which I assume you are using if you are using the latestblast+
package.