makeblastdb reports 'Adding sequences from FASTA; added 27200239 sequences in 55053 seconds.' but the output files are missing
9.1 years ago

I am trying to index the nt database. I have used makeblastdb before and it has worked perfectly. But when I tried to index my NT database, I got this message: "Adding sequences from FASTA; added 27200239 sequences in 55053 seconds." I did specify an output file path, but the result is not there.

./makeblastdb -in <nt path here> -title nt3.2_new -dbtype nucl -out <output path here>/nt3.2New/ -parse_seqids

Above is the command I used (<nt path here> and <output path here> stand in for the real paths, which I have abbreviated). I found a similar issue posted here, but the OP found the result files without explaining where they were. I am using grep to see if I can find them, but no luck. Any ideas or suggestions would be highly appreciated. Thanks!
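A more direct search than grep could look like the sketch below. It rests on an assumption worth checking: that makeblastdb treats -out as a file name prefix rather than a directory, so a value ending in "/" might leave files whose names begin with a dot (for example .00.nhr), which a plain ls does not show. The function name find_blast_files and the OUT_DIR variable are made up for illustration, and only the core nucleotide extensions are listed.

```shell
# Sketch: list nucleotide BLAST database files in a directory,
# including hidden ones whose names start with a dot.
# find_blast_files and OUT_DIR are hypothetical names, not part of BLAST+.
find_blast_files() {
    dir="$1"
    # find's -name patterns also match a leading dot, unlike shell globs.
    find "$dir" -maxdepth 1 -type f \
        \( -name '*.nhr' -o -name '*.nin' -o -name '*.nsq' -o -name '*.nal' \)
}

# Point OUT_DIR at the directory given to -out; defaults to the current directory.
find_blast_files "${OUT_DIR:-.}"
```

Running ls -la on the same directory is a quick cross-check, since it lists dotfiles as well.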


Is there a reason you are making your own when NCBI makes pre-made indexes for NT available? ftp://ftp.ncbi.nih.gov/blast/db/


Yes, I am using a filtered NT, meaning I have filtered out the environmental samples, so I have to index it myself.


If you have a list of GIs for the env samples, you could use blastdb_aliastool to create the subset BLAST database.

Not sure if it would be any faster since you already have the filtered NT files available.
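A rough sketch of what the alias approach could look like, assuming a pre-built nt database is already on disk and a text file with one GI per line is available. As far as I know the -gilist option restricts the alias to the GIs in the file, so the list would need to hold the sequences to keep rather than the ones to drop. The function name make_subset_db and the file names in the example call are hypothetical, and GI-related options may differ between BLAST+ releases, so check blastdb_aliastool -help first.

```shell
# Sketch: wrap blastdb_aliastool to build an alias database limited to a GI list.
# make_subset_db is a hypothetical helper name, not part of BLAST+.
# Arguments: 1 = source database, 2 = GI list file, 3 = output name, 4 = title
make_subset_db() {
    blastdb_aliastool -db "$1" -dbtype nucl -gilist "$2" -out "$3" -title "$4"
}

# Example call (file names are placeholders):
# make_subset_db nt keep_gis.txt nt_no_env "nt without environmental samples"
```

The alias database is a small file that points back at the original nt volumes, so no sequences are re-indexed.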
