Error When Using Makeblastdb On A Large File
2
0
Entering edit mode
12.2 years ago
vijay ★ 1.6k

Dear All,

I have created a stand alone blast on my system, to work with NGS data and it is working fine. However, I have got a sequence file which is about 2400MB, which is two large to create a local database and I am facing a error when i use the makeblastdb command.

Please can someone advise me on how to create a local database out of this file.

-Vijay

blast • 3.6k views
ADD COMMENT
1
Entering edit mode

please indicate the command that you issue and the error that you are getting

ADD REPLY
1
Entering edit mode
12.2 years ago
Sujai Kumar ▴ 270

I agree that the OP should specify the full command and error (thanks Istvan) - but just in case this helps:

makeblastdb gives an error if there are any tabs or other weird characters in the fasta file. Check your sequence headers, and replace all weird spaces with ordinary spaces.

I use the following:

perl -i -plne 's/\s+/ /g' sequence.fasta
ADD COMMENT
0
Entering edit mode
12.2 years ago
neal.platt ▴ 240

That is not an overly large file to make a blastdb from. Most vertebrate genomes are that size...granted you will have millions of reads compared to dozens of chromosomes.

I tested making a blastdb on a short read file of similar size and it was fine. Nothing special..

formatdb -pF -i short.read.fa

What type of error are you getting specifically?

ADD COMMENT

Login before adding your answer.

Traffic: 2859 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6