Entering edit mode
8.3 years ago
2013302630028
▴
10
When I use `makeblastdb` command to build a protein database by using a fasta file, it is ok. But when combined this fasta file together <pre>makeblastdb</pre> tell me have a BLAST options error.
My BLAST+ version is 2.4.0+, computer system version is ubuntu 16.04 and the detail imformation is below.
sqreb@sqreb-Vostro-1450:~/ORF_culster/blastdb$ makeblastdb -in progbbct1.fasta -input_type fasta -dbtype prot -parse_seqids -title 'bct_protein data in genbank' Building a new DB, current time: 07/21/2016 16:05:51 New DB name: /home/sqreb/ORF_culster/blastdb/progbbct1.fasta New DB title: bct_protein data in genbank Sequence type: Protein Keep MBits: T Maximum file size: 1000000000B Adding sequences from FASTA; added 50272 sequences in 1.98013 seconds. makeblastdb -in progbbct2.fasta -input_type fasta -dbtype prot -parse_seqids -title 'bct_protein data in genbank' Building a new DB, current time: 07/21/2016 16:06:30 New DB name: /home/sqreb/ORF_culster/blastdb/progbbct2.fasta New DB title: bct_protein data in genbank Sequence type: Protein Keep MBits: T Maximum file size: 1000000000B Adding sequences from FASTA; added 70668 sequences in 2.765 seconds. sqreb@sqreb-Vostro-1450:~/ORF_culster/blastdb$ cat -b progbbct1.fasta progbbct2.fasta >> progbbct_12.fasta sqreb@sqreb-Vostro-1450:~/ORF_culster/blastdb$ makeblastdb -in progbbct_12.fasta -input_type fasta -dbtype prot -parse_seqids -title 'bct_protein data in genbank' Building a new DB, current time: 07/21/2016 16:08:07 New DB name: /home/sqreb/ORF_culster/blastdb/progbbct_12.fasta New DB title: bct_protein data in genbank Sequence type: Protein Keep MBits: T Maximum file size: 1000000000B BLAST options error: progbbct_12.fasta does not match input format type, default input type is FASTA
This is my blast file:
>BAA21794.1|AB000100.1 intrinsic membrane protein|121..912 MVRTPVPLYLRWAVSILSVLAFLAIWQIAAASGFLGKTFPGSLRTLQDLFGWLSDPFFDN GPNDLGIGWNLLISLRRVAIGYLLATVVAIPLGIAIGMSALASSIFSPFVQLLKPVSPLA WLPIGLFLFRDSELTGVFVILISSLWPTLINTAFGVANVNPDFLKVSQSLGASRWRTILK VILPAALPSIIAGMRISMGIAWLVIVAAEMLLGTGIGYFIWNEWNNLSLPNIFSAIIIIG IVGILLDQGFRFLENQFSYAGNR >BAA21795.1|AB000100.1 malK-like protein|916..1785 MISEAVPAKEETGQAQLLIEQVGKVFTVNSPSLLDRLRQRSPKRYVALEDVNLTIASNTF VSIIGPSGCGKSTLLNLIAGLDLPTSGQILLDGQRIRSPGPDRGIVFQNYALMPWMTALE NVIFAVETARPNLSKSQAREVAREHLELVGLTKAADRYPGQISGGMKQRVAIARALSIRP KLLLMDEPFGALDALTRGYLQEEVLRIWEANKLSVVLITHSIDEALLLSDRIVVMSRGPR ATIREVIDLPAVRPRQRSVIEEDERFVKIKLRLEEHLFNETRAVEEASV ... >BAL47787.1|AB648215.1 isocitrate dehydrogenase|<1..>518 DAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIREYRVAIKGPLTTPVGG GIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADS ADAEKVINFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIAND >BAL47788.1|AB648216.1 isocitrate dehydrogenase|<1..>518 DAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIREYRVAIKGPLTTPVGG GIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADS ADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIAND (END)
There is some bug in my parser and I fix it. Thank you!