8.2 years ago
sukesh1411
Hi,
I could not create a BLAST database from the nucleotide (nt) file, which is a text file with sequences in FASTA format.
It fails with "Error: Duplicate seq_ids are found."
How can I remove these duplicate seq_ids? Can anyone help me with this?
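One possible way to drop the duplicates before building the database is sketched below in Python. It is only a sketch under stated assumptions: the seq_id is taken to be the first whitespace-delimited token of each header line, only the first record for each seq_id is kept, and the function and file names (dedupe_fasta, dedupe_file, nt, nt.dedup) are made up for illustration. For a file as large as nt, the set of seen IDs may need a lot of memory.

```python
def dedupe_fasta(lines):
    """Yield FASTA lines, skipping any record whose seq_id was already seen.

    Assumption: the seq_id is the first whitespace-delimited token after '>'.
    Only the first record for each seq_id is kept.
    """
    seen = set()
    keep = False  # lines before the first header are dropped
    for line in lines:
        if line.startswith(">"):
            header = line[1:].strip()
            seq_id = header.split()[0] if header else ""
            keep = seq_id not in seen
            seen.add(seq_id)
        if keep:
            yield line


def dedupe_file(in_path, out_path):
    """Stream in_path to out_path without loading the sequences into memory."""
    with open(in_path) as src, open(out_path, "w") as dst:
        dst.writelines(dedupe_fasta(src))


# Hypothetical usage (file names are placeholders):
# dedupe_file("nt", "nt.dedup")
```

The deduplicated file can then be passed to the database-building step in place of the original.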
I am trying the uclust tools. The nt file that I downloaded from the BLAST database is a text file with sequences in FASTA format. The uclust tools do not accept this text format. How can I convert the text to FASTA format?
Can you post a few lines from your text file as an example?
Only the > is missing from the header line. If your file is small, you can just replace gi| with >gi|. If the file is huge, use any code to make the same replacement.
This is the same question you have already posted here. It seems you have a FASTA file. I assume you have got the answer for this question.
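For a huge file, the gi| to >gi| replacement suggested above could be done with a short streaming script. The Python below is a minimal sketch, assuming every header line starts with gi| at the beginning of the line and that no sequence line does; the function and file names (add_header_marker, fix_file, nt.txt, nt.fasta) are hypothetical.

```python
def add_header_marker(lines, prefix="gi|"):
    """Yield lines, putting '>' in front of any line that starts with prefix.

    Lines that already start with '>' do not match the prefix, so they
    pass through unchanged, as do sequence lines.
    """
    for line in lines:
        if line.startswith(prefix):
            yield ">" + line
        else:
            yield line


def fix_file(in_path, out_path):
    """Stream in_path to out_path one line at a time, so file size is not a problem."""
    with open(in_path) as src, open(out_path, "w") as dst:
        dst.writelines(add_header_marker(src))


# Hypothetical usage (file names are placeholders):
# fix_file("nt.txt", "nt.fasta")
```

Because the file is read line by line, memory use stays small regardless of how large the input is.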