I have fasta file with different headers. Basically assembled fasta files some are assembled with different versions so they got same fasta names. When I tried to make a blast database with -parse_seqids
its complaining me of duplicate id's. So I would like to add a extension to with version of assembly to its fasta headers.
Examples
Input fasta sequences: (I am just showing headers here)
>Contig1_Node1_length20_cov30 Date:03/01/2015 Sequence_Organism:Other
>Contig2_deg1 Date:03/01/2015 Sequence_Organism:Other
>Contig3_jcg20839 Date:03/01/2015 Sequence_Organism:Other
Output fasta sequences:
>Contig1_Node1_length20_cov30_V2 Date:03/01/2015 Sequence_Organism:Other
>Contig2_deg1_V2 Date:03/01/2015 Sequence_Organism:Other
>Contig3_jcg20839_V2 Date:03/01/2015 Sequence_Organism:Other
Exclude the
-parse_seqids
while creating blast database. It will not give any error.True but I need
-parse_seqids
to extract sequences from fasta file.