Hi all,
I have an assembled transcriptome (Trinity) and I am about to perform annotation. My goal is to basically made this a public database were people can blast their sequences against. Is there any tool that beside performing regular annotation, would also change the name of the header of the contig once it is annotated?
I don't want to just upload my assembled transcriptome with the header of the sequences referring only to the contig, i.e.:
>contig1
ACTCTA..
>contig2
TACCTCS..
But rather having something like this:
>contig1 annotation_protein_X
ACTCTA..
>contig2 annotation_protein_y
TACCTCS..
So that if you blast your sequences against it, you would know what is in there. Also, since the trascriptome was generated by merging multiple samples together, I would like to now blast those samples individually against the main trascriptome obtained and look at the differences. I hope that it was clear, I don't know if there is a typical approach to do so but I would like to hear some opinion.
Thanks in advance!