Hello, I have ~200000 de novo assembled plant transcripts and have to do a proteome annotation. I have run BLASTX of my transcripts against protein databases. However, I got very poor hits. My final goal is to functionally annotate my transcripts. Any suggestions? A very detailed explanation is very appreciated since I have never worked in this field. Also, any other suggestions on this topic are welcomed.
they are short, yes, my wheat is also tetraploid, so problem of homeologs. I did blast against Phytozome as well, I got hits for less than half of the contigs. At the end the only database that gave me more hits was Ensembl. Why is that half of the seq-s are non-coding? May I ask you which workflow did you use to annotate blueberry? thank you for the post