I will soon have an assembled transcriptome and I have a good idea on what downstream analysis I would like to do. However, I am worried that basing the downstream functional annotation off of BLAST will cause losses in species specific differences. My interest is less in doing a basic assessment of how many genes in my model species land in the different reference organisms and more in highlighting the differences. In other words, chances are BLAST will show that there are more common genes than uncommon but I need to know the differences in the common genes.
First, I was wondering duplicating the analysis with assembled transcripts vs predicted ORFs would be worth it. Or, if I should go a step further and run the analysis using predicted peptide sequences.
Second, can anyone suggest ways to perform a more fine-grained functional/comparative analysis than just applying annotations based on BLAST results?
My model species is the only de novo sample, the rest is going to come from ensembl (Cat, Dog, Cow, Horse, Human, Rhesus, Mouse). We have some conditions I can do the DE analysis with, but it is a side note. The goal of using stimuli was to try and activate different transcription profiles providing a broader pool of transcripts. BLAST gets me a rough idea of what this gene might do, but nothing else.
I was thinking more along the lines of sequence analysis or annotation. E.g. geneX from humans has a known phosphorylation site at pos Y, but in the model organism this site seems to be missing. This project is to find leads for molecular work, and I want to go further and provide readily testable targets.