Hello everyone:
I am confused about what's next when I do a differential expression analysis. I made a DE analysis with one sample, two conditions and no replications (I know that I should have replications, but the experiment was done in that way).
I have the list of transcripts from the DE, but when I open both transcriptomes and look for the transcripts (say TRINITY_DN12605_c0_g1_i1), in the first transcriptome (the one of the first condition), the sequence is longer than the second transcriptome. How should I handle that in order to generate the proteome? I am doing it right? Which sequence to use (the one from the first condition or the second)?
Thanks in advance.
~g
You need to ask yourself what your biological question is that you are trying to answer.
I would like to know if exposing the specie to pathogens and forcing it to express more, it generates/activates some mechanisms of defense against the pathogens. Should I consider both transcripts (from normal and infected sample) or only from infected?
UPDATE
I was thinking about your answer and I think that only from the infected one, because is the one that is activating o expressing more due to the event in which the specie has been exposed (infected). Does that make sense?
Thanks.
Since you used Trinity I think it's safe to assume your organism of interest doesn't have a reference genome? Why would you want to generate a "the proteome"?
Anyway, since you haven't used replicates I would suggest to first validate (e.g. qPCR) some differentially expressed genes (properly, with replicates) to make sure you are working with "true" biological differential expression and not just artefacts/random shifts in expression.
Thanks for reply WouterDeCoster.
And yes, I have assembled them with Trinity, but I don't have more replicates, I only have one sample from each condition. I want to generate the proteome in order to discover peptides.
Thanks.
That would be a double prediction right? You are assembling a putative transcriptome using trinity and then predicting peptides from that?
If discovering peptides is your aim why not do a proteomic discovery experiment instead of RNAseq?
Edit: I think you are using public RNAseq data (if this is related to one of your other posts) to do this exercise (so no actual experiment involved?).
Thanks for reply @genomax2
Yes, I am using public data, and in fact, I am reproducing the experiment exposed in the paper. Also, I am assemblying the transcriptome for later generate the proteome and finally, study the proteome (I think this is what you mean by doing proteomic discovery).
Finally I was able to run the scripts mentioned above.
Thanks.
Proteomic discovery experiment would be taking extracted protein complement from this organism and identifying peptides/proteins from that prep (after proper purification/digestion etc) with mass spectrometry. Since you are not actually doing bench experiments that is not applicable. That would priovide direct evidence that those peptides were present in the proteins of the organism and are real.