Hi dear biostars.
I got hand on some microarray data which is already "user-friendly" processed. Unfortunatelly, I get very nervous if I'm supposed to work with data I don't understand. I will try to explain, what I think I understood so far.
So I have some columns with "RMA_log2Signal", which is already normalized. Then there is another column with "fold change".
When I calculate
- Signal control - Signal experiment = signal fold change (e.g. 1,6-11,3 = -9,7)
- 1/(2^signal fold change) (1/2^(-9,7) = 832)
I get the "fold change" documented. Does this mean that my gene of interest is 832 fold higher express than in the control condition?
Thanks for your advice, or also google tipps...
Cheers, Jaccy
Hello Jaccy. Which array type and version is it? Mainly interested to hear if it is single- or two-channel (colour).
Hey Kevin, it's single-channel (Affymetrix Gene chip)
Cool - great. The advice that Ahill gives is pretty good. I should say, too, that you could technically keep just the RMA_log2Signal columns and still proceed to using, for example, limma, for the purposes of doing a differential expression analysis. Limma will calculate moderated statistics via empirical Bayes. Currently, it is obvious that the fold change that is in your data was calculated manually.
You would obviously also only keep these 'RMA' columns when generating box-and-whisker plots, histograms, cluster dendrograms, etc.
Also remember that the 832 is a linear fold change difference - pretty high but can happen with this type of data.
Thank you so much for the advices, this will help me a lot!