Hi everybody,
I know that this is probably an easy question, but to be honest I am getting so confused at the moment, that I am not ashamed to ask any more.
I have a table (expressionSet) of expression values from an array experiment of two conditions. I would like to compare the two conditions to identify differentially expressed genes. I have five arrays for each of the conditions (total of 10 arrays).
The way I thought about doing it was to calculate the mean for the two conditions and than divide the treatment by the control:
gene1 mean_treat 1000
gene1 mean_ctrl 100
gene1 fold-change 1000/100 = 10
I would think, that gene1 has a differential expression of 10xfold. But if the values are already log-values? How do I calculate the differences than? Do I do it the same way? or do I subtract the two values from each other? Is this is the correct way?
gene1 mean_treat 9.965784 (log(1000,2))
gene1 mean_ctrl 6.643856 (log(100,2))
==>gene1 fold-change 9.965784/6.643856= 1.5
==>gene1 fold-change 9.965784-6.643856= 3.321928
So which one of the two fold-changes is correct?
Again I am so sorry for needing to ask this question, but I would appreciate your help and maybe someone can refer me to a paper/book to read
Thanks a lot Tomas
your hint that affy expressionSet is typically in log2 scale (and thus I need to subtract, not divide) solved my troubles reproducing an existing study and saved my day :-)