Hi,
Even at this stage, I want to thank anyone will can help me.
I'm using the Limma package in order to anlayse the gene expression in different tissues(Sample). For each tissue I want to know which genes are overexpressed and underexpressed; for example I would be able to say that the gene_x is more expressed in tissue A than B (and C) but less expressed than D.
I read the manual of Limma package and I understood that the makeContrasts
function defines the type of comparison among tissues, but I didn't understand the comparison of following examples and their difference:
1) makeContrasts(B-A,C-B,C-A,levels=c("A","B","C"))
In this case the output is:
Contrasts
Levels B- A C - B C - A
A -1 0 -1
B 1 -1 0
C 0 1 1
After fit2
and eBayes
command, Which tissue is the output of topTable function?
2) makeContrasts(contrasts="A-(B+C)/2",levels=c("A","B","C"))
Contrasts
Levels A-(B+C)/2
A 1.0
B -0.5
C -0.5
This case compare the tissue(Sample) A with B and C, it's right? What is the reason of division by two?
The output of topTable
is the expression level of tissue A, it hasn't?
What is the difference beteween
makeContrasts(contrasts="A-(B+C)/2",levels=c("A","B","C"))
and makeContrasts(A-(B+C)/2,levels=c("A","B","C"))
?
3) x <- c("B-A","C-B","C-A"); makeContrasts(contrasts=x, levels=c("A","B","C"))
Contrasts
Levels B-A
A -1
B 0
C 1
I don't understand this output and the difference with makeContrasts(C-A,levels=design)
.
I think that the example 2) is suitable for my analysis but I'm not sure and I wont understand the division by the number of tissue(sample), is always necessary? and What do it means from the scientific point of view?
In the even that the second example is suitable for my analysis, I should to do an iterative procedure that compare one tissue at a time with all others; For example
the first comparison is:
makeContrasts(contrasts="A-(B+C)/2",levels=c("A","B","C"))
the second is
makeContrasts(contrasts="B-(A+C)/2",levels=c("A","B","C"))
the third is:
makeContrasts(contrasts="C-(A+B)/2",levels=c("A","B","C"))
and so on, if there are over three tissues. It's right? If this approach is wrong, How should I proceed?
Thank you very much!
Best
Thank for the answer.
I don't understand when should I specify the argument
contrast
inmakeContrasts
? Maybe I didn't understand the meaning of contrasts.2) the output in last part is the same, but in one case the
makeContrasts
returns (near Levels) the word contrast in the other one returns the specific combination of sample.However the purpose of my analysis is to obtain for each gene a ranked list of expression level in tissues, sorting in decreasing way in according to expression level; so I think that I must to compare the first group with all others but I don't know if the comparison must to do one tissue at a time or together; for example:
the results are different, but I didn't understand the scientific difference. What could be the correct approach for my analysis?
Thanks a lot again!
Next time you probably want to comment on my answer not on the original question.
A very simple way is to think of it as a see-saw where expression of each of your conditions is on one side, you want to know if the see-saw is significantly up or down for each gene so you know that gene expression is higher or lower when comparing between your conditions. When you have two conditions, then you have two things sitting on one side of the see-saw. To make the weight even on both sides, you could take the average of the two conditions. However, taking the average expression of two very diverse conditions does not really make any sense, that's why the A-B, A-C, B-C comparisons makes more sense. If you want to ask the question of is A different from anything else (B and C) then that type of analysis is more like anova, the limma manual also goes over this I believe. The limma manual does a decent job at explaining different types of analysis, I would go over it again and try and pick out the example that most closely matches your biological question.
I'm sorry, I wanted comment your answer but I was mistaken.
However, I understood that for my analysis is necessary an anova analysis. So, thanks a lot for your advice and answers, now I'm going to read better the manual and understand the analysis that I should do. If I had any possible questions (about this discussion) I would try to ask you.
Thanks a lot again.