Hi,
I've a general question about deseq2 designs. Let say I've a experiment with three factors A, B and C. Each factor is independent from each other. I want to assess differential expression based on factor A. Is the following design correct ?
~C+B+A
other question. Is the design ~B+C+A the same thing than doing ~C+B+A ?
Thanks
re: plotting. After the first release, I realized that it's better to have the output of
results()
be the input toplotMA
, so users have to specify the comparison to plot.plotMA(dds)
is really justplotMA(results(dds))
and so the docs don't mentionplotMA(dds)
anymoreTime for me to read the most recent docs :)
OK thanks. But let's say that factor A represent the cell type and factor B the presence of a specific mutation in a gene. I want to detect the effect of the presence of this specific mutation (let's drop factor C for now). So my design would be
~A+B
. But if the order in the design in not important; and as the transcriptome of different cell types are very different, DESeq will give me the genes that are differentially expressed based mostly on factor A ( I don't expect to have an important effect due to the mutation defined in factor B). Is that correct?The the effect of B is small, then yes, you should get mostly (perhaps only) DE genes due to A.
so if I use the design ~B it would only give me genes DE based on the mutation (factor B) but a problem arise due to different cell types comparison.. Not simple..
I suspect you're forgetting that you can use
~A+B
and specify which coefficient you want the results for (seehelp(results)
). So you can still use~A+B
and only get the results forB
.someting like
Yup, that'd give you the effect of Bmut while controlling for everything else.
I tried both
results(dds)
andresults(dds,name="Bmut")
and it gives me the same p-values (foldchange are different though) .. maybe I'm missing something . My design is slighty different as in the upper example: I've a factor B with three conditions (WT,mut1, mut2). factor A remain the cell type So:I'm not sure how that would work in the context of an LRT, why not use a wald test?
I thought if a factor has more than two conditions, it's better to use the LRT function. My bad.
Only if you don't care about the effect of each level. The LRT is asking, "Is there a difference due to B?" What I suspect you actually want to ask is, "Is there an effect of Bmut vs. BWT?", or whatever else the 3rd level of B is called. That sort of question is most easily addressed with a Wald test.
OK I understand a little bit better now. Thanks Devon.