Different DEGs with different numbers of input samples
2.9 years ago

Recently, I came across an issue: when I directly compare sample A (n=3) and sample B (n=3) using DESeq2, there are only a few DEGs. However, if I add more samples (i.e., C, D, E, F, ..., each n=3) to the dataset using DESeqDataSetFromMatrix() and then run results(dds, contrast=c("type","sample A", "sample B")), there are more DEGs, even though I am comparing the same two sample types. May I know why this happens?

DESeq2 • 684 views
2.9 years ago

This is not an issue; it is a feature. DESeq2 looks at all samples in the dataset to estimate dispersion, i.e., the intrinsic (within-group) variability in gene expression. More samples in the model usually lead to a more accurate dispersion estimate, which can increase the power to detect DEGs.
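To illustrate why extra groups help, here is a minimal base-R sketch. It is only an analogy: it uses normally distributed data and a plain pooled variance, not DESeq2's negative binomial dispersion estimation, and the names (`pooled_var`, `est_2`, `est_6`) are made up for this example. The point is that the within-group variance estimate is less noisy when it is pooled over six groups of three than over two groups of three.

```r
set.seed(42)
true_sd <- 1   # assumed true within-group standard deviation
n_rep   <- 3   # replicates per group, as in the question

# Pooled within-group variance for k groups of n_rep replicates each.
pooled_var <- function(k) {
  x <- matrix(rnorm(k * n_rep, sd = true_sd), nrow = n_rep)  # columns = groups
  mean(apply(x, 2, var))  # equal group sizes, so pooling is a simple mean
}

# Repeat the experiment many times to see how much the estimate fluctuates.
est_2 <- replicate(5000, pooled_var(2))  # A and B only: 2 * (3 - 1) = 4 residual df
est_6 <- replicate(5000, pooled_var(6))  # A to F: 6 * (3 - 1) = 12 residual df

# Spread of the estimates around the true variance of 1
sd(est_2)
sd(est_6)
```

The second spread should come out smaller than the first, because the estimate based on six groups has three times as many residual degrees of freedom.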

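Usually, it is recommended to include all samples from a given study, even if they are irrelevant to your A vs B comparison (see this manual). However, there are exceptions, for example when the within-group variability in one group is very different from that in the others. Because DESeq2 estimates a single dispersion per gene across all groups, a much noisier group can inflate that estimate for the A vs B comparison.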
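If you want to check this on your own data, the two approaches from this thread can be run side by side. The sketch below is a rough outline under some assumptions: the counts are simulated with makeExampleDESeqDataSet() as a stand-in for a real count matrix, the group labels A to F and the column name "type" are placeholders, and the 0.05 cutoff is arbitrary. Simulated data like this contains no true differences, so it only shows the mechanics, not the effect on DEG numbers.

```r
library(DESeq2)

set.seed(1)
# Simulated counts stand in for a real matrix: 1000 genes x 18 samples.
cts <- counts(makeExampleDESeqDataSet(n = 1000, m = 18))
coldata <- data.frame(
  type = factor(rep(c("A", "B", "C", "D", "E", "F"), each = 3)),
  row.names = colnames(cts)
)

# Approach 1: fit all six groups, then extract the A vs B contrast.
dds_all <- DESeqDataSetFromMatrix(cts, coldata, design = ~ type)
dds_all <- DESeq(dds_all)
res_all <- results(dds_all, contrast = c("type", "A", "B"))

# Approach 2: keep only the A and B samples before fitting.
keep   <- coldata$type %in% c("A", "B")
col_ab <- droplevels(coldata[keep, , drop = FALSE])
dds_ab <- DESeqDataSetFromMatrix(cts[, keep], col_ab, design = ~ type)
dds_ab <- DESeq(dds_ab)
res_ab <- results(dds_ab, contrast = c("type", "A", "B"))

# Number of genes passing the (arbitrary) adjusted p-value cutoff
sum(res_all$padj < 0.05, na.rm = TRUE)
sum(res_ab$padj < 0.05, na.rm = TRUE)

# Final per-gene dispersion estimates under each approach
summary(dispersions(dds_all))
summary(dispersions(dds_ab))
```

Comparing the dispersion summaries (or running plotDispEsts() on each object) is a quick way to see whether the extra groups are pulling the estimates up or down for your genes of interest.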

I see. Thank you for your detailed explanation.
