High inconsistency between edgeR and DESeq2
1
1
Entering edit mode
3 months ago
lyan125 ▴ 10

Hi

I returned to TCGA raw counts (downloaded from xena) and performed a differential expression analysis between mutation x to "WT", once with DESeq and once with edgeR. I keep default settings. I know the statistics are different (one using geometric the other log-ratio based). However, I got 24k significant results (padj <0.05) with DESeq2 while only about half with edgeR (FDR <0.05). In addition, there are about 7500 common significant genes. The direction (logFC) of the significant genes is similar and almost perfectly matched but not the power.

I noticed that many of the top significant features in the edgeR analysis are related to ribosomal RNA genes, and most of them are marked as "NA" or insignificant in the DESeq2 analysis. There is logic behind those results; it seems DESeq2 is treating them as "noise." Conversely, some results that are expected to be highly upregulated—and indeed are upregulated in DESeq2—show weak statistical strength in edgeR (by logFC)

It's important to note that the comparison is unbalanced (one group has four times as many people as the other group) but both higher than 30**

I would appreciate any suggestion about what should I do further

Best

log-ratio edgeR GLM DESeq2 • 449 views
ADD COMMENT
2
Entering edit mode
3 months ago
bk11 ★ 3.0k

Please check the post here. It has been discussed very explicitly before. Hugely different results between edgeR and DESeq2

ADD COMMENT

Login before adding your answer.

Traffic: 1831 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6