Question

High inconsistency between edgeR and DESeq2


8 months ago

lyan125 ▴ 10

Hi

I went back to the TCGA raw counts (downloaded from Xena) and ran a differential expression analysis comparing mutation x against "WT", once with DESeq2 and once with edgeR, keeping the default settings in both. I know the two methods differ (one is geometric-mean based, the other log-ratio based). However, I got about 24k significant genes (padj < 0.05) with DESeq2 but only about half as many with edgeR (FDR < 0.05), and only about 7500 genes are significant in both. The direction (sign of logFC) of the significant genes agrees almost perfectly between the two tools, but the statistical strength does not.
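For anyone who wants to reproduce the tally, here is a minimal Python sketch of how the overlap and direction agreement can be counted once both result tables are exported. The gene names and values are made-up toy data, and the field names (`padj`, `lfc`) and the helper `compare_results` are hypothetical, not part of either package.

```python
import math

def significant(table, alpha=0.05):
    """Return the set of genes whose adjusted p-value is below alpha.

    Genes with a missing (None or NaN) adjusted p-value never count as significant.
    """
    hits = set()
    for gene, row in table.items():
        p = row["padj"]
        if p is not None and not math.isnan(p) and p < alpha:
            hits.add(gene)
    return hits

def compare_results(deseq2, edger, alpha=0.05):
    """Count tool-specific and shared significant genes, plus sign agreement."""
    sig_d = significant(deseq2, alpha)
    sig_e = significant(edger, alpha)
    shared = sig_d & sig_e
    # A shared gene agrees in direction when both log fold changes have the same sign.
    same_sign = sum(1 for g in shared if deseq2[g]["lfc"] * edger[g]["lfc"] > 0)
    return {
        "deseq2_only": len(sig_d - sig_e),
        "edger_only": len(sig_e - sig_d),
        "shared": len(shared),
        "same_direction": same_sign,
    }

# Toy tables (assumed layout: gene -> adjusted p-value and log2 fold change).
deseq2 = {
    "geneA": {"padj": 0.001, "lfc": 2.0},
    "geneB": {"padj": 0.03, "lfc": -1.1},
    "geneC": {"padj": None, "lfc": 0.4},   # reported as NA by DESeq2
    "geneD": {"padj": 0.04, "lfc": 0.3},
}
edger = {
    "geneA": {"padj": 0.002, "lfc": 1.8},
    "geneB": {"padj": 0.20, "lfc": -0.9},
    "geneC": {"padj": 0.0001, "lfc": 0.5},
    "geneD": {"padj": 0.01, "lfc": 0.35},
}

summary = compare_results(deseq2, edger)
print(summary)
```

Splitting the counts into tool-only and shared sets makes it easier to see whether the disagreement comes from genes one tool never tested (NA) or from genes both tested but ranked differently.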

I noticed that many of the top significant features in the edgeR analysis are ribosomal RNA genes, and most of them are marked as "NA" or are not significant in the DESeq2 analysis. There is some logic behind those results; it seems DESeq2 is treating them as "noise." Conversely, some genes that are expected to be highly upregulated, and indeed are upregulated in DESeq2, show weak statistical strength in edgeR (by logFC).
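One way to check the "noise" impression is to look at which columns are NA for those genes. As far as the DESeq2 documentation describes it, a row with all-zero counts has baseMean 0 and NA everywhere, a row flagged as a count outlier (by Cook's distance) has NA for both the p-value and the adjusted p-value, and a row dropped by independent filtering has NA only for the adjusted p-value. The sketch below classifies rows under that assumption; the helper `na_reason` and the toy rows are hypothetical.

```python
import math

def is_na(x):
    """Treat None and NaN as missing."""
    return x is None or (isinstance(x, float) and math.isnan(x))

def na_reason(base_mean, pvalue, padj):
    """Guess why a DESeq2 row has a missing adjusted p-value (assumed rules)."""
    if not is_na(padj):
        return "tested"
    if base_mean == 0:
        return "all_zero_counts"
    if is_na(pvalue):
        return "count_outlier"
    # p-value present but adjusted p-value missing
    return "independent_filtering"

# Toy rows (assumed layout: gene -> (baseMean, pvalue, padj)).
rows = {
    "gene1": (150.0, 1e-6, 1e-4),
    "gene2": (0.0, None, None),
    "gene3": (5000.0, float("nan"), float("nan")),
    "gene4": (1.2, 0.01, None),
}

reasons = {gene: na_reason(*values) for gene, values in rows.items()}
print(reasons)
```

If most of the NA genes fall in one category, that points to the specific DESeq2 step (outlier handling or filtering) worth comparing against edgeR's own filtering.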

It's important to note that the comparison is unbalanced: one group has four times as many samples as the other, although both groups have more than 30.

I would appreciate any suggestions about what I should do next.

Best

log-ratio edgeR GLM DESeq2


score 2 · Answer 1 · 2024-09-19


8 months ago

bk11 ★ 3.1k

Please check this earlier post, where the same issue has been discussed in detail: "Hugely different results between edgeR and DESeq2".
