Question

Deseq same padj values for a lot of genes but with different p-values

2

Entering edit mode

3.5 years ago

yanyanwu ▴ 20

I used DEseq to find differentially expressed genes between two samples, for each sample i have 3 replicates, for the DEseq result, i got exactly same padj value for a lot of genes but their p-values are different as attached, is this normal ? enter image description here

Deseq RNA-seq • 3.1k views

ADD COMMENT • link updated 2.9 years ago by jared.andrews07 ★ 18k • written 3.5 years ago by yanyanwu ▴ 20

score 4 · Answer 1 · 2021-05-13

4

Entering edit mode

3.5 years ago

jared.andrews07 ★ 18k

To expand on German's answer, this is due to how adjust p-values are calculated with the default (Benjamini-Hochberg) adjustment procedure in R. In short, the p-values are ranked from smallest to largest, and those ranks become part of the calculation.

This Stats SE post explains it nicely, but in short, each unadjusted p-value is multiplied by the number of tests and then divided by its rank order. When p-values are particularly close to each other, this can lead to a more lowly ranked unadjusted p-value ending up with a smaller adjusted p-value than the one before it. In these cases where the resulting sequence is non-decreasing, the preceding p-value is changed to the subsequent one such that they are the same. This is what you're observing in your results, and it's normal.

ADD COMMENT • link 3.5 years ago by jared.andrews07 ★ 18k

1

Entering edit mode

if you want the really short, terribly oversimplified way I think of this, it's, "FDR is rank based, which introduces quantization"

ADD REPLY • link 3.5 years ago by glocke01 ▴ 190

0

Entering edit mode

In this case, should I still use the adjusted p-values, or I need to switch to the p-values for the filtering instead?

ADD REPLY • link 2.9 years ago by Libin • 0

1

Entering edit mode

The adjusted p-values are still perfectly valid, and I would not recommend filtering on un-adjusted p-values for DE analyses.

ADD REPLY • link 2.9 years ago by jared.andrews07 ★ 18k

score 3 · Answer 2 · 2021-05-13

3

Entering edit mode

3.5 years ago

German.M.Demidov ★ 2.9k

Yup, this is fine. FDR shows the expected proportion of false discoveries at this threshold and sometimes for different p-values the expected FP proportion is the same for different p-values.

ADD COMMENT • link 3.5 years ago by German.M.Demidov ★ 2.9k