Why we use adjusted p-value not p-value in differential gene expression analysis ?
1
0
Entering edit mode
2.1 years ago
Amr ▴ 180

Why we use adjusted p-value not p-value in differential gene expression analysis?

p-value adjusted-p-value DGE DESeq2 • 3.9k views
ADD COMMENT
1
Entering edit mode
2.1 years ago
Asaf 10k

Because we need to correct for multiple hypotheses. If you ask a lot of questions, about 5% of them will pass the 0.05 p-value threshold just by chance.

ADD COMMENT
1
Entering edit mode

One important detail is also that some adjusted p-values are not even p-values in the original sense.

For example, an FDR may also be called an adjusted p-value, but it is a different concept altogether.

ADD REPLY
0
Entering edit mode

@Istvan Thanks for your useful comment

ADD REPLY
0
Entering edit mode

Do you want to say that the adjusted p-value margin is bigger so it allows more genes to be under the hypothesis of 0.05? I mean for example, 100 genes are statistically significant less than 0.05 but for adjusted p-value calculations with Benjamini- Hochberg there will be more than 100 genes statistically significant with the same threshold 0.05? If you don't mean what I mentioned or if I am wrong, then please correct me...Thanks

ADD REPLY
1
Entering edit mode

Adjusted p-values will always be equal or larger than original p-values, so the situation is opposite to what you wrote: there will probably be less than 100 significant genes left.

ADD REPLY
1
Entering edit mode

Adjusted p-values will always be equal or larger than original p-values

that is not quite right - an FDR type of adjustment could make some p-values smaller - though I do also argue in this thread that calling an FDR a p-value is a bit of a misnomer.

But I want to make another point. If we wanted to be rigorous, there is no such thing as an adjusted p-value in the first place - what we adjust is the significance level,

For example, if the significance level is 0.05 before adjusting, that level becomes 0.005 after adjustment. So the correction should be applied to the significance levels, not the p-values.

But to make things "more clear", in typical data analysis, we end up multiplying the p-value with a factor and still using 0.05 as the significance - but to my understanding, that can be misleading when we start to interpret p-values in different contexts.

ADD REPLY
0
Entering edit mode

@liorglic & @Asaf Thanks a lot for clarification.

ADD REPLY

Login before adding your answer.

Traffic: 1477 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6