Question

p-values in GSEA

0

Entering edit mode

7.5 years ago

vivien.wee16 • 0

Hi,

When I was performing my analysis with GSEA, I noticed that the p-value changed (by ~0.005) when I changed the order of genes in my gene set. I know that the order of genes in the gene set doesn't matter but does anyone have any idea about the reason to the fluctuation of p-value?

Thanks!

GSEA • 4.4k views

ADD COMMENT • link updated 6.7 years ago by Biostar 20 • written 7.5 years ago by vivien.wee16 • 0

score 4 · Accepted Answer · 2017-07-14

4

Entering edit mode

7.5 years ago

Santosh Anand 5.8k

Are you using the same random seed?

Why are my results different from yours when I analyze the example datasets using GSEA?

You are using a different random number generator (for sample permutation) and different seeds for that random number generator, so the resulting numbers are different. However, these differences should be VERY SMALL and the IDENTITY of the top (up or down) gene sets should be pretty much the same. The FDRs might be at most a few percent different from run to run. To get exactly the same result from run to run, specify the random number seed (its a parameter in the gsea software).

ADD COMMENT • link 7.5 years ago by Santosh Anand 5.8k

1

Entering edit mode

Just a clarification: the p-values did not change because of the order of genes, but because of a different random number seed. Try running the same analysis on the same set without changing the order, and the p-values will be different again.

ADD REPLY • link 7.5 years ago by Giovanni M Dall'Olio 28k

0

Entering edit mode

Moderator note: I've moved this from a comment to a reply, so it can be voted as a proper answer.

ADD REPLY • link 7.5 years ago by Giovanni M Dall'Olio 28k

0

Entering edit mode

Got it! The random number seed was set as default instead of a specific integer. Thank you!

ADD REPLY • link 7.5 years ago by vivien.wee16 • 0