GSEA Analysis: rank genes by fold change, t-statistic, U-statistic or KS-statistic?
0
0
Entering edit mode
8.4 years ago
Juan Cordero ▴ 140

Dear community,

in order to find significant enrichment in certain gene sets for my microarray data (2 conditions), I have tried out with different methods to calculate the gene scores, yielding different results in the number of enriched gene sets. In this way, the methods with the decreasing number of enriched gene sets (q-value < 0.05) are, for KEGG pathways: KS (81) > U-statistic (32) > FC (14) > t-statistic (0). The U-statistic 32 gene sets include the FC 14 gene sets, and the KS-statistics 81 gene sets include in turn the U-statistics ones. The intersection of KS, U and FC results equals FC. It might be possible KS-statistics yields many false positives.

I would be very very grateful if somebody could give me a piece of advice on this, since I do not know which method should be the most appropiate.

I am using gage package in R by the way.

Thanks

microarrays gsea ranking • 3.8k views
ADD COMMENT
2
Entering edit mode

Different scores do different things. Try to understand how they relate to the question you're interested in e.g. if you're most interested in effect size then the fold change is what you should use but if you're more interested in statistical significance then look for one of the statistics taking into consideration the assumptions they make e.g. t-test assumes normally distributed data whereas KS doesn't.

ADD REPLY

Login before adding your answer.

Traffic: 2153 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6