I'm interested in constructing a gene module of genes that are co-expressed with gene A. To define this module, I calculate the correlation of gene A with all other genes, producing a vector of correlation coefficients:
R = {ra, rb, rc, ...}
I think this module should be enriched in a particular pathway, which has the following set of genes:
G = {g1, g2, ..., gn}
When I test this hypothesis with GSEA, it is very significant. However, there are too many "leading edge" genes. What I would like to do is produce a list of genes that are statistically enriched in this module, above a certain level of confidence.
Is there a way to do this?