Is there a library in Python for doing gene set enrichment analysis (GSEA, http://www.broadinstitute.org/gsea/index.jsp)? A library/tool that takes a foreground set of genes, a background set, a gene sets database like the one available from Broad and does the analysis without relying on microarray-specific items like probe IDs? All the tools I found are made for microarrays but I want to do this for rna-seq. If there's no library in Python, is there a command line tool that can do this again without relying on probe IDs?
which existing tools would you use, apart from GSEA?
Just set some cutoffs and run hypergeometric tests by computing overlaps using python sets and calculating significance with scipy.stats.hypergeom and statsmodels.sandbox.stats.multicomp