I've been using fgsea to run some enrichment analysis and one of the outputs from fgsea()
is nMoreExtreme.
I've seen from the vignette that the nMoreExtreme is:
a number of times a random gene set had a more extreme enrichment score value
I was wondering what this actually means?
Like what is a random gene set? Is it just a random selection of genes? But then would nMoreExtreme be different every time fgsea is run?
Is it another measure of how reliable the enrichment analysis is?
I am guessing that a value of 0 or 1 is good? And a value of a few thousand is bad?
Could I use it alongside a padj value to determine if the results are significant or is it not as important as padj?
Thanks in advance.
Hello, can I please ask what do you mean by no
nperm
at all in latest version offgsea
? I just installed this package in R today and got different results when runningfgsea(pathways=samplepathway, stats=ranks, minSize = 15, maxSize = 500, nperm=1000, nproc=1)
vsfgsea(pathways=samplepathway, stats=ranks, minSize = 15, maxSize = 500, nproc=1)
.Please do not use the answer field unless you have an answer to the toplevel question.
?fgsea
helps. The recent versions have a functionfgsea
which wraps bothfgseaSimple
andfgseaMultilevel
. The former is the "traditional" and computationally expensive method based on permutations. The latter is a more efficient method based on an "adaptive multilevel splitting Monte Carlo approach" (quoted fromfgseaMultilevel
) which does not require permutations (from what I understand). The fgsea preprint at biorXiv explains details, and for specific question we fortunately have alserg (the maintainer of fgsea) here for expertise.In the recent version
fgsea
has switched an algorithm that doesn't havenperm
parameter. For compatibility reasons currentlyfgsea
called withnperm
parameter executes the old version, and without -- the new one. That's why you see a big difference in results.