Suppose I have an RNA-seq experiment in which I have 5000 deferentially expressed genes. In which around 120 are cancer causing genes or genes that have been found to be related with cancer (using information from Cosmic. Would it be a valid approach to use the 5000 genes as a background and do GSEA on 120 genes. If not why?
I think that it would be valid