Entering edit mode
5.4 years ago
kdc15
▴
40
I am trying to run GSEA on a list of differentially expressed genes (control vs. treated) however, I would like to perform this against a list of genes that I provide. I have converted my gene list to GRP file format, and I have also run the list through HUGO and filtered it to make sure that they are approved symbols. However, when I run the analysis, I immediately get the error: Could not find feat index for: -1 SELENOH
Everytime I run this, it just changes the gene name that it cannot find an index for.
Double check that you have the correct format for GRP by looking HERE.
If not GRP, you can also specify your gene sets with GMT format, as I show here: A: Running GSEA for DEGs
Thank you for your response. I have double checked the GRP format. It seems to be in order (one gene per line, HUGO approved etc.). Will the GMT format be applicable as I only have one gene set?
Yes, it should work for just 1 gene set, too.
Even when I format to GMT, I still get the same error
Please paste a sample of your data. Check for hidden encoding, i.e., if you have transferred your files between a Windows- and Linux-based OS.
according to the GMT requirements, I organised my genes across like this
my dataset in the .gct format is organised as follows:
I don't have my own original files with me. However, one important feature is that those should definitively be tabs between each column. If they are spaces, it won't work (I believe). Between the 2 numbers on your second line, that may have to be just a single space - not sure. The format spec is very specific, though.
Ping this thread again if that doesn't solve it, and I will pick it up later tonight when I am home.
I had saved the file as tab-delimited and then added the .gct extension. Is it not sufficient to do it this way?