I've come across many posts about common errors using GAGE, and many of these common pitfalls relate to mismatching ID systems (Entrez gene ID, gene symbol, etc). I've read the "Gene set and data preparation" vignette, but still get errors when I try to convert my gene symbols to Entrez IDs.
I have two questions:
- Is there a way to map more "efficiently" gene symbols to Entrez IDs? For example, of 38720 unique input IDs, 8850 of my genes remain unmapped. I am using the mouse data set, trying to map gene symbols in my featureCounts output.
- What does it really mean when I fail to download xml/png files for my GAGE analysis? I get errors like:
Info: Downloading xml files for hsammu04060, 1/1 pathways..
Warning: Download of hsammu04060 xml file failed!
This pathway may not exist!
Thanks in advance
I don't know if it is the cause of all your problems, but you should be using
species = "mmu"
on your pathview() call.Thanks! That solved the errors. I am still unable to completely map all the gene symbols, do you have any suggestions?
No, I do not have any (easy) suggestions. In fact, the situation is probably worst, if you use
org.Mm.eg.db
and do:you will probably find a "1:many mapping", indicating some gene names have multiple IDs. See here and here for discussions and suggestions.