Hello
I have performed a differential expression analysis with DESeq2 in R. I obtained a list of differentially expressed genes. These genes are identified by ensembl gene id (ENSG00....).
I want to use these results for gene-set enrichment analysis and pathway analysis in R. However, almost all of the packages I encountered to perform such an analysis require the genes to have entrez gene id. I know that you can use biomaRt to construct a table to convert all these identifiers to each other. I observed that there is no unique mapping between ensembl and entrez id. Some ensembl is map to several entrez id and vice versa and that some ensembl id map to no entrez id.
That made me wonder if it "safe" to convert gene sets with genes with entrez id to gene sets with genes with ensembl id.
Other suggestions to fix this are also welcome. I already tried filtering out all genes with no entrez id and that have a duplicate ensembl or entrez id before the differential expression, but then I lost some differentially expressed genes.
Thanks in advance.
Tim
Have you tried WebGestaltR? I know WebGestalt (the online version) works well with both Entrez and Ensembl IDs, so it makes sense that the R package would as well.
Disclaimer: I previously worked for the team that developed Webgestalt.