GO and KEGG enrichment analysis for non-model organism
1
0
Entering edit mode
2.1 years ago

I have obtained a list of genes after differential gene expression analysis with DESeq2. I am working on the genome of a non-model fungal organism. I also have the GO terms associated with these DEGs from interpro analysis. But for carrying out GO and KEGG enrichment analysis, standard database IDs like Ensembl IDs are needed. Whereas, in my list, the gene IDs are g3041, g2134... as given out by the gene prediction tools. Can someone please explain how I can use my gene IDs to carry out the GO and KEGG enrichment analysis steps?

Thank you.

Enrichment-analysis GO KEGG Fungus DESeq2 • 2.1k views
ADD COMMENT
2
Entering edit mode

You need to use a tool like eggNOG to associate your genes with KEGG orthologs, then perform GO enrichment on the ortholog set using a tool like topGO.

ADD REPLY
3
Entering edit mode
2.1 years ago
h.mon 35k

You don't necessarily needs Ensembl identifiers. You need to create an object with the gene to GO / KEGG mapping. Different packages will use different structures. You can create the file outside of R and read it, or you can create this mapping directly in R.

As several packages can read files in the gmt format (e.g., read.gmt() from clusterProfiler, or GSA.read.gmt()from GSA, and so on), what I usually do is create a gmt file from the annotation, then read this gmt to perform the gene set enrichment.

ADD COMMENT

Login before adding your answer.

Traffic: 1693 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6