Entering edit mode
4.6 years ago
tianshenbio
▴
180
I am doing KEGG enrichment analysis for a non-model organism. First I annotate the genome using KAAS and I got ko numbers associated with each gene (kegg2gene
)
K11968 BANY.1.2.t00001
K04834 BANY.1.2.t00003
K13273 BANY.1.2.t00007
K04497 BANY.1.2.t00010
K09210 BANY.1.2.t00011
K10360 BANY.1.2.t00012
K10360 BANY.1.2.t00013
Then I have a list of DE genes (gene
) and performed KEGG enrichment using clusterprofiler:
x <- enricher(gene,TERM2GENE=kegg2gene,pvalueCutoff = 0.05, pAdjustMethod = "BH",qvalueCutoff = 0.05)
Then I got a list of enriched ko numbers, then how do I translate the ko numbers to pathway names/descriptions?
I know I can add TERM2NAME=kegg2name
, but how do I get a list of all kegg numbers vs names?
https://www.genome.jp/kegg/ko.html
Paste them into the box on that page and translate to pathways.
Hi, Since I hope to do pathway enrichment analysis, I need to get a list of genes vs pathways. Now I have genes vs K numbers so yes, I need to translate K numbers to pathways. But I can't download a list of K number vs pathways from the mapper page u shared....
You don't need to translate them to use clusterProfiler, you can use KOs directly. set organism = "ko"
https://github.com/YuLab-SMU/clusterProfiler/issues/99
Hi NRC
I read that post from github but only
enricherKEGG
has theorganism
option right? As I mentioned I used theenricher
function notenrichKEGG
since I am using a customized dataset. You can also have a look at the last comment from the link you shared, that's written by me.I am not sure about enricherKEGG, I am only familiar with enrichKEGG. You can use a list of KEGG IDs directly, e.g. K00011, K00678, etc. You set organisms="ko" and that's it, this definitely works for non-model organisms because all you need is a list of KO values
Hi NRC Thank you for your reply. Sorry, I meant
enrichKEGG
. First of all, I have a list of all kegg2gene for my species:kegg2gene
Then I have a list of DE genes:
gene
How do I perform the kegg enrichment analysis?
I tried this:
and I got an error message:
Match your list of DE genes to their KO value, then try again.
x <- enrichKEGG(gene, organism='ko', keyType='kegg', universe = kegg2gene, pSjustMethod="BH")
Hi, I am not sure if I understood correctly. I tried to map my DE genes with K numbers and used the K number only, now
gene
(a string of characters) is like:Also, only the K numbers in kegg2gene are used, now kegg2gene looks like:
This gave me
hmm what if you take out the p adjusted cutoff?
I tried to adjust p and minGSSize but did not work...I have 300 terms in gene and 9000 terms in kegg2go, should be enough for the analysis...Anyway, thank you for your help!NRC
Hi! How you solved your problem using enrichKEGG( ) function? I have the same error message:
ca_list is my list of DE gene ID's and BBRB_KEGG is a dataframe of two columns with gene ID's and KEGG annotations that I get with Trinotate.
If you can help me I'll really apreciate!