Dear all, I have list of KEGG IDs for metabolites (compounds) and I want to get the name of the all the compounds from that list. So please suggest me what should I do??? Thanks
Dear all, I have list of KEGG IDs for metabolites (compounds) and I want to get the name of the all the compounds from that list. So please suggest me what should I do??? Thanks
I hope this helps,
1.make a file 'sample.list' with the list of compound IDs you have,
cat sample.list
C00001
C00002
C00003
C00004
C00005
C00006
C00007
2.And then run the following CURL command below.
curl -g -s -S http://rest.kegg.jp/list/compound | grep -f sample.list | sed "s/cpd\://" > sample_extracted.table.txt
SAMPLE OUTPUT:
C00001 H2O; Water C00002 ATP; Adenosine 5'-triphosphate C00003 NAD+; NAD; Nicotinamide adenine dinucleotide; DPN; Diphosphopyridine nucleotide; Nadide; beta-NAD+ C00004 NADH; DPNH; Reduced nicotinamide adenine dinucleotide C00005 NADPH; TPNH; Reduced nicotinamide adenine dinucleotide phosphate C00006 NADP+; NADP; Nicotinamide adenine dinucleotide phosphate; beta-Nicotinamide adenine dinucleotide phosphate; TPN; Triphosphopyridine nucleotide; beta-NADP+ C00007 Oxygen; O2
I assume, I understand your question properly.
You can either search for them in the KEGG database website, or use something more automated like the KEGG REST API (in this specific case you need the "get" method).
Suppose you are interested in the following three compound IDs:
Simply go to this webpage (have a look at how the URL has been constructed): http://rest.kegg.jp/get/C00259+C00140+C00818
Dear Mgalactus, thank you for your reply. The method you suggested, gives the details of the compounds in query. but I have a list of more than 2000 metabolites IDs. In that case its very difficult to get the name and details. So is there any way to get the result output in tabular format like as in *.csv format.
Thanks in advance
like this? http://rest.kegg.jp/list/compound
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
yes something like that but for my list of IDs. Thanks
Sorry, but what you are asking for are 10 lines of code in python(maximum) and even less in R
It is more helpful to provide actual code rather than a statement about how small the needed code is.
Can you please post your sample input ?