How can I identify the genes responsible for producing specific products in KEGG pathways?
For instance, in the Tryptophan metabolism pathway, tryptophan [C00078] is converted to indole [C00463] by the enzyme 4.1.99.1, encoded by the gene tnaA (K01667).
My question is: Is it possible to determine, across all KEGG pathways, the products generated by the action of specific genes, as seen in the case of the tnaA gene in Tryptophan metabolism? If so, how can this be achieved using R or python?
not sure if there are any of the shelf tools that will do this but those numbers correspond (or are) EC-numbers. Those you can look up in the EC database which will give you a name (or rather catalytic function of an enzyme). That specific reaction type enzyme you can then look for in your species of interest.
With a bit of programming you likely can whip something together to achieve this automatically.
note: it will not be as easy as stated above for all genes/reactions as there are still many blanks for a lot of species.
Not R or Python but using the KEGG REST API you can do something like this. You question also goes back and forth about gene names and EC numbers etc.
Note: KEGG requires a license for extensive use so this may not work for a large number of queries. Please respect KEGG's acceptable use policy.
Get the reactions associated with the EC number
Then use the reaction number to identify the products (output truncated for display here)