I ran KofamScan and the results look fine. I'm now trying to go from KEGG orthologs (KOs) to KEGG modules, but I'm seeing that certain orthologs don't belong to any module, and I'm trying to figure out how KEGG is structured. For example, take this KEGG ortholog: K16148.
I see it is assigned to 2 KEGG pathways, but why isn't this enzyme associated with any KEGG modules?
When I click on one of the pathways, ko00500, I see there are 3 modules in this pathway (M00854, M00855, and M00565), but why don't they include K16148?
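One way to check this directly is to pull the KO identifiers out of a module's definition string and test for membership. The snippet below is only a rough sketch: the function name `kos_in_definition` is hypothetical, and `toy_definition` is an invented string written in KEGG's definition notation (space for consecutive steps, comma for alternatives, plus for complex subunits, minus for optional components). It is not the real definition of any module.

```python
import re

# Matches KO identifiers such as K16148 (a "K" followed by five digits).
KO_PATTERN = re.compile(r"K\d{5}")


def kos_in_definition(definition):
    """Return the set of KO identifiers mentioned in a module definition string."""
    return set(KO_PATTERN.findall(definition))


# Invented example in KEGG's notation; NOT a real module definition.
toy_definition = "(K00001,K00002) K00003+K00004 -K00005"

toy_kos = kos_in_definition(toy_definition)
has_k16148 = "K16148" in toy_kos
```

With the real definition strings of M00854, M00855, and M00565 in place of the toy string, the same membership test would show whether K16148 is part of any of them.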
I just need a consistent way to group KEGG orthologs into higher-level categories.
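To make concrete what I mean by grouping, here is a minimal sketch that walks the ko00001 hierarchy and records every category path a KO sits under. It assumes the JSON is a nested tree of `{"name": ..., "children": [...]}` nodes whose leaves start with a KO identifier; that layout is my assumption about the download, not something I have verified here. The function name `ko_to_categories` and the tiny `toy_tree` are made up for illustration, and the KO description is deliberately left out.

```python
import re
from collections import defaultdict

# A leaf is assumed to start with a KO identifier, e.g. "K16148  ...".
KO_LEAF = re.compile(r"^(K\d{5})\b")


def ko_to_categories(node, ancestors=()):
    """Map each KO in a nested name/children tree to the list of its ancestor paths."""
    mapping = defaultdict(list)
    match = KO_LEAF.match(node.get("name", ""))
    if match and not node.get("children"):
        # Leaf node: record the path of category names leading to it.
        mapping[match.group(1)].append(ancestors)
        return mapping
    for child in node.get("children", []):
        child_map = ko_to_categories(child, ancestors + (node["name"],))
        for ko, paths in child_map.items():
            mapping[ko].extend(paths)
    return mapping


# Toy tree in the assumed layout; a real file would be loaded with json.load().
toy_tree = {
    "name": "ko00001",
    "children": [{
        "name": "09100 Metabolism",
        "children": [{
            "name": "09101 Carbohydrate metabolism",
            "children": [{
                "name": "00500 Starch and sucrose metabolism [PATH:ko00500]",
                "children": [{"name": "K16148  (description omitted)"}],
            }],
        }],
    }],
}

categories = ko_to_categories(toy_tree)
```

Because a KO can appear under several pathways, each KO maps to a list of paths rather than a single category, so any of the intermediate levels can serve as the grouping.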
I made a Venn diagram of the following:
- The JSON file from here https://www.genome.jp/kegg-bin/get_htext?ko00001.keg (Accessed 2021.06.22)
- The KEGG ortholog hits I got from KofamScan using the most recent database
- The Biopython implementation from this question: "How to get a mapping between KEGG module and KEGG orthologs?" (Accessed 2021.06.22)
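The comparison behind the Venn diagram can be sketched with plain Python sets. This is only an illustration: the helper name `venn_regions` is hypothetical, and the three toy sets contain placeholder identifiers, not my actual results.

```python
def venn_regions(sets_by_name):
    """Return {frozenset of set names: items found in exactly those named sets}."""
    regions = {}
    all_items = set().union(*sets_by_name.values())
    for item in all_items:
        # Names of every input set that contains this item.
        members = frozenset(
            name for name, items in sets_by_name.items() if item in items
        )
        regions.setdefault(members, set()).add(item)
    return regions


# Placeholder data standing in for the three KO sources listed above.
toy_sets = {
    "brite": {"K00001", "K00002", "K16148"},
    "kofamscan": {"K00002", "K16148", "K99999"},
    "modules": {"K00002"},
}

regions = venn_regions(toy_sets)
no_module = regions[frozenset({"brite", "kofamscan"})]
```

Each key is the exact combination of sources an identifier appears in, so the KOs that have hits and a BRITE entry but no module end up in the `{"brite", "kofamscan"}` region.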
The simplest explanation is that K16148 is not involved in M00854 (Glycogen biosynthesis), M00855 (Glycogen degradation), or M00565 (Trehalose biosynthesis). From KEGG: