Kegg Id Vs Cog Id, And The Best Method For Large Batch Id Assignment?
1
0
Entering edit mode
11.5 years ago
JacobS ▴ 990

I have a large RNA-Seq dataset with reads of about 200bp long. I've already used other tools to annotate these reads with gi numbers, but now want to associate them with KEGG/COG IDs for pathway analysis. Can someone please help me understand the difference (and different uses) for KEGG and COG IDs and what is the best way for large batch (on the scale of millions) annotation?

Thanks!

kegg annotation pathway • 11k views
ADD COMMENT
4
Entering edit mode
11.5 years ago
Neilfws 49k

Not a complete answer to your questions, but with regard to understanding the differences:

COG was a NCBI project to classify proteins from sequenced genomes. It is no longer maintained and you should probably not use it. If you need to know more:

KEGG is an altogether larger, actively-maintained project. You might think of it as an attempt to create a systems biology database. They use their own annotation and clustering pipeline to assign IDs, called the KEGG Orthology system. Here's a key KEGG publication.

Many software tools have been built around KEGG, for example in R/Bioconductor.

ADD COMMENT
0
Entering edit mode

Great, very helpful post!

ADD REPLY
0
Entering edit mode

@ Neilfws, I see this post is 2-year-old. I was wondering could you update with new perspectives. I was going through KEGG Vs COG, I am finding this publication PMID-25428365. So now, is it better to use KEGG or COG? Thanks.

ADD REPLY
0
Entering edit mode

Just to add an update for anyone who happens to find this topic, there was a COG update (2021):

Galperin, Michael Y., Yuri I. Wolf, Kira S. Makarova, Roberto Vera Alvarez, David Landsman, and Eugene V. Koonin. "COG database update: focus on microbial diversity, model organisms, and widespread pathogens." Nucleic Acids Research 49, no. D1 (2021): D274-D281.

ADD REPLY

Login before adding your answer.

Traffic: 2799 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6