Processing KAAS output (lists of KO numbers) into straightforward knowledge
2
1
Entering edit mode
8.8 years ago
antortjim ▴ 10

Do you know any resource or software (Bioconductor, Biopython, etc) that can be employed for summarising KAAS output?

While conducting a mock research following the Trinity-Transdecoder-KAAS pipeline based on the following paper. I have submitted the .pep file generated by Transdecoder to the Kegg automatic annotation service (KAAS) and this has generated a list of KO (Kegg Orthology) numbers. Whereas I find it useful, I still don't see the point of such raw data. I would like to have it further summarised, for example, it would be great if I could get the number of pathways involved in each category (20 metabolism pathways, 30 genetics pathways and so on) or something like that. Really, what I would like is to perform a gene set analysis with this list of KO numbers and expect to see categorization. Correctly organized, this data could be plotted (pie plots for instance) in order to gain an insight into the kind of pathways and biological entities present in the sample.

I have made some research on biostars, and similar questions like this, this, this or this have been asked, but they either fail to answer to this question, or the proposed answer does not work anymore. I have also searched the Kegg website and I haven't found anything better than the KEGG Reconstruct Pathway, which renders a count of pathways within each subcategory that nevertheless has to be painstakingly parsed from html.

To sum up, I want to go from a huge KO numbers list to general statistics and counts. How can I get done? Thanks in advance.

kegg kaas • 4.3k views
ADD COMMENT
0
Entering edit mode

It's not KO, it's K0 K1 etc.

ADD REPLY
0
Entering edit mode
8.8 years ago
Naren ▴ 1000

Put those K numbers into the input box at bottom of the KEGG orthology search page . Try with less quantity first.

ADD COMMENT
0
Entering edit mode

Indeed, it won't work in my computer if I input a list ok KEGG Orthology numbers bigger than 100. However I tried with less quantity and it did work, though none returned what I wanted. The one that came closest was the Map pathway option, which returns the name of every pathway along with the number of hits found on each. I would like to have the number of hits in pathways categories (i.e Metabolism, Genetic Information, Diseases, etc). Would that be possible? Thanks

ADD REPLY
0
Entering edit mode

Yeah you can map those to catagories from this reference file

Available at KEGG Orhology website.

ADD REPLY
0
Entering edit mode
3.8 years ago
Arsenal ▴ 160

I have parsed two main files from kegg (one from KO IDs and the other from pathway IDs) into categorized tables. I then use these tables to map my kaas output into the kegg pathway levels/categories. I am thinking in writing a simple script to help this kind of process you mentioned, which I do often.

ADD COMMENT

Login before adding your answer.

Traffic: 2368 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6