NOOB education needed for microbiome functional analysis (specifically PICRUSt2).

0

Entering edit mode

5.5 years ago

ariel ▴ 250

I give picrust2 a list of 16s ASVs that map to biota at various taxanomic levels. Picrust2 creates a table of inferred per-sample abundances of KEGG genome orthologs (KO) and one of KEGG enzyme classifications (EC). The main output is a list of per-sample inferred pathway abundances that seems to take both of these into account.

Could I have a basic description of how this works and why? I did RTFM, but I'm inexperienced with pathway analysis and totally green in the context of the microbiome. So, while I can recite the steps picrust2 uses to go from ASVs to pathways, I humbly admit that I don't really understand what I'm doing or what this information ultimately means.

microbiome 16s functional-analysis picrust • 3.3k views

ADD COMMENT • link 5.5 years ago by ariel ▴ 250

1

Entering edit mode

Which part of the process are you having problem with? Taxa -> KO or KO -> pathways?

ADD REPLY • link 5.5 years ago by Asaf 10k

0

Entering edit mode

Both, I think. Going from Taxa -> KO, am I ending up with all possible gene orthologs in that specific organism, or is it the orthologs represented across all the samples in the pool? Maybe that is the difference between "stratified" and "unstratified?" Then, is it just KO -> pathway, or is it KO + EC -> pathway. Otherwise, what is EC for?

ADD REPLY • link 5.5 years ago by ariel ▴ 250

0

Entering edit mode

KEGG pathways contain more than enzymes and are built using KOs. I guess EC numbers are there for you if you want to use them but not used to construct pathways. Although EC<->KO is pretty tight and should lead to same pathways.

ADD REPLY • link 5.5 years ago by Asaf 10k

1

Entering edit mode

After reading more docs, it turns out the picrust2 ONLY uses the EC numbers and gets the pathways from the MetaCyt database. It gives me the KO numbers, but does not match them with KEGG pathways.