Gene abundance profile from metagenomics data
7.6 years ago
932477002 • 0

Hi all,

I am working with a metagenome dataset.

After gene prediction, I created a catalogue of all genes found in the dataset, and now I want to extract gene profiles (i.e. a list of genes with their relative abundances).

Do you know which tools can be used to generate such a gene abundance profile?

I would appreciate your kind help!

gene • 3.0k views
7.6 years ago
Asaf 10k

I'm not aware of such a tool. You could try classifying your genes using InterPro domains, MetaCyc/KEGG orthology groups, etc. Depending on your sequencing depth and sample complexity, I would consider reference-based counting (i.e. mapping the reads to a reference using blastx) instead of the assembly-based approach you took.

6.4 years ago

There are two approaches.


  1. I assume you want gene abundances irrespective of organism. In that case you can cluster all genes with CD-HIT or a similar tool, then take the representative from each cluster to find KO or COG IDs/descriptions.
  2. Alternatively, use UProC to classify your genes and get KO IDs. Then you can simply consolidate your results (counts per KO).
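
The consolidation step in option 2 could be sketched roughly as below. This assumes the gene-to-KO assignments have already been parsed into a plain dictionary; the function name `ko_relative_abundance`, the optional `gene_counts` argument, and the example gene/KO IDs are all hypothetical, and nothing here depends on UProC's actual output format.

```python
from collections import Counter

def ko_relative_abundance(gene_to_ko, gene_counts=None):
    """Sum genes (or per-gene counts) per KO and normalise to fractions.

    gene_to_ko  : dict mapping a gene ID to its assigned KO ID (assumed input)
    gene_counts : optional dict mapping a gene ID to a count, e.g. mapped reads;
                  if omitted, every gene contributes a weight of 1
    """
    totals = Counter()
    for gene, ko in gene_to_ko.items():
        weight = 1 if gene_counts is None else gene_counts.get(gene, 0)
        totals[ko] += weight

    grand_total = sum(totals.values())
    if grand_total == 0:
        return {}
    # Fraction of the total weight that falls on each KO
    return {ko: n / grand_total for ko, n in totals.items()}

# Hypothetical assignments, for illustration only
assignments = {"gene_1": "K00001", "gene_2": "K00001", "gene_3": "K02588"}
profile = ko_relative_abundance(assignments)
```

Passing per-gene read counts as `gene_counts` would weight each KO by how often its genes were observed rather than by how many distinct genes carry it.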

Besides these approaches, you can assemble your metagenome and map the query genes to the resulting contigs, then multiply the number of hits by the sequencing depth. This is not an accurate method; it gives relative abundances only.
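
As a rough sketch of that calculation, assuming "depth" is read as the average coverage of each contig and that hits per gene per contig have already been counted: the function name, the input dictionaries, and the example values are all hypothetical.

```python
def relative_abundance(hits_per_contig, contig_depth):
    """Weight each gene's hits by contig depth, then normalise to sum to 1.

    hits_per_contig : dict gene ID -> dict contig ID -> number of hits (assumed input)
    contig_depth    : dict contig ID -> average read depth of that contig (assumed input)
    """
    weighted = {}
    for gene, contig_hits in hits_per_contig.items():
        # hits on a contig multiplied by that contig's depth, summed over contigs
        weighted[gene] = sum(
            n * contig_depth.get(contig, 0.0) for contig, n in contig_hits.items()
        )

    total = sum(weighted.values())
    if total == 0:
        return {gene: 0.0 for gene in weighted}
    return {gene: w / total for gene, w in weighted.items()}

# Hypothetical example values
hits = {"geneA": {"contig_1": 2, "contig_2": 1}, "geneB": {"contig_2": 1}}
depth = {"contig_1": 10.0, "contig_2": 5.0}
abund = relative_abundance(hits, depth)
```

The values are only meaningful relative to each other within one sample, which matches the caveat above that this gives relative abundances only.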
