Question

Taxonomic analysis of metagenomic data

1

Entering edit mode

7.2 years ago

bird77 ▴ 80

How can I quickly calculate the taxonomic distribution of a metagenome (assembled or unassembled data)?

sequence • 2.0k views

ADD COMMENT • link updated 4.9 years ago by Shicheng Guo ★ 9.5k • written 7.2 years ago by bird77 ▴ 80

score 2 · Answer 1 · 2017-09-28

2

Entering edit mode

7.2 years ago

Brian Bushnell 20k

You can do this very quickly (in a few seconds) with BBMap's Sketch tool, which will compare the data to RefSeq:

sendsketch.sh in=reads.fq reads=4m depth

or

sendsketch.sh in=contigs.fa

ADD COMMENT • link 7.2 years ago by Brian Bushnell 20k

score 1 · Answer 2 · 2017-09-28

1

Entering edit mode

7.2 years ago

Sej Modha 5.3k

There are a number of k-mer based taxonomic tools available: https://omictools.com/taxonomy-dependent2-category

ADD COMMENT • link 7.2 years ago by Sej Modha 5.3k

score 0 · Answer 3 · 2020-01-18

What do you mean by Taxonomic analysis? Do you mean taxonomic binning of the sequences or profiling of the sequences?

I wrote a tool in graduate school that uses k-mers and an optimization method for taxonomic profiling of a metagenome dataset in seconds. The tool is named FOCUS and you can learn more about it here. The page also teaches more details about binning and profiling in case you are interested,

score 0 · Answer 4 · 2020-01-19

0

Entering edit mode

4.9 years ago

Shicheng Guo ★ 9.5k

How about fastq_screen and Diamond?, I prefer Diamond with my experience.

fastq_screen : https://www.bioinformatics.babraham.ac.uk/projects/fastq_screen/_build/html/index.html

Diamond: https://github.com/bbuchfink/diamond and DIAMOND_analysis_counter.py (SAMSA2)

ADD COMMENT • link 4.8 years ago by Shicheng Guo ★ 9.5k

1

Entering edit mode

DIAMOND is simply an aligner. If you don't refer to a tool that does not have a database associated to it, it does no good. Maybe you can add some databases and tools which take the DIAMOND output such as MEGAN which takes the DIAMOND output when aligning against the NR/NT database and can give you back the taxonomic and functional analysis.

I talk here more about DIAMOND and Rapsearch2.

ADD REPLY • link 4.8 years ago by onestop_data ▴ 330