Tutorial:Tools For Metagenomic Data Analysis
18
126
Entering edit mode
12.0 years ago
vijay ★ 1.6k

Dear All,

I just thought it would be worthwhile posting this , as it would be beneficial for those who are stepping into field of metagenomics . Below are the compiled list of tools and links to a few of them, that can be used for primary and secondary analysis of metagenomic sequence data.(Though there are plenty missed out here..)

Please do add to this list, as the list will be ever growing...:)

Metagenome assembly

Gene calling

Microbial diversity Analysis

Binning

Composition based binning

Sequence similiarity based binning

Functional Annotation

Comparative Metagenomics

Mapping to reference genome

Online tools for NGS data analysis

  • parallel Meta see
  • Sort-Items
  • PANGEA

Commercial

  • CLC bio genomic workbench
  • ERA-7

Quality analysis

  • FastQC
  • Prinseq
metagenomics next-gen • 64k views
ADD COMMENT
2
Entering edit mode

It seems that many links did not work anymore. Maybe some updates will help. Thanks.

ADD REPLY
0
Entering edit mode

This an awesome list!

ADD REPLY
0
Entering edit mode

Galaxy is convenient

ADD REPLY
0
Entering edit mode

Hi,

I am working on Metagenomics data. I am trying to predict promoters after assembly. I would appreciate if someone can suggest any good tool.

Thanks
Ali

ADD REPLY
0
Entering edit mode

Can you share some mature analysis pipeline to solve the most common metagenomic questions?

ADD REPLY
0
Entering edit mode

Thanks Vijay, you shared several different metagenome assembly? which one is the most welcomed?

ADD REPLY
0
Entering edit mode

Hi Vijay, Some collection to toxic genes/transcript from all the metagenomics references?

ADD REPLY
7
Entering edit mode
10.2 years ago
Tim ▴ 130

Just to add my two pennies worth - pipelines for analysis of viromes:

VIROME: classification of predicted open-reading frames (ORFs) from viral metagenomes

METAVIR/METAVIR 2: tools for viral metagenome comparison and assembled virome analysis

Take a look at VMGAP and ProViDE as well.

ADD COMMENT
5
Entering edit mode
11.4 years ago
sarahhunter ▴ 600

May I add a plug for http://www.ebi.ac.uk/metagenomics? We have developed a pipeline for taxonomic and functional analysis of metagenomic samples and we also archive your raw sequence data in the SRA on your behalf. We're always looking for suggestions of how to improve our service, so please contact us with feedback. We will be launching new features on our website later next month and hope to publish the resource in NAR database issue this year.

ADD COMMENT
4
0
Entering edit mode

Is FANTOM alive? Their download and manual links aren't working for me

ADD REPLY
0
Entering edit mode

Yes, very much alive: http://fantom.gsc.riken.jp/

ADD REPLY
0
Entering edit mode

Looks like this is a different FANTOM. The first FANTOM is a functional and taxonomic analysis of metagenomes. In contrast, the other one is FANTOM (Functional ANnoTation Of the Mammalian genome) in the fields of transcriptome analysis. Please correct me if I was mistaken.

ADD REPLY
4
Entering edit mode
ADD COMMENT
2
Entering edit mode
10.1 years ago

SURPI: Pipeline for pathogen identification from complex metagenomic NGS data.

http://chiulab.ucsf.edu/surpi/

http://genome.cshlp.org/content/early/2014/05/16/gr.171934.113.full.pdf+html

ADD COMMENT
2
Entering edit mode
9.0 years ago
frcamacho ▴ 210

Cool new profiler tool for metagenomic data, ShortBRED.

ShortBRED is a pipeline to take a set of protein sequences, group them into families, extract a set of distinctive strings ("markers"), and then search for these markers in metagenomic data and determine the presence and abundance of the protein families of interest.

ADD COMMENT
2
Entering edit mode
ADD COMMENT
1
Entering edit mode
9.7 years ago

Hi,

I recently find these two software pretty useful :

Gene Calling:

prodigal: http://prodigal.ornl.gov/server.html

pathway and function prediction:

picrust: http://picrust.github.io/picrust/

ADD COMMENT
1
Entering edit mode
8.1 years ago
glihm ▴ 660

In the category "Microbial Diversity Analysis":

A very nice and frequently updated program in order to pick OTU from 16S rRNA gene amplicons sequencing: USEARCH (http://www.drive5.com/usearch/).

It is not user-friendly (command line), but the documentation is nice and the tools well adapter and flexible.

ADD COMMENT
1
Entering edit mode
6.8 years ago

Check this out: https://omictools.com/metagenomics-category

They manage a curated list of bioinformatics tools.

From their site: "OMICtools strives to accelerate research in bioinformatics, making tools accessible to everyone and offering a stimulating work environment to assist life scientists extracting new findings from the omics data."

ADD COMMENT
1
Entering edit mode
6.8 years ago
vmicrobio ▴ 290

Have a look on Mash Screen, I found it usefull and easy to use!

ADD COMMENT
1
Entering edit mode
6.6 years ago
hmurray ▴ 30

You can try gaia.sequentiabiotech.com for free

ADD COMMENT
1
Entering edit mode
6.5 years ago
ricardofull ▴ 10

16S Metagenomics:

in R

  • DADA2 to Filter, quality analysis, trimming, Merge Runs, Remove Chimeras, Assign Taxonomy Phyloseq to
  • Phyloseq to a deeper analysis of metagenomic data.
ADD COMMENT
1
Entering edit mode
6.4 years ago
apaytuvi ▴ 40

Gaia for analyzing both amplicon and shotgun metagenomics (including metatranscriptomics). Everything's done online.

ADD COMMENT
1
Entering edit mode

Please mention that your tool is commercial and not FOSS.

ADD REPLY
0
Entering edit mode

Great tool! I really recommend it.

ADD REPLY
1
Entering edit mode
6.3 years ago
f.a.galkin ▴ 40

I really liked using Centrifuge for WGS taxonomic profiling.

It is resource efficient (has a 6GB index for all bacteria, compared to Kraken's over 100GB) and produces an abundance table when it's finished.

ADD COMMENT
1
Entering edit mode
6.3 years ago
MetaTK ▴ 20

16S rRNA metagenomics:

I tried DocMind Analyst on the AWS cloud and found it very convenient and easy to use. It works with a graphical user interface and uses mothur and RDP for read processing and classification. Their tutorials are also very good for beginners.

ADD COMMENT
0
Entering edit mode

You seem to be advertising docmind analytics. I'd recommend you create a Tool type post and explain this tool, including its pricing strategy. The website is confusing on what is free and what their pricing system is. Please do not bulk-add answers recommending this tool.

ADD REPLY
0
Entering edit mode

Thanks for your comment, Ram. I am not affiliated with the company but I know the owner. So I guess I have a conflict of interest. However, I really like his idea and software, particularly for non-expert in computer science (I consider myself as such). That's why I am positive about it. I have forwarded your recommendation regarding a tool type post and publishing a clear pricing policy.

ADD REPLY
0
Entering edit mode

Thank you for doing that, it's the right way to go about it! I look forward to learning more about this tool.

ADD REPLY
0
Entering edit mode

lots of resources to look for ..so a simple question this should work both whole genome as well as 16s RNA meta-genomic analysis ?

ADD REPLY
0
Entering edit mode

At the moment it works only for 16S rRNA sequencing. But soon shotgun metagenomics analysis will be implemented. I will submit a little overview over all its functions soon in the tool section.

ADD REPLY
0
Entering edit mode
3.7 years ago
juanjo75es ▴ 130

For virome assembly, the best results are obtained with s-aligner. It gets at least 64% larger NG50 than the second best.

enter image description here

Source: "s-aligner: a greedy algorithm for non-greedy denovo genome assembly"

ADD COMMENT

Login before adding your answer.

Traffic: 1790 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6