Taxonomic assignment of bins found by Maxbin2
1
0
Entering edit mode
5.1 years ago
Hansen_869 ▴ 80

Hi there!

I have been running Maxbin2 to generate bins from metagenomic data files (bacteria). The output is a series of bins (in fasta), but with no taxonomic assignment. I need a tool to annotate these bins, into to their respective organism names. I am running my tools in Snakemake, and thus need a command-line tool.

So far, I've been looking at MEGAN, metabin and cat/bat. MEGAN only has a graphic interface. Metabin is impossible to find data on how to use. Cat/bat requires me to download the entire database of taxonomy files, which is too huge for my PC.

Ideally, I need a tool that annotates the bins, is command-line based and doesn't require me to download huge files, but rather use an online database.

Thanks!

Maxbin2 bins taxonomy • 1.9k views
ADD COMMENT
0
Entering edit mode

Would metaphlan2 be an option?

http://huttenhower.sph.harvard.edu/metaphlan2

ADD REPLY
0
Entering edit mode

Thanks for your answer, however I fail to see where it says metaphlan2 can be used with bins. Looks like it is only for a collecting of reads?

ADD REPLY
0
Entering edit mode
5.1 years ago
Mensur Dlakic ★ 28k

Ideally, I need a tool that annotates the bins, is command-line based and doesn't require me to download huge files, but rather use an online database.

Not sure that such a tool exists. These days most people have no problem with disk space or downloading large files, so developers have little incentive to make tools that work directly with large online databases. Not to mention that those large databases prefer less online traffic if possible.

While CheckM does not fit your criteria - there is a large database to download - it may be worth your time. It will assess completeness and contamination of your bins, as well as perform taxonomic assignments.

ADD COMMENT
0
Entering edit mode

Hi Mensur,

CheckM seems to do the job, and the database is not too big. However, when I run checkM, i get the taxonomic assignments in my terminal, but i fail to see a file containing the information regarding that. I'm using the default Lineage-specific workflow that they suggest.

ADD REPLY
0
Entering edit mode

This is a late reply but for anyone wondering how to fix this use the option -f to redirect the stdout (with taxonomy table and stats) to a file:

  checkm lineage_wf bins outputfolder -f output.txt
ADD REPLY

Login before adding your answer.

Traffic: 1674 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6