problems to pass from QIIME to MEGAN5
2
2
Entering edit mode
9.7 years ago
imindias ▴ 20

Dear all,

First of all hello to everyone, this is my first post and I hope you can help me.

I'm an amateur in bioinformatics and I'm working in microbiota. We use 454 technology by Roche. To analyze diversities, I use qiime, it was difficult for me but finally I got it! but now I want to continue to the next step, the functionality.

I used to work with the tools that WebMGA gave me (fraggenescan, COG and KEGG databases) but I knew MEGAN and it seems very easy and with many opportunities for somebody like me that is learning now.

Well, please, don't think I am a completely idiot, but I am completely lost. I have used my out_table.biom from qiime into Megan and it's good, but I don't get new results, as I can't access to the COG and KEGG analysis.

So I tried to BLASTX my files, but it's impossible to use it (in fact, I am not sure if the files I am introducing are the correct), and I tried diamond, but I have few free space in my computer, and the analysis is killed in the middle.

I am completely desperate.

Please, can anybody indicate me the steps that I need to follow?

Thanks in advance!

qiime megan • 3.9k views
ADD COMMENT
1
Entering edit mode

Do you have wgs or amplicon data? If latter, is it 16S?

ADD REPLY
0
Entering edit mode

Yes, I have 16S amplicons

ADD REPLY
0
Entering edit mode

Many thanks!

I'll try it and I let you know.

Thanks!!!

ADD REPLY
0
Entering edit mode

Hi again,

I tried Picrust and the galaxy version, and I am stuck again.

I have my file: predict_metagenome.picrustp, that acts as a text file. Is this a .biom file? I'm not sure how to proceed now.

Thanks!

ADD REPLY
1
Entering edit mode

It's either a tsv or a biom file depending what output you specified. If you specified tsv (Legacy QIIME format (tab-delimited)), then one way forward is to edit out the first line and run the result through HUMAnN. Commands such as head and tail go a long way finding out what kind of files you're dealing with. IMO one of the core aspects of bioinformatics is understanding the nature of the data formats you're dealing with..

ADD REPLY
0
Entering edit mode

Dear Josh,

thank you for your so nice words. Sometimes, I feel as a complete idiot.

Well, I'm working with human gut microbiota, and I'm using the 454 technology by Roche.

I would like to use MEGAN, because I saw that permits a good visualization of, for example, KEGG pathways. But I'm open to another suggestion, taking into account my limited knowledge.

I started to sue WebMGA, because it's very simple and you don't need a good computer, but you only have some results, visualization is not possible and it's hard to analyze the data.

Thanks in advance

Isabel

ADD REPLY
0
Entering edit mode

First of all, you're not an idiot -- you got a biom file out of QIIME, so you're certainly not an idiot by any means!

So, as 5heikki mentioned, PICRUSt would be a good way to address putative function for the human 16S amplicons. If you're having compute memory issues (join the club!), you can always try the Huttenhower Galaxy server for PICRUSt which also has some other tools for visualization. Give those a test drive and if you have some questions, feel free to post again or send a direct message. Best of luck!

ADD REPLY
0
Entering edit mode
9.7 years ago
5heikki 11k

I would suggest closed-reference OTU-picking against Greengenes 13_8 (13_5) in QIIME and then load up the biom file into PICRUST for metagenome KO predictions. The galaxy version is super simple to use..

ADD COMMENT
0
Entering edit mode
9.7 years ago
Josh Herr 5.8k

Don't worry about being an amateur, we're here to help you! Everyone starts out at the beginning.

So, you have 16S data -- which only tells you about taxonomy in your system. I'm not certain what you want to show exactly, but you would like to show some functional information about your samples. As 5heikki mentioned, a good way to do this in by inferring function using your 16S data from the corresponding genomes via PICRUSt. I should warn that PICRUSt works well for human 16S sequencing where we have good representation of genomes to infer function from, but it doesn't work so well in environments (such as soil) where there are a lot of contributors not in the genome databases. You didn't tell us the type of data you have, so it's hard to recommend what to use.

I'm wondering if you'd like to use MEGAN to visualize your data -- MEGAN is a great BLAST visualizer, but there are other ways of looking at your biom file. I would highly recommend the R package phyloseq if you're looking to show your data structure. If you tell us a little about your data I could suggest more options -- there are many many to choose from.

ADD COMMENT

Login before adding your answer.

Traffic: 1665 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6