I have been using the metagenome-atlas (https://metagenome-atlas.readthedocs.io/en/latest/index.html#) workflow to QC, assemble, bin, and annotate 12 different metagenome samples. Our lab collected some environmental samples and enriched them for certain types of bacteria/archaea. Each type of bacteria/archaea received one treatment and one control. We are trying to see if there's any difference between the treatment and control group, and in general, learn more about the metabolism and genetic information about the organisms in the enriched microbial community. The project is government sponsored so I can't reveal too many details, I apologize for the lack of specifics.
After running the workflow, I now have the .faa and .fna files of the 57 MAGs that were produced from these 12 samples. These are the predicted genes and translated protein sequences. I also have the functional annotations for each genome, in the form of a table of all the Kegg modules (the workflow used DRAM to annotate the genomes with functional annotations and infer the Kegg modules) and a table of all the annotations.
My lab has no prior experience in bioinformatics, and my supervisor has told me to produce "publication quality figures" to visualize the metagenome analysis data. I've been provided with some papers that did similar studies, and have been told to produce similar figures with very little direction or guidance on how to do this. I am not familiar with R, Python, or MATLAB. What are some visualization tools or software I could use to make sense of the functional annotation results and the MAGs that were produced? What do "publication quality figures" entail, and how do I go about producing these?
No matter what we tell you about it, your supervisor may or may not be happy with it. They should be telling you what the expectation is. They should also be providing support for you to do this work, because it is not easy to summarize large amounts of metagenomic data without prior experience.
I just clarified with them again and they said they want the taxa of the bacteria/archaea, genome completeness of abundant microbes, and the difference in expression levels of metabolic genes between the treatment and control group. They also want metabolic pathway identification of those genes. Unfortunately I don't have much support for this, the lab is spread too thin for anyone to work with me on this, and our company doesn't have another bioinformatic group (or any other scientific group actually) for us to collaborate with