My lab is looking to get into Whole Genome Sequencing, and we of course need a processing and analysis pipeline for the data we generate. I've been reading up on WGS / WES, and I understand the basics but I haven't seen any "best practices" type documents / tutorials like those that exist for single cell and bulk RNAseq (my typical work). Since this is the first time I'll have analyzed WGS data I'm less familiar with the literature surrounding the topic, and would greatly appreciate if someone could point me in the right direction.
Check the GATK Best Practices: https://gatk.broadinstitute.org/hc/en-us
Check SpeedSeq. Unlike GATK it is not as painful to use imho.
Are either of GATK or SpeedSeq capable of analyzing the microbiome of a sample? I should have clarified in my original post that that's what we're primarily interested in.
When people discuss WGS analysis, they usually mean a single genome. The goal is to find variants (point mutations, insertions/deletions, copy number, etc.).
For metagenomics/microbiomics, the goal is usually to quantify different species. Thus, the inputs, the outputs, and the algorithms will be very different.
Ah ok. Thank you for the clarification.