What is the best way to download somatic mutations called from whole genome sequencing data in the cancer genome atlas (TCGA) project? Downloading somatic mutations in VCF for MAF format is preferable, but I will also download BAMs and call mutations if necessary.
I have spent a lot of time browsing the Genomic Data Commons (GDC) data portal (https://gdc-portal.nci.nih.gov/) today and can find only whole exome sequencing data.
Some of the MAF files have somatic variants from WGS, check Sequence_Source column in MAF file. Its also worth mentioning that TCGA only provides somatic variants and no germline snps for privacy reasons. Not sure if one can get WGS bam files for the same reason.
Hi Charles, I am resuscitating this old post since I am facing the same issue right now. Did you finally find where to be able to download cancer somatic mutations from WGS? Thanks
I have posted an answer below.