Note: This is different from the many vcf2maf questions posted around Biostars.
Like many others, I need to generate a MAF file for use in downstream applications like maftools or TRONCO. My final goal is to obtain a multi-sample MAF file like the ones shipped by TCGA.
However, I can't use vcf2maf, because it requires VCFs to only have 2 samples. I have either:
- Several individual VCFs (1 per sample)
- One single VCF with information from all samples (with the multiallelic loci decomposed)
The first case would fit vcf2maf, but there is no indication of how to merge the resulting per-sample MAF files into a single one. The second case is totally incompatible with vcf2maf because the VCF contains more than 2 samples.
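For the first case, the only approach I can think of is plain concatenation, sketched below. This is a minimal sketch under my own assumptions, not something taken from the vcf2maf or GDC documentation: it assumes every per-sample MAF is tab-delimited, has an identical column header line, already carries its own sample ID in each row (e.g. in Tumor_Sample_Barcode), and may start with comment lines such as "#version 2.4". The function name concat_mafs is a hypothetical helper, not part of any existing tool.

```python
def concat_mafs(maf_paths, out_path):
    """Concatenate per-sample MAF files into one multi-sample MAF.

    Assumption: all inputs share exactly the same column header line.
    Returns the number of variant rows written.
    """
    header = None  # column header line taken from the first file
    n_rows = 0
    with open(out_path, "w") as out:
        for path in maf_paths:
            seen_header = False
            with open(path) as maf:
                for raw in maf:
                    line = raw.rstrip("\r\n")
                    if not line.strip():
                        continue  # skip blank lines
                    if line.startswith("#"):
                        # Keep comment lines (e.g. "#version 2.4") only
                        # until the first column header has been written.
                        if header is None:
                            out.write(line + "\n")
                        continue
                    if not seen_header:
                        # First non-comment line of each file is its header.
                        seen_header = True
                        if header is None:
                            header = line
                            out.write(line + "\n")
                        elif line != header:
                            raise ValueError(
                                f"{path}: column header differs from the first file"
                            )
                        continue
                    out.write(line + "\n")
                    n_rows += 1
    return n_rows
```

Calling concat_mafs(["sample1.maf", "sample2.maf"], "merged.maf") (file names are placeholders) would write one header followed by the rows of both files. Refusing to merge when the headers differ is deliberate: silently concatenating files with different column orders would shift values into the wrong columns. I don't know whether this matches what the GDC aggregation workflow actually does.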
The GDC Data Portal docs mention an aggregation workflow which starts from VCFs and ends in MAF, but I've poked around in the GDC sources and couldn't find anything.
I know this is possible because TCGA does it, but are there any tools to perform this task?
Have you figured it out?