Hello
I have VCF outputs of Mutect2 (GATK4) with no downsampling. Some variants have multiple calls, which are comma separated in ALT column. I would like to preserve all of them before inputting in VEP including their respective TLOD and AF scores into a new VCF file where each alternative allele is in one separate line.
Here is an example:
original VCF file:
chr1 149813407 . G A,T . . DP=17510;ECNT=9;POP_AF=5.000e-08,5.000e-08;TLOD=4.36,49.82 GT:AD:AF:F1R2:F2R1:MBQ:MFRL:MMQ:MPOS:SA_MAP_AF:SA_POST_PROB 0/1/2:15000,22,59:0.049,0.051:7274,4,30:7726,18,29:32,33:149,153,144:48,45:13,7:0.010,0.010,3.912e-03:1.000,1.055e-10,3.564e-06
New VCF file:
chr1 149813407 . G A . . DP=17510;ECNT=9;POP_AF=5.000e-08;TLOD=4.36 GT:AD:AF:F1R2:F2R1:MBQ:MFRL:MMQ:MPOS:SA_MAP_AF:SA_POST_PROB 0/1/2:15000,22,59:0.049,0.051:7274,4,30:7726,18,29:32,33:149,153,144:48,45:13,7:0.010,0.010,3.912e-03:1.000,1.055e-10,3.564e-06
chr1 149813407 . G T . . DP=17510;ECNT=9;POP_AF=5.000e-08;TLOD=49.82 GT:AD:AF:F1R2:F2R1:MBQ:MFRL:MMQ:MPOS:SA_MAP_AF:SA_POST_PROB 0/1/2:15000,22,59:0.049,0.051:7274,4,30:7726,18,29:32,33:149,153,144:48,45:13,7:0.010,0.010,3.912e-03:1.000,1.055e-10,3.564e-06
I am very new in VCF processing but I have looked at bcftools and vcftools to no avail. I appreciate your help
Thank you
@OP: . There are tools to do that. I tested on VEP offline and current version doesn't support splitting of alt allele records in VCF output with default parameters VCF as output.
Yes I tried that too. The bcftools comment works though! Thanks