Entering edit mode
8.3 years ago
oluwaseyi.shorinola
▴
20
I have a multisample VCF4.2 file in which variants are called in two different samples against a reference genome. For variant site, the INFO fields DP;ADF;ADR;AD;AC1;DP4 . are returned as below:
CHROM POS ID REF ALT QUAL FILTER INFO
chr1A 100 . G C 30 . DP=5;ADF=1,1,0;ADR=0,3,0;AD=1,4,0;AC1=3;DP4=1,0,1,3
I am however getting 3 values in the ADF, ADR, and AD fields instead of two values i.e ref and alternate allele depth as expected for biallelic sites. And this is the same for all my ~20,000 biallelic sites. Can anyone kindly explain why this might be.
Many thanks
Olu
Splitting mulit-allelic sites into multiple vcf records will not clean up excess allele fields.
did you discover why this happens? I have the same problem