Hi,
this is a very basic question but I can't find a proper explanation anywhere.. yes I have read the .vcf specification documents.
How are 'AC' and 'AN' calculated? I have human vcf files where there are homozygote variants with AC=2;AN=2 and heterozygote variants with AC=1; AN=2. If I had multiple samples with multiple variants, how are AC and AN calculated? So does AC ignore the reference allele? The vcf spec is not very helpful with "allele count in genotypes, for each ALT allele, in the same order as listed"?
Thanks,
S.
Yes, that is exactly from the vcf specs. It does not say how they are calculated for multiple samples, e.g. if I have three samples with genotypes 0/1, 1/1, 0/0 what is AC and AN?
In my understanding
AC
would be 3 (because you just have one ALT allele which is found 3 times) andAN
would be 6 (because you have 3 samples with 2 alleles each).It's getting more interesting if you have more than one ALT allele. Let's assume the genotype 0/1, 1/1 and 1/2.
AC
is now 4,1 (4 times ALT allele 1 and 1 time ALT allele 2) andAN
is still 6.