How do I interpret genotype likelihood in the VCF format?
1
1
Entering edit mode
5.8 years ago
colin.kern ★ 1.1k

I've run FreeBayes on a few samples, but I'm not sure how to interpret the GL (genotype likelihood) part of the VCF format. As an example, here is the info from one of the samples:

GT:DP:AD:RO:QR:AO:QA:GL 0/1:28:13,15:13:521:15:611:-46.8976,0,-38.8067

I understand that there are 13 reads with the reference allele and 15 reads with the alternate allele, but I don't understand exactly what the p-values represented in the GL part are the significance values for, and why there are three values. The description I've found for GL is "Genotype Likelihood, log10-scaled likelihoods of the data given the called genotype for each possible genotype generated from the reference and alternate alleles given the sample ploidy.". So is it the p-values of 0/0, 1/1, and 0/1? What is the order? And why is one of them always log10 p-value of 0?

SNP vcf • 2.7k views
ADD COMMENT
3
Entering edit mode
5.8 years ago
colin.kern ★ 1.1k

Ok, it seems like the order is 0/0, 0/1, 1/1, but the numbers are normalized by the most likely genotype, which will therefore always be 0.

ADD COMMENT

Login before adding your answer.

Traffic: 2690 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6