1/2 in VCF genotype field?
2
3
Entering edit mode
8.6 years ago
mbk0asis ▴ 700

Hi.

I made vcf files using UnifiedGenotyper in GATK and found that many of mutants had 1/2 at genotype (GT) field.

As I understand, 0/0 = homo ref, 0/1 = hetero, 1/1 = homo alt.

Does anyone know what exactly 1/2 mean?

Thank you!

vcf zygosity • 10k views
ADD COMMENT
10
Entering edit mode
8.6 years ago

There are apparently two alternate alleles at this position. So if the reference is C, then perhaps you have an A and a G.

ADD COMMENT
10
Entering edit mode
8.6 years ago

just to expand Devon's answer, which is perfectly correct...

this is due to a VCF format constrain, which states that 0 must always be the reference allele. this doesn't mean that the reference allele must be in a particular sample, it just means that a particular base is used as reference by a scientific community agreement.

in case you're working with human samples, this agreement is now supported by the Genome Reference Consortium, and I really hope that when the time comes (we shouldn't be that far) they'll be able to build a consensus reference based on entire world populations allele frequencies, but meanwhile we have to stick to the current reference. current GRCh38 has already done some work in this direction though.

since humans are diploids, sequencing a particular position will give you a pair of alleles, and usually the reference will be 1 of that 2 alleles. but this is not always the case, and you may find (as you already did) that a particular position doesn't contain the reference allele. this is rare, but far from being impossible due to the lack of population genetics evidences in the reference selected, as explained previously.

ADD COMMENT

Login before adding your answer.

Traffic: 1989 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6