Entering edit mode
10.6 years ago
zhangz.cs
▴
300
I got the file union.2010_06.deletions.genotypes.vcf.gz
at the 1000genomes ftp site. What information in the file can help me which deletions are homogeneous and which are heterogeneous? Thank for your help in advance.
The beginning of the file header is as follows:
##fileformat=VCFv4.0
##ALT=<ID=DEL,Description="Deletion">
##FILTER=<ID=LowQual,Description="Genotype call confidence below LOD 1.3">
##FORMAT=<ID=CN,Number=1,Type=Integer,Description="Copy number genotype for imprecise events">
[...]
To expand a bit, you want to look at the Genotype (GT) field for each sample of interest. This GATK resource provides a more detailed explanation: http://gatkforums.broadinstitute.org/discussion/1268/how-should-i-interpret-vcf-files-produced-by-the-gatk