Why does GATK LeftAlignAndTrimVariants set a missing genotype to 0/0?
0
0
Entering edit mode
8.1 years ago
Apprentice ▴ 170

Hi. I always appreciate many your helps.

I have one vcf file (a.vcf). This file has one variant data. The data also has missing genotypes "./." because of DP=0. The variant is tri-allelic variant as below.

"a.vcf"

CHROM POS ID REF ALT QUAL FILTER INFO FORMAT sample1 sample131 sample138 sample908

chr12 104350956 . G T,A 147880 PASS . GT:AD:DP:GQ ./.:0,0:0 0/1:25,22,0:47:99 0/0:36,0,0:36:99 ./.:0,0:0

I want to split the tri-allelic data into bi-allelic data, so I did the below command using GATK.

java -jar GenomeAnalysisTK.jar \

-T LeftAlignAndTrimVariants \

-R ${ref_path} \

--variant a.vcf \

-o b.vcf \

--splitMultiallelics \

--reference_window_stop 900

As a result, I got b.vcf. "b.vcf"

CHROM POS ID REF ALT QUAL FILTER INFO FORMAT sample1 sample131 sample138 sample908

chr12 104350956 . G T 147880 PASS AC=1;AF=0.250;AN=4 GT:AD:DP:GQ 0/0:.:0 0/1:25,22:47:99 0/0:36,0:36:99 0/0:.:0

chr12 104350956 . G A 147880 PASS AC=0;AF=0.00;AN=4 GT:AD:DP:GQ 0/0:.:0 0/0:25,0:47:99 0/0:36,0:36:99 0/0:.:0

In the b.vcf, thw splited two variants were bi-allelic data, but the missing genotypes were set to "0/0". I want to remain the missing genotype after the process of GATK.

How should I process the file?

GATK's version is 3.6.

sequence SNP next-gen • 3.0k views
ADD COMMENT
2
Entering edit mode
ADD REPLY
0
Entering edit mode
ADD REPLY

Login before adding your answer.

Traffic: 2414 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6