I have many VCFs from different samples. If I merge these into a single VCF using vcftools
(vcf-merge
), the samples where a variant wasn't called are labeled as missing that variant. Instead, I want the VCF to show that the sample has the reference allele (safe to assume in my application).
Is there a way to call missing variants in a VCF as the reference allele? What tools can I use to do this?
EDIT:
The sequences were originally variant called using FreeBayes (through the LongRanger pipeline).
RE-EDIT:
Turns out I can simply use the --ref-for-missing
flag in vcf-merge
to achieve this. Problem solved.
RE-RE-EDIT:
Using --ref-for-missing
flag in vcf-merge
does of course not give the variants any annotation, like depth and genotype quality.
I think
vcf-merge
with the--ref-for-missing
flag should solve the problem yourVcfNoCallToHomRef
solves. YourFixVcfMissingGenotypes
tool sounds really useful though.I didn't try your solution because annotating the depth only isn't enough for me. I accepted the answer anyway, as it solves the problem I stated.