split vcf, choice
6.6 years ago
emma.a ▴ 130

Hi,

I have to split a multisample VCF (the output of GATK4) into single-sample VCFs (one per sample). What is the best command to use? Afterwards I have to annotate them with ANNOVAR.

vcf-subset -c samplename -e inputvcf > sample.vcf

bcftools view -Oz -s samplename inputvcf > sample.vcf.gz

Is it correct to add the -e option to the vcf-subset command, or am I going to lose data?

Best

vcf mergedvcf spltvcf • 1.2k views

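With -e, vcf-subset excludes rows that do not contain a variant in the selected sample. So if that particular sample has no variant at a site, i.e. it is either not called or homozygous reference, the site will not be printed, which makes sense for a per-sample VCF.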
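For the bcftools route, note that -s on its own keeps every site, including those where the selected sample is homozygous reference or missing. A rough sketch of a per-sample loop that also drops those sites could look like the one below. The function name split_vcf_by_sample and the output naming are my own assumptions, not part of bcftools, and the second step relies on bcftools recomputing the allele count after subsetting (its default behaviour), so please check the result on a small file first.

```shell
#!/usr/bin/env bash
# Sketch only: split a multisample VCF into one bgzipped VCF per sample.
# split_vcf_by_sample is a hypothetical helper name, not a bcftools command.
split_vcf_by_sample() {
    local input_vcf="$1"   # multisample VCF (plain or bgzipped)
    local out_dir="$2"     # directory for the per-sample files
    local sample
    mkdir -p "$out_dir" || return 1
    # "bcftools query -l" prints one sample name per line.
    bcftools query -l "$input_vcf" | while IFS= read -r sample; do
        # Step 1: keep only this sample's genotype column.
        # Step 2: keep only sites with at least one non-reference allele
        #         left in that sample (-c 1 is --min-ac 1); "-" reads stdin.
        bcftools view -s "$sample" "$input_vcf" |
            bcftools view -c 1 -Oz -o "$out_dir/$sample.vcf.gz" -
    done
}

# Example call (file and directory names are placeholders):
# split_vcf_by_sample cohort.vcf.gz per_sample
```

Doing the subsetting and the allele-count filter as two separate steps keeps the order of operations explicit. If you want to keep the non-variant sites for a sample, drop the second bcftools view and write the output of the first one directly.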
