Hi.
Thank you for always help. I have an additional problem.
I would like to merge two vcf files (a.vcf, b.vcf) into one vcf file (c.vcf) using GATK CombineVariants. a.vcf and b.vcf have have same variants but don't regard same variants. Specifically, a.vcf and b.vcf are shown as below;
$ cat a.vcf
CHROM POS ID REF ALT QUAL FILTER INFO FORMAT SampleA SampleB
chr1 897460 v5_202 A <*:DEL> . PASS . GT:AD:DP 0/0:20,0:20 0/1:14,14:28
$ cat b.vcf
CHROM POS ID REF ALT QUAL FILTER INFO FORMAT SampleC SampleD
chr1 897459 v6_202 CA C 2068.83 PASS . GT:AD:DP 0/0:43,0:43 0/1:40,6:46
As you can see, these files have a data of same variant, but coordinates are different. I want to merge the two files into one file and merge these two variant data into one variant data using GATK CombineVariants.
How should I merge the files?
How can a same variant have different coordinates in different samples. From the ID, it looks like they have been processed using different versions of "something".
v5_202
v6_202
So you can't really merge them or they will be represented twice in your VCF file as separate variants.Thank you for your comment.
Each vcf file was separately called using samples from different capture kit (V5, V6).
Even though It cant have different coordinates for same variants. Essentially you can't merged these two unless have same coordinates.
In that case have them as separate variants or manually correct one of the coordinates. But I do not know if that will have any downstream effects.