I'm trying to use BEAGLE 4 to do an IBD analysis.
I'm running the command: java -jar beagle.r1398.jar gt=vcf_format.vcf out=beagle ibd=true
where vcf_format.vcf
is a file I generated using PLINK 1.9 (with --recode vcf-iid
) from my PLINK format files (bed, bim, fam).
BEAGLE stops with an exception:
Duplicate marker: 1 72765116 rs2568958_r A G
I've searched for this marker in both the VCF file and the BIM file, and it appears only once in both.
Can anyone suggest what to do? Or how can I check differently for duplicates in my files?
Thanks
Implemented as
--list-duplicate-vars
in the December 9 development build. Sample workflow, if you don't have multiple variants with the same ID:Thank you chrchang523.
This is R script ready to run.