Hi everyone,
I get this error when I try to split a VCF that I generated from PLINK format:
[E::bcf_hdr_add_sample] Duplicated sample name '103_Sp676'
These are the steps I used to remove duplicates; however, I still get the same error:
data.bed:
plink --file data --maf 0.05 --make-bed --out data
data.DuplicatesRemoved.bed:
plink --bfile data --list-duplicate-vars ids-only suppress-first
plink --bfile data --exclude plink.dupvar --make-bed --out data.DuplicatesRemoved
data.DuplicatesRemoved.ped:
plink --bfile data.DuplicatesRemoved --recode --tab --out data.DuplicatesRemoved
data.DuplicatesRemoved.vcf:
plink --file data.DuplicatesRemoved --recode vcf --out data.DuplicatesRemoved
Any help on how to fix this?
Thanks
I also added the plink tag to your post. That way, it may be picked up by the person who is much more experienced in plink than anyone else here on Biostars.
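One thing that may be worth checking in the meantime: the error refers to a duplicated sample name, while --list-duplicate-vars only deals with duplicate variants, so the steps in the post would not be expected to change it. Below is a minimal sketch for listing repeated sample names in a .fam file. It assumes the VCF sample names are built as FID_IID, which is my understanding of the default for --recode vcf; the function name list_dup_samples is made up for illustration.

```shell
# Hypothetical helper: list sample names that would collide in the VCF header.
# Assumes a PLINK .fam file (columns: FID IID PAT MAT SEX PHENO) and that
# the VCF sample name is FID and IID joined by an underscore.
list_dup_samples() {
    # Count each FID_IID string; print those seen more than once with the count.
    awk '{ n[$1 "_" $2]++ } END { for (s in n) if (n[s] > 1) print s, n[s] }' "$1"
}
```

You could run it as list_dup_samples data.DuplicatesRemoved.fam. If 103_Sp676 shows up in the output, the duplication is in the sample IDs themselves and would need to be fixed there (for example by renaming or removing one of the samples) rather than by filtering variants.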