Hi,
I am trying to calculate linkage disequilibrium (LD) values for a group of SNPs to be used as an input for the tool Locus explorer . I followed the tutorial they had mentioned in their help section to create LD file from 1000 Genomes data, When I try to replicate their example ( example - 2.1.2 Calculate LD for list of SNP against all SNPs within 1000kb region.) I get the following plink error;
Note: No phenotypes present.
Error: Duplicate ID '.'.
I checked for possible solutions to this and most of them suggested to use
--list-duplicate-vars
function of plink, in this case there are no duplicate ids and even using the function results in same error. I intend to create a file like this (InputFileFormat - LDfile);
CHR_A - Chromosome on which Index SNP is located (n.b. do not include "chr"). e.g. 2, 23
BP_A - Index SNP start coordinate (Hg19, do not include chromosome or end coordinate for in/del variants). e.g. 104356185
SNP_A - Index SNP ID
CHR_B - Chromosome for SNP in LD with Index SNP (SNP_A)
BP_B - Start coordinate (Hg19, do not include chromosome or end coordinate for in/del variants) of SNP in LD with Index SNP (SNP_A). e.g. 104315667
SNP_B - ID of SNP in LD with Index SNP (SNP_A). e.g. rs10786679, chr10:104329988:D
R2 - LD score between SNP_A and SNP_B (0 to 1). e.g. 1, 0.740917
Note: Lead SNP must be defined relative to itself for plotting purposes, e.g.:
CHR_A BP_A SNP_A CHR_B BP_B SNP_B R2
2 173309618 rs13410475 2 173309618 rs13410475 1
2 173309618 rs13410475 2 172827293 rs148800555 0.0906124
Can anyone suggest a possible solution to this (with or without using PLINK)
Thanks.
Do you mind sharing your SNP file?
Try using --biallelic-only strict flag to drop SNPs that have same positions?
Added issue at GitHub, we will update the tutotrial file on next release
0.7.1
Hi, thank you for your reply. I tried the --biallelic-only strict flag as well, still the same error message.
The data is not my own so I am not at liberty to share it publicly. I have given a sample of the SNP file below;