Hello,
I am trying to figure out how to know if GWAS SNPs tagging a same or near region do come from the same real association signal or not. In other words, if a later GWAS identifies a close SNP to the signal we have, I would like to know if its an independent loci or its just a replica.
I started by trying to code a custom script for it only based on a certain LD threshold including SNPs within a specified kb range, so all SNPs under these conditions belong to a same signal. The problem is that if later I want to add a new set of SNPs mapping near these ones, calculations have to be redone exponentially increasing the SNPs combinations to check for LD, and it could be highly resource consuming.
Right now I am trying with the --clump
option from plink
, but as this is based in LD and p-value, I don't know if this would be a correct approach for it.
Do you have any advice, suggestion or preferred way/software to do this?
Yes, I already took into account the population issue. I work with European populations, so CEU.
Thanks, Larry, for pointing that!