Hi all,
I'd like to know if anyone has, or knows where to get, a list of SNPs in high LD for the latest human genome build (GRCh37). I'm currently working with the Brazilian population. I want to run a PCA for population structure analysis, so I need to remove high-LD regions from my dataset to get less biased PC axes.
Thanks a lot in advance for the attention!
Best regards, Pedro. PS: The --indep-pairwise option in PLINK alone won't solve my problem.
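To make the request concrete, the filtering step in question could look roughly like the sketch below. It is only an illustration, assuming the variants come from a PLINK .bim listing (chromosome, variant ID, cM position, bp coordinate, two alleles) and that a list of high-LD regions is already in hand. The function name snps_in_regions, the variant IDs, and the region coordinates are hypothetical placeholders, not a curated high-LD list.

```python
from typing import Iterable, List, Tuple

# (chromosome, start_bp, end_bp), both ends inclusive
Region = Tuple[str, int, int]


def snps_in_regions(bim_lines: Iterable[str], regions: List[Region]) -> List[str]:
    """Return the IDs of variants in a .bim listing that fall inside any region."""
    hits = []
    for line in bim_lines:
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        chrom, snp_id, pos = fields[0], fields[1], int(fields[3])
        if any(chrom == c and start <= pos <= end for c, start, end in regions):
            hits.append(snp_id)
    return hits


# Placeholder coordinates for illustration only, NOT a real high-LD region list.
example_regions = [("6", 25_000_000, 35_000_000)]

# Made-up .bim rows: one inside the region, one outside, one on another chromosome.
example_bim = [
    "6\trs_a\t0\t26000000\tA\tG",
    "6\trs_b\t0\t40000000\tC\tT",
    "1\trs_c\t0\t26000000\tA\tG",
]

to_exclude = snps_in_regions(example_bim, example_regions)
print(to_exclude)
```

The resulting IDs could be written one per line to a text file and passed to PLINK's --exclude option before running the PCA.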
Thanks a lot for your answer, it really shed some light on my problem!
Do you know if the same data is available for the newest release of the 1000G project?
LD data for phase 3 on GRCh37 is available in the 1000 Genomes browser, powered by Ensembl. More details on that view can be found here.
Thank you very much. But is there also an FTP or whole-genome download site?
I had a chat with my colleague Laura Clarke from the 1000 Genomes Project here at EMBL-EBI, and this is what she said: