Pruning a dense SNP dataset without losing too much LD
2
2
Entering edit mode
9.6 years ago

Hi,

I have a set of ~1.7 million SNPs and I want to bring it down to ~300,000, but without losing too much LD between the remaining SNPs (because the methods I'm using after pruning rely heavily on IBD tracts).

What are the best parameters for the --indep-pairwise flag in PLINK to create a dataset with the desirable features? Any ideas?

Thanks!

plink • 4.1k views
ADD COMMENT
2
Entering edit mode
6.6 years ago

If you’re explicitly trying to keep LD, and you’ve already performed basic quality control, —maf and —bp-space are probably the best tools to use.

ADD COMMENT
0
Entering edit mode
9.6 years ago

You may need to experiment with increasing the --indep-pairwise flag a bit to get down to 300k. There is really only one parameter to play with, so just keep going until you reach your desired density.

ADD COMMENT

Login before adding your answer.

Traffic: 2054 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6