Question

Pruning a dense SNP dataset without losing too much LD

2


9.7 years ago

yorgos.athanasiadis ▴ 70

Hi,

I have a set of ~1.7 million SNPs and I want to bring it down to ~300,000, but without losing too much LD between the remaining SNPs (because the methods I'm using after pruning rely heavily on IBD tracts).

What are the best parameters for the --indep-pairwise flag in PLINK to create a dataset with the desired features? Any ideas?

Thanks!

plink • 4.1k views

updated 2.5 years ago by Ram 44k • written 9.7 years ago by yorgos.athanasiadis ▴ 70

score 2 · Answer 1 · 2018-05-06

2


6.6 years ago

chrchang523 11k

If you’re explicitly trying to keep LD, and you’ve already performed basic quality control, --maf and --bp-space are probably the best tools to use.
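
A minimal sketch of how those two flags might be combined in a single PLINK 1.9 run. The fileset prefix `mydata`, the MAF cutoff of 0.05 and the 5000 bp spacing are illustrative assumptions, not values from this thread; the spacing in particular would need tuning to land near 300k SNPs. The script only prints the command unless PLINK and the input files are actually present.

```shell
#!/bin/sh
# Assumptions (not from the thread): "mydata" is a hypothetical PLINK binary
# fileset prefix (mydata.bed/.bim/.fam); the thresholds are illustrative only.
PREFIX=mydata
MAF=0.05        # drop variants with minor allele frequency below this
SPACING=5000    # minimum distance in bp between retained variants

# Build the PLINK command line as a string so it can be inspected first.
thin_cmd() {
    echo "plink --bfile $PREFIX --maf $MAF --bp-space $SPACING --make-bed --out ${PREFIX}_thinned"
}

cmd=$(thin_cmd)
if command -v plink >/dev/null 2>&1 && [ -f "$PREFIX.bed" ]; then
    $cmd
    # Each line of the .bim file is one remaining variant.
    awk 'END {print NR " variants kept"}' "${PREFIX}_thinned.bim"
else
    echo "dry run: $cmd"
fi
```

Raising `SPACING` thins more aggressively, so rerunning with a few values and checking the variant count is one way to approach the target density.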


Ram · Answer 2 · 2015-04-25

0


9.7 years ago

Sean Davis 27k

You may need to experiment with the --indep-pairwise settings a bit to get down to 300k. There is really only one parameter to play with, the r^2 threshold (lowering it prunes more SNPs), so just keep adjusting it until you reach your desired density.
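
The trial-and-error search described above could be scripted roughly as follows. This is a sketch under assumptions: `mydata` is a hypothetical fileset prefix, the window (50 variants) and step (5) are commonly used example values rather than recommendations, and the list of r^2 thresholds is arbitrary. Without PLINK or the input files it only prints the commands it would run.

```shell
#!/bin/sh
# Assumptions (not from the thread): "mydata" is a hypothetical PLINK binary
# fileset prefix; window, step and the threshold grid are illustrative.
PREFIX=mydata
TARGET=300000

# Build the --indep-pairwise command for one r^2 threshold
# (arguments: window size in variants, step size, r^2 threshold).
prune_cmd() {
    echo "plink --bfile $PREFIX --indep-pairwise 50 5 $1 --out prune_r2_$1"
}

for r2 in 0.9 0.8 0.7 0.6 0.5; do
    cmd=$(prune_cmd "$r2")
    if command -v plink >/dev/null 2>&1 && [ -f "$PREFIX.bed" ]; then
        $cmd >/dev/null
        # .prune.in lists the variants PLINK would retain at this threshold.
        kept=$(awk 'END {print NR}' "prune_r2_$r2.prune.in")
        echo "r2=$r2 kept=$kept"
        # Stop at the first (highest) threshold that reaches the target.
        if [ "$kept" -le "$TARGET" ]; then
            break
        fi
    else
        echo "dry run: $cmd"
    fi
done
```

Once a threshold gives an acceptable count, the retained variants can be extracted with something like `plink --bfile mydata --extract prune_r2_0.7.prune.in --make-bed --out mydata_pruned` (file names here follow the hypothetical prefixes used in the sketch).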

updated 2.5 years ago by Ram 44k • written 9.7 years ago by Sean Davis 27k