Hi,
I have a set of ~1.7 million SNPs that I want to reduce to ~300,000 without losing too much LD between the remaining SNPs, because the methods I'm using after pruning rely heavily on IBD tracts.
What are the best parameters for the --indep-pairwise flag in PLINK (window size, step size, and r^2 threshold) to create a dataset with these properties? Any ideas?
Thanks!