Proxy SNPs from 1000 genome phase3
1
1
Entering edit mode
5.7 years ago
bayoteju ▴ 10

I have a list of about 250 SNPs. I want to obtain the proxy SNPs for them using the 1000 genomes. I have downloaded corresponding .vcf and .ped files from 1000 genomes. I have also used PLINK to obtain the .bim, .fam and .bed files.

How do I proceed to determine the proxy SNPs without using any software?

Thank you.

snp plink ld • 2.3k views
ADD COMMENT
0
Entering edit mode

Thank you very much. I am using snpStat to calculate the LDs.

ADD REPLY
0
Entering edit mode
5.7 years ago
zx8754 12k

Get LD for selected SNPs using plink --r2. Then pick any proxy SNP having highest LD score.

Why not using other software? We could, for example, use PriorityPruner which is specifically designed to pick proxies with prioritisation, i.e. when SNP1 is in LD with SNP2 and SNP3, it will prioritise based on other features of the SNP (like pvalue, frequency, annotations, etc.).

PriorityPruner is a software program which can prune a list of SNPs that are in high linkage disequilibrium (LD) with other SNPs in the list, while preferentially keeping SNPs of higher priority (e.g., the most significant SNPs in a genome-wide association study). The process of pruning SNPs based on LD is sometimes referred to as "LD clumping".

ADD COMMENT

Login before adding your answer.

Traffic: 2144 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6