Question

Getting p-values for whether allele frequencies at 1800 SNPs differ between two or more populations

8.3 years ago

devenvyas ▴ 760

I've got a set of 1800 SNPs that I am studying. I have the allele frequencies in two related populations and the counts for each allele (i.e., I have the output of plink --freq --missing --within cluster --out output). For the most part, the allele frequencies track each other. My null hypothesis is that, for each individual SNP, the allele frequency is equal in the two populations. I want to compute p-values to decide whether to reject the null hypothesis for each SNP.

I have access to R and to JMP Genomics. Any suggestions on how to do this?

snp • 3.5k views

ADD COMMENT • link updated 8.2 years ago by abascalfederico ★ 1.2k • written 8.3 years ago by devenvyas ▴ 760

Exactly equal or approximately equal?

Essentially, this sounds like an association analysis in which you want to check whether one allele is more common in one population than in the other, beyond what would be expected by chance.

ADD REPLY • link 8.3 years ago by WouterDeCoster 47k

Approximately equal

Basically yes, but I want to do that for 1800 individual SNPs.

ADD REPLY • link 8.3 years ago by devenvyas ▴ 760

What about http://pngu.mgh.harvard.edu/~purcell/plink/anal.shtml then?

ADD REPLY • link 8.3 years ago by WouterDeCoster 47k

I am trying to figure out how to get that to work with Plink 1.9, which lets me specify which allele is which. Also, my populations are defined by a cluster file rather than by case/control status.

Any other ideas on how to do this?

ADD REPLY • link 8.3 years ago by devenvyas ▴ 760

I don't see the problem with the populations not being case/control: you can just do an association test of population A vs population B. You might have population stratification issues, but technically it shouldn't be a problem.
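
One way to set up "population A vs population B" is to recode the cluster file as a phenotype file. The snippet below is only a sketch: it assumes the cluster file has the usual three whitespace-separated columns (family ID, individual ID, cluster name) and uses PLINK's phenotype coding of 1 = control and 2 = case. The function name clusters_to_pheno and the cluster labels popA/popB are made up for illustration.

```python
def clusters_to_pheno(cluster_lines, case_cluster):
    """Turn 'FID IID CLUSTER' lines into 'FID IID PHENO' lines.

    Samples in `case_cluster` are coded 2 (case); all others are coded 1
    (control). Blank or malformed lines are skipped.
    """
    pheno_lines = []
    for line in cluster_lines:
        fields = line.split()
        if len(fields) < 3:
            continue  # not a complete FID / IID / cluster record
        fid, iid, cluster = fields[:3]
        pheno = 2 if cluster == case_cluster else 1
        pheno_lines.append(f"{fid} {iid} {pheno}")
    return pheno_lines


# Hypothetical cluster-file contents, for illustration only
example = ["F1 I1 popA", "F2 I2 popB", "", "F3 I3 popA"]
pheno = clusters_to_pheno(example, case_cluster="popA")
print("\n".join(pheno))
```

Written to a file, the result could then be supplied through PLINK's --pheno option together with --assoc. With more than two clusters, this sketch lumps every cluster other than the chosen one into the control group, which may or may not be what you want.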

ADD REPLY • link 8.3 years ago by WouterDeCoster 47k

But how do I tell Plink to do it based on clusters instead of case/control status?

ADD REPLY • link 8.2 years ago by devenvyas ▴ 760

score 0 · Answer 1 · 2016-09-08

8.2 years ago

abascalfederico ★ 1.2k

From a purely statistical point of view, you could check whether the allele frequencies are different with Fisher's exact test, applied to the allele counts of each SNP. From a biological point of view, I don't see whether that would mean anything.
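
As a rough illustration of the test, here is a minimal sketch in plain Python. It assumes that for each SNP you have a 2x2 table of allele counts (allele 1 and allele 2 in population 1, then the same in population 2). The function name, the SNP IDs, and the counts are all made up; in R, the built-in fisher.test function does the same job on a 2x2 matrix.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    a, b: counts of allele 1 and allele 2 in population 1
    c, d: counts of allele 1 and allele 2 in population 2
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of seeing x copies of allele 1
        # in population 1, with all margins held fixed
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    # Sum over every table that is no more likely than the observed one;
    # the small tolerance guards against floating-point ties
    total = sum(p for p in map(prob, range(lo, hi + 1))
                if p <= p_obs * (1 + 1e-7))
    return min(1.0, total)


# Hypothetical allele counts per SNP: (a, b, c, d) as described above
snp_counts = {
    "snp_example_1": (30, 70, 55, 45),
    "snp_example_2": (48, 52, 50, 50),
}

for snp, counts in snp_counts.items():
    print(snp, fisher_exact_two_sided(*counts))
```

With 1800 tests, some correction for multiple testing would usually be considered; a Bonferroni threshold, for example, would be 0.05/1800, or about 2.8e-5.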

ADD COMMENT • link 8.2 years ago by abascalfederico ★ 1.2k