What Is The Importance Of The Hardy Weinberg Equilibrium In Gwas?
3
17
Entering edit mode
11.7 years ago
Lara Endres ▴ 180

I´d like to understand what the hardy weinberg equilibrium represents in GWAS. What does it means to say that a SNP is not in equilibrium?

gwas snp • 27k views
ADD COMMENT
1
Entering edit mode

This is a case where the Wikipedia entry is well worth reading.

ADD REPLY
0
Entering edit mode

There is a nice recent discussion in Reddit about why HW is used even if it never occurs in reality. It's not exactly what you were asking for, but I recommend you to read it. Anyway, basically HW is used to determine if a genotype can be a sequencing error, or to infer population structure.

ADD REPLY
0
Entering edit mode

Thank you Giovanni, I liked the discussion, it'ss really helpful.

ADD REPLY
8
Entering edit mode
11.7 years ago
Ryan D ★ 3.4k

Under Hardy-Weinberg assumptions, allele and genotype frequencies can be estimated. When the ratios of homozygous and heterozygous genotypes significantly differ from the prediction under HWE assumptions, it can indicate genotyping errors, batch effects, population stratification, or--much more rarely--association. Since the last of those is least likely, typically departure from HWE is an indicator that a marker should be discarded. I find this is typically true unless adjacent markers in LD with one another both violate HWE.

Principal components are the most common means by which to correct population stratification or some batch effects. You can also examine LD between the markers you would otherwise discard for violating HWE to see if you might accidentally discard markers that are legitimately associated and do not indicate the other types of error present.

ADD COMMENT
1
Entering edit mode

Just to clarify, if other SNPs expected to be in high LD with your SNP of interest also display deviations from HWE, this would mean it is likely to be a true association and not a genotyping (or other) error, correct?

ADD REPLY
2
Entering edit mode

I would say yes, if several SNPs are in high LD (say r2>0.8 or 0.9) and all out of HWE, you can typically assume it is a legitimate association. The chance of having 3 or more SNPs in high LD fail HWE checks because they all genotype badly is pretty low. The only other way this could happen--that I can think of--is if all of these SNPs were in a copy number variable region.

ADD REPLY
0
Entering edit mode

Okay great. Thanks!

ADD REPLY
5
Entering edit mode
11.7 years ago

You looked up the Hardy-Weinburg Equilbriaum equation, right? And how it predicts all the possible genotypes should be in particular ratios?

In general, if you see an allele out of HWE, it might be under selection. Or, you might have a consistent genotyping error. I think those are the biggest worries with that kind of data.

ADD COMMENT
1
Entering edit mode
11.7 years ago
Lara Endres ▴ 180

I couldn't understand it, cause I thought that, if a SNP differs from the prediction under HWE, that difference is probably caused by an association, so why discard it? But I see now.. good answers, I understood! Thank you both!

ADD COMMENT
0
Entering edit mode

Hope it was helpful. Don't forget to select a "correct" answer if you feel it was useful. Principal components, BTW, can be very powerful for correcting for a number of errors. Learning how to use them--especially if your GWAS may have those issues (batch issues like date, genotyping center, sample type, genotyping platform, or population stratification) can save you from publishing a terrible paper.

ADD REPLY
0
Entering edit mode

Could you explain how to use PCA for this purpose, please?

ADD REPLY

Login before adding your answer.

Traffic: 2563 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6