Question

Math behind association with PLINK

0

Entering edit mode

3.9 years ago

Will ▴ 20

Hi, which is the mathematical formula behind the --linear association used by plink ?

plink association gwas • 1.3k views

ADD COMMENT • link updated 3.1 years ago by jason.taotaotan ▴ 10 • written 3.9 years ago by Will ▴ 20

1

Entering edit mode

Please add some detail on what you're tried on your own to understand this. Have you read the plink paper(s)? Do you have a specific question? Did the papers/the manual mention anything about the --linearoperation?

ADD REPLY • link 3.9 years ago by Ram 44k

0

Entering edit mode

Hi, yes I read the plink paper (link), but in the section "association" I didn't understand when exactly talks about the --linear option. Because it talks about tests in general with a wide variety of formula. I can't be able to link the --linear option with the respective formula.

ADD REPLY • link 3.9 years ago by Will ▴ 20

score 0 · Answer 1 · 2021-01-14

0

Entering edit mode

3.9 years ago

Kevin Blighe 88k

The most basic association test is just a Chi-squared test on a 2 x 2 contingency table of the minor allele tallies, as to which I elaborate here: A: SNP dataset and Z Score

Any other test, such as linear / logistic regression, family-based tests, etc., are a mixture of again using minor allele tallies or genotypes encoded categorically (REF, HET, HOM) with different assumptions about inheritance patterns.

Perhaps focus on the mathematics of these specific tests outside of PLINK as opposed to finding the exact formulae within the PLINK documentation itself. PLINK just re-uses already-published statistical tests.

ADD COMMENT • link 3.9 years ago by Kevin Blighe 88k

0

Entering edit mode

Hi Kevin. I think we often see three genotypes (AA, Aa, aa), in which case we should have a 2*3 contingency table for a case-control study. Could you please explain how to do a chi-square test for this? My understanding is that for a 2*3 contingency matrix, the degree of freedom should be 2. However Plink still uses 1 df, which confuses me.

ADD REPLY • link 3.1 years ago by jason.taotaotan ▴ 10