Entering edit mode
10.9 years ago
RawRaw
•
0
Do we need to impute the missing values before finding the significant markers in genome wide association studies? Specifically, using PLINKs permutation tests to find the adjusted p-values that state the significant markers. If not, how does PLINK handles the missing values?
" I guess PLINK will treat your missing genotypes as missing data, both in the association analysis and in the permutations." How, and why not imputing them?
Nobody is stopping you from imputing with other software. But the chi-square/Fisher's exact tests used by PLINK's basic association analysis commands work just fine with (not-terribly-biased) missing data.
Note that PLINK 1.9's --assoc and --model permutation tests are hundreds to thousands of times faster than PLINK 1.07's.
@chrchang523 So, PLINK just neglects the missing SNPs? This means the numerical perturbation caused be the missing SNPs is negligible and could be discarded in MaxT permutation tests (assuming that there are very few missing SNPs, and not-terribly-biased).
Since PLINK 1.9's permutation procedures extend work by Brian Browning (PRESTO) and Roman Pahl (PERMORY), it would be much, much faster than 1.7s.