Does anyone have a numerical example on how the EM algorithm can be used to determine haplotype frequencies from genotype frequencies? I have searched a lot with Google, and I just can't find a single numerical example out there. Thank you.
Does anyone have a numerical example on how the EM algorithm can be used to determine haplotype frequencies from genotype frequencies? I have searched a lot with Google, and I just can't find a single numerical example out there. Thank you.
tommy.carstensen The expectation is explained on the Linkage Dis-equilibrium wiki page
http://en.wikipedia.org/wiki/Linkage_disequilibrium
Take a close look at the example:
Example: Human Leukocyte Antigen (HLA) alleles
specifically:
and the estimated frequency of haplotype xy is
If this doesn't answer your question I will attempt to explain it in other terms.
I found a paper pointing me in the direction of an answer to my question: "Linkage Disequilibrium Between Loci With Unknown Phase" http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2710162/pdf/GEN1823839.pdf
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Is the end goal to associate a phenotype to a haplotype? If so, BEAGLE has auxiliary scripts to do this. Do you have phased data? Can you provide a little more background?
Hi Zev. The end goal is to understand, how PLINK, WDIST and other software packages estimate haplotype frequencies from genotype data using the EM algorithm cf. http://pngu.mgh.harvard.edu/~purcell/plink/ld.shtml
Can I close this question as nobody seems to know the answer?
tommy.carstensen http://www.slideshare.net/awais77/measures-of-linkage-disequilibrium-10238151. These slides have an empirical example: slides 14-16. "calculating linkage disequilibrium R squared" - google search.
Thanks. That is an example with haplotypes. I am interested in how the haplotype frequencies are calculated from the genotypes with the EM algorithm.