After providing answers to over 100 questions here, I now have one of my own. Actually, this is a two-part question. What tool(s) do you use to calculate genetic heterogeneity from SNP genotype data collected across an entire chromosome or genome? If the measure of heterogeneity is at or near zero, then the individual (human, animal, plant) is a product of inbreeding. This number will rise as the parents come from increasingly divergent genetic backgrounds.
That then brings up the second part of the question. For those of you who have looked into such measures of genetic diversity or heterogeneity, how useful is this and what kinds of values can I expect from the human genome-wide SNP genotypes I have? A preliminary and crude analysis gave me 69% of SNPs across chromosome 7 as homozygous, but that value rises to 92+% across two small HLA loci. That seems interesting but I don't know where to go with this.
Thanks in advance for any insight, advice.
I know that without any doubt. My feeling is the MHC will have high homogeneity. I'm interested in any tools that can do the calculations across any range of input SNPs, provided those are in genome order, and I'm curious of others' experiences with these calculations. Thanks, Al.
Hi Larry, I am not sure if comparing the HLA loci to the genome as a whole is a fair comparison. It would be more interesting to compare to other loci in the MHC which are likely have have undergone similar historic selection.
I think that's quite odd this level of homozigosity in a populational sense. Are you using the entire HapMap? How many haplotype blocks?
I have genotype data for an individual across the entire genome. So, I could look at heterozygosity vs homozygosity (or rates of heterogeneity) for that individual across a chromosome or gene region or region of any size.
As a follow-up question: are there ways to also quantify heterogeneity from RNA-seq (or transcriptomics) data?