Hi, I have a .vcf containing 1 million SNPs for a subset of a population (80 individuals) and what I would like to do is select SNPs that are informative for building pedigree and then redesign a probe set to target those SNPs for the rest of the population.
I am not quite sure how to determine which SNPs out of the 1 mil are actually kinship informative. I do know parent-offspring and full/half sib relationships for this subset of individuals but will need to be able to build a pedigree out to 4th degree relatives for the entire population. Does anyone have any suggestions on a good approach for selecting these SNPs? Would it be something similar to how I would select ancestry informative (which I have already mined out of this SNP dataset)?
Thanks!
Thank you, this is very helpful. I have the .ped and .map files already and will start to learn to do this in PLINK today.
One of my concerns is exactly what you stated about the relatedness measures being different. We know we are dealing with a high level of inbreeding. It will be interesting to see what we get from the PLINK analysis.