Hi all,
I am very interested in ancestry inference.
- autosomal ancestry I've followed the tutorial from https://github.com/pcgoddard/Burchardlab_Tutorials/wiki/ADMIXTURE#popfile.
However, decision on cluster information K is quite confusing.
For example, there are already 12 populations among my data. How can I train these data so that I can fully separate the populations.
- Y haplogroup inference yhaplo https://github.com/23andMe/yhaplo/ offered by 23andme can do this, however, the ISOGG in this repo is too old. And the author did not update them more. So I am wondering if there is any other similar tool that can predict the haplogroup as well as building the ISOGG tree myself.
Junfeng
Since you are already using admixture software, check out its manual: http://www.genetics.ucla.edu/software/admixture/admixture-manual.pdf
I am not sure why the link is not opening, probably some updation is going in the server where it is stored, but, in the document, an explanation has been provided on how to select cluster information K through estimation of cross-validation error.