Hi everyone,
I am computing Watterson's theta for the first time, using ANGSD. As I understand it, this value should lie between 0 and 1. However, ANGSD uses a likelihood approach to estimate it, and these are the results I get when I compute theta on Chr22 from a single BAM file:
(indexStart,indexStop)(firstPos_withData,lastPos_withData)(WinStart,WinStop) Chr WinCenter tW tP tF tH tL Tajima fuf fud fayh zeng nSites
(0,32860927)(16050032,51244307)(0,51244307) 22 25622153 31780.907015 31780.907015 31780.907015 31780.907015 31780.907015 -nan -nan -nan -nan -nan 32860927
So what I am interested in is tW (Watterson's theta), but this value makes no sense to me. I think I have to divide it by nSites, which would give tW = 0.0009671336118. Does it make sense to do that?
I also have the per-site results on a log scale, which look like this:
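For what it's worth, here is a rough Python sketch of that division. It assumes the whitespace-separated window line shown above, with tW as the fourth field and nSites as the last one; the function name `per_site_theta` is made up for illustration and is not part of ANGSD.

```python
def per_site_theta(window_line: str) -> float:
    """Divide the window-level tW by nSites for one line of the window output."""
    fields = window_line.split()
    # Assumed layout: fields[0] = the (index)(pos)(window) triple, fields[1] = Chr,
    # fields[2] = WinCenter, fields[3] = tW, ..., fields[-1] = nSites.
    t_w = float(fields[3])
    n_sites = int(fields[-1])
    if n_sites == 0:
        raise ValueError("window has no sites with data")
    return t_w / n_sites


line = (
    "(0,32860927)(16050032,51244307)(0,51244307) 22 25622153 "
    "31780.907015 31780.907015 31780.907015 31780.907015 31780.907015 "
    "-nan -nan -nan -nan -nan 32860927"
)
print(per_site_theta(line))
```

On the line above this reproduces the per-site value of roughly 0.000967 quoted in the post.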
#Chromo Pos Watterson Pairwise thetaSingleton thetaH thetaL
22 16050032 -7.634670 -7.634670 -7.634670 -7.634670 -7.634670
22 16050033 -7.634670 -7.634670 -7.634670 -7.634670 -7.634670
22 16050034 -7.634670 -7.634670 -7.634670 -7.634670 -7.634670
22 16050035 -7.634670 -7.634670 -7.634670 -7.634670 -7.634670
22 16050036 -7.634670 -7.634670 -7.634670 -7.634670 -7.634670
Is it better to compute the overall Watterson's theta from those values? If so, should I still sum them and divide by the total number of sites?
Sorry for the newbie questions, I am new to this field. Thanks a lot.
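A minimal sketch of the per-site route, in Python. It assumes the per-site rows are whitespace-separated as shown above, with the Watterson column third and the values stored as natural logs, so each one is exponentiated before summing. The helper name `mean_theta_from_log` is hypothetical.

```python
import math


def mean_theta_from_log(lines):
    """Average per-site Watterson theta from log-scale per-site rows."""
    total = 0.0
    n_sites = 0
    for line in lines:
        # Skip blank lines and the '#Chromo ...' header.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split()
        log_tw = float(fields[2])    # assumed: third column is log(Watterson)
        total += math.exp(log_tw)    # back to the linear scale before summing
        n_sites += 1
    return total / n_sites if n_sites else float("nan")


rows = [
    "#Chromo Pos Watterson Pairwise thetaSingleton thetaH thetaL",
    "22 16050032 -7.634670 -7.634670 -7.634670 -7.634670 -7.634670",
    "22 16050033 -7.634670 -7.634670 -7.634670 -7.634670 -7.634670",
    "22 16050034 -7.634670 -7.634670 -7.634670 -7.634670 -7.634670",
]
print(mean_theta_from_log(rows))
```

Note that this divides by the number of rows read, which may differ from the nSites column of the window output if some positions are missing from the per-site file.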
WIKI: In population genetics, the Watterson estimator is a method for describing the genetic diversity in a population. So what does it mean at the level of a single genomic coordinate? Maybe the per-site theta is each site's contribution. How does it vary across a chromosome?
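For reference, the textbook form of the estimator is the number of segregating sites S divided by the harmonic number a_n = 1 + 1/2 + ... + 1/(n-1), where n is the number of sampled sequences. Below is a small Python sketch of that classical formula, with an optional division by sequence length to get a per-site value. This is only the count-based definition, not the likelihood-based estimate that ANGSD reports, and the function name and parameters are my own.

```python
def watterson_theta(segregating_sites: int, n_sequences: int, n_sites: int = 1) -> float:
    """Classical Watterson estimator: S / a_n, optionally per site."""
    if n_sequences < 2:
        raise ValueError("need at least two sequences")
    if n_sites < 1:
        raise ValueError("n_sites must be positive")
    # a_n is the (n-1)-th harmonic number.
    a_n = sum(1.0 / i for i in range(1, n_sequences))
    return segregating_sites / a_n / n_sites


# Hypothetical numbers: 3 segregating sites among 3 sequences over 1000 sites.
print(watterson_theta(3, 3, n_sites=1000))
```

Under this definition each segregating site adds 1/a_n to the total, which is one way to read a per-site theta as that site's contribution.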
I think that this way they use a prior, knowing the true reference site? I have no clear idea of it either, but this is how I understand it. This is how the value fluctuates, which doesn't really make sense to me either:
Hello,
I'm facing the same problem: I have the ANGSD output and I was wondering how to get Pi and theta from the results. Trying to take the average gives me a -inf result. Have you found a solution?
Many thanks
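One possible cause of the -inf, assuming the per-site values are logs and at least one site has a log value of -inf (i.e. a theta of zero): averaging the logs directly propagates the -inf, whereas exponentiating first turns it into a harmless 0. A tiny Python illustration with made-up values:

```python
import math

# Made-up per-site log-theta values; the middle site has theta = 0.
log_thetas = [-7.634670, float("-inf"), -7.634670]

# Averaging on the log scale: a single -inf makes the whole mean -inf.
naive_mean = sum(log_thetas) / len(log_thetas)

# Averaging on the linear scale: exp(-inf) is 0.0, so the mean stays finite.
linear_mean = sum(math.exp(v) for v in log_thetas) / len(log_thetas)

print(naive_mean)   # -inf
print(linear_mean)
```

If the -inf comes from somewhere else (for example a parsing problem), this will not explain it, so it is worth checking the input for -inf entries first.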