vcftools nucleotide diversity statistic (pi)
1
2
Entering edit mode
6.7 years ago
modi2020 ▴ 40

Dear fellows:

I know that Nei's Pi (nucleotide diversity statistic) is calculated per site using sequences belonging to more than one individuals. However, in vcftools, it seems like they can be calculated on a per individual bases (https://sourceforge.net/p/vcftools/mailman/message/29923488/). In another post by vcftools, it was defined as "the average number of differences between a pair of chromosomes" (https://sourceforge.net/p/vcftools/bugs/42/) which means to me that only one individual is enough for its calculation. We plan to use this method in a paper we are about to write since we don't have more than one individual per breed. Does anyone know the formula used in vcftools or if it is at all valid to calculate Pi in this way?

I would really appreciate your kind advice on this.

vcftools • 8.5k views
ADD COMMENT
2
Entering edit mode
6.7 years ago
caw5cv ▴ 20

Nucleotide diversity is a population-level metric, the average number of differences between a pair of chromosomes, across all chromosome combinations within the population. If you only have one individual, maybe you're looking to report heterozygosity instead?

ADD COMMENT
0
Entering edit mode

Thank you so much caw5cv. I realize that its a population level metric. However, it seems like in vcftools its calculated on a per individual basis for SNPs in non-overlapping windows. Is that still useful if one had SNPs called on a single genome.

ADD REPLY

Login before adding your answer.

Traffic: 2516 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6