vcftools and SnpSift give different values for Ts/Tv ratio
1
2
Entering edit mode
9.8 years ago
ethan.kaufman ▴ 380

Hi all,

I'm trying to calculate the transition/transversion (Ts/Tv) ratio for my callset. Simple, right? However, I'm frustrated by the fact that two common vcf parsing tools (vcftools and SnpSift) give different results for this metric. Consider the following simple case:

8 variants: 4 Ts, 4 Tv. However, one of the Ts's and two of the Tv's are called homozygous. The rest are called hets.

Vcftools ignores the genotype information. Ts count: 4. Tv count: 4. Ts/Tv ratio = 1.

SnpSift counts homozygous variants twice. Ts count: 5. Tv count: 6. Ts/Tv ratio = 0.83

My feeling is that the genotype information shouldn't matter, since we want to count independent mutagenic events, and the two ALT alleles in a homozygous variant likely have a single mutagenic origin (given the slow rate of SNP evolution and the rarity of multiallelic SNPs). However, I've read a dozen papers on the subject and can't find a simple formula for calculating Ts/Tv in any of them. Is there any consensus on this?

Thanks

variant SNP vcf transversion transition • 5.0k views
ADD COMMENT
0
Entering edit mode
3.5 years ago
davidc • 0

I have noticed that bcftools stats counts the variant even when the genotype is 0/0 which it should not given that in that case the variant is listed in the VCF but not present. I would tend to agree therefore with SnpSift

ADD COMMENT

Login before adding your answer.

Traffic: 2538 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6