Snp Quality Distribution Peaks At 222 From Variant Call Pile
2
0
Entering edit mode
12.3 years ago
Juliofdiaz ▴ 140

I have reference mapped paired end illumina reads and called variants using BWA and Samtools respectively. The resulting vcf was treated to remove high coverage SNPs with vcfutils.pl varFilter -D30, and then filtered for low quality SNPs using awk '($3=="*"&&$6>=50)||($3!="*"&&$6>=20)'. I graphed the distribution of SNP quality and observed a huge peak at 222., I repeated it with other samples and observed the same peak. Any clues as to why I may be seeing this?

bwa samtools • 2.5k views
ADD COMMENT
1
Entering edit mode
9.5 years ago

If you do one sample at a time, 222 is the max quality allowable. So most of your called SNPs are of high quality.

ADD COMMENT
0
Entering edit mode
9.5 years ago
Marand • 0

I am observing something similar in my own data. Has anyone figured this out? The rest of the distribution is normal except for this massive peak (QUAL=222).

ADD COMMENT
0
Entering edit mode

I should also mention that I filtered with vcfutils varFilter -d 5 -D 25... mapping and snp calling were executed with the same software as Juliofdiaz

ADD REPLY

Login before adding your answer.

Traffic: 2674 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6