Entering edit mode
7.6 years ago
zhangdezhi008
▴
20
Dear all, I used samtools for SNPs calling and vcftools for SNPs filtering. I got a vcf file with a lot SNPs.How to interpret the genotypes when DP=1? In my opinion, DP=1 means that this site has only one read, it could be homozygous 0/0 or 1/1, but how can it be a heterozygous 0/1?
The following is what I have observed in my vcf file.
Thanks for your attentions! Please help!
GT:PL:DP:SP:GQ 0/0:0,3,36:1:0:4
GT:PL:DP:SP:GQ 0/1:0,3,36:1:0:4
GT:PL:DP:SP:GQ 1/1:0,3,36:1:0:4
It can be heterozygous if you find one read with the alternative base (instead of the reference base). But do you trust these DP=1 calls? I mean 1 read is pretty minimal.
Edit: I see your point, you mean with one read of the alt allele it could be both homozygous or heterozygous.
Thanks for your reply. I will not trust the lower DP SNPs. How can one read possesses an alternative base? I also have another question, how can we know the depth for each allele for a heterozygous site?
I am not sure about your method, I never used vcftools for this. When using varscan after samtools, I get more info than you get: e.g.,
With other meaning for DP as well:
DP4 field has information about reads that support: reference positive strand, reference negative, alternative positive, alternative negative
But check out documentation if it's in that order