Entering edit mode
11.1 years ago
cvu
▴
180
I've generated vcf file using samtools mpileup. I understood all the fields in vcf file, except INFO column.
DP=55;VDB=2.813188e-01;AF1=1;AC1=2;DP4=0,0,15,15;MQ=44;FQ=-117
From this I can interpret DP, MQ and FQ, but what are VDB, AF1, AC1, Dp4?
Can anyone explain to me?
I've changed the category of this post from 'Blog' to 'Question', because it is actually a question and not a blog post. Have a look at the VCF format specifications to have an idea on how to interpret the INFO field. You should also look at how these elements are defined in the header of the VCF file itself.
sorry to ask basic
please explain me following things
AF1: Max-likelihood estimate of the first ALT allele frequency (assuming HWE)
DP4=high-quality ref-forward bases, ref-reverse, alt-forward and alt-reverse bases
VDB=Variant Distance Bias (v2) for filtering splice-site artefacts in RNA-seq data
PV4=P-values for strand bias, baseQ bias, mapQ bias and tail distance bias
Which parts of those don't you understand?
How samtools count values for AF1 Ac1 VDB PV4 DP4?
By "count" do you mean "calculate"? Some of this is described in a PDF from Heng Li. VDB is described schematically in this PDF. You should be able to find anything else with google.
i want know how significant they are when finding SNPs in human for clinical purpose ?
Well, it's probably a good idea to perform some filtering according to them and check if any SNPs of particular interest show any particular bias. That's basically the point of these various metrics.
thanks Devon Ryan !!