I have an annotation file in VCF format for an insertion at a particular position on a human chromosome; my inquiry focuses on interpreting some of following terms in determining/indicating how real/probable a variant is from sequencing before confirming with assays. (I apologize in advance for any misconceptions that I have in using any terms.)
GT:GQ:DP:PL:AD (order of terms in file)
where GT = genotype, GQ = genotype quality, DP = read depth, PL = phred-scaled genotype likelihoods, AD = allelic depth
For example, suppose that my file has the following entry (with the organization as described above) for an individual:
0/1:99:47:624,0,503:26,21
For AD, if there is not a great disparity between the reference and alternate values (last two numbers), doe this indicate that the variant can really exist for the particular person or is more "real"?
Thanks for any help.
Thank you for answering.
click on accept answer if you found the answer satisfactory :-)