Hello, I want to estimate allele frequency at different read depth (10X, 20X, 30X, and so on). I have variants call output in vcf format. I also estimated read depths for all SNPs. I am thinking to subsets the vcf at different depth for e.g. 10X, 20X etc. I am not sure if this is a correct way to proceed. Is there any better way to estimate allele frequency at different depths? Any suggestion will be appreciated. Thanks
Is it not that the sequencing depth affects the counts of ref allele and alternate allele? I think the depth could affect the VAF.
If you compare below variants,
So u see VAF for all three cases is 0.5 even if depth are different.
However for variant2, confidence is stronger than variant1 and 3 due to higher depth.
So when you apply different depth filter, basically you are segregating the variants in different confidence bins (higher depth, higher confidence)
Hope it was useful.
Thank you. How about the minor allele frequency? Can we still detect with low sequence depth?
One educational note, if you find any answers/suggestion useful, in this forum, you should upvote. This way the answer gets highlighted. So in future if someone wants to know about it, they can directly look into the right answer.
Coming to your query, yes we can detect MAF at low sequencing depth but higher is better to gain confidence on the variant call. What low depth you are expecting.
You can find many articles on google
you can refer some papers which work in similar domain.
you can search for MAF in this forum.