Entering edit mode
8.8 years ago
hermathena
▴
40
Dear All,
I extracted some samples from a larger VCF (one population of a species). I would like to filter the sites by quality and depth. However, the values in the new VCF still seem to reflect the data from the bigger VCF (i.e. the average depth was calculated for all samples, not just the extracted ones). Do any of the usual tools (GATK, bcftools, etc) offer a way of recalculating site quality or the INFO fields for the current subset of samples?
Many thanks
Besides the issue of "software that can do that", there is the issue of whether the data to do so are supplied with the VCF samples. What format fields are present for the genotype records?
If you have GL/PL, some versions of bcftools can do this.
Indeed, the FORMAT fields available are AB, AD, DP, GQ, GT, MQ0, PL.
Which version of bcftools, please?
what is the command?