Hello,
I have short reads from one non-model genome mapped against a very closely related genome assembly and want to examine variantion. I did standard variant calling, filtering (for reasonable depth, QUAL>50, MQ>55). How is it possibe that variants that passed all those filters look that bad (see the screenshot)?? Those are clearly mismapping reads (and artifactual variants). Why does the MQ filter fail? This is driving me crazy, I'd appreciate any suggestions.
Interesting suggestions, thank you! I think this variant is only supported by mismapped reads. Might finally try to understand the MQ formula (https://genome.sph.umich.edu/wiki/Mapping_Quality_Scores), but I thought high MQ is impossible with so many differences - unless it force maps reads to the reference and scales MQ relative to other possible alignments??