Hello,
Following the GATK Best Practices (link), I'm using the MarkDuplicates tool in my somatic variant calling pipeline. When I run the pipeline on a targeted-sequencing sample (600 genes, Illumina), the duplication rate is really high:
In this case, is it safe to use MarkDuplicates, or am I losing a lot of informative reads, which would affect the certainty of the called somatic variants?