Hi folks,
I am trying to map reads to a set of metagenomes where I know that certain genes are duplicated but I cannot remove some of the duplication even with dedupe.sh. In order to improve detection and accurately represent my mapping, I want to find all sites where each read maps at the highest mapping quality. In other words, if the read mapped to a location A and B at MAPQ=30 and mapped to location C at MAPQ=25, I'd want to only report A and B mapping.
From my understanding, the BBMap ambiguous=all parameter would output A, B, and C mapping (as long as they are all in the clearing zone). Would ambiguous=best be a better fit for my purpose?
Please let me know if there is a better option for this!
Thanks!
GenoMax Brian Bushnell