Hi everybody,
From a mapping file, I used the fonction to assess my duplicates rates
picard-tools MarkDuplicates I=input.bam O=output.bam M=marked_dup_metrics.txt
I would like to creates 2 fastq files (paired end reads) without those duplicates and keep mate as possible.
Should I use samtools to keep only non duplicates paired reads ? what would be the flag/command ?
Thanks for your help.
If you have not done so already then run
MarkDuplicates
withREMOVE_DUPLICATES=TRUE
option. Then @Medhat's methods mentioned below will work.