Picard vs samtools rmdup
7.6 years ago
xd_d ▴ 110

Hey all,

I want to remove duplicates from my BAM file.

I use Picard MarkDuplicates with REMOVE_DUPLICATES=true to remove them.

After running Picard to "remove all duplicates", I still found reads in the BAM file that are flagged as duplicates, as well as duplicate clusters that were not removed. I thought Picard removes all reads that are flagged as duplicates?

That's why I then used samtools rmdup in paired-end mode. It removes more reads than Picard. But why?

I thought that by using Picard I would remove all duplicates (optical and PCR).

I'm confused.

RNA-Seq Samtools Picard • 12k views

Post the exact commands and the samtools flagstat output before and after removing duplicates.

I've posted my problem below :)

I'll check this out. Next time I'll post the Picard case where not all duplicates are really removed.

I'll post it next time.

In the end, I want unique reads with unique coordinates.

Also post examples of remnant duplicates.
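
For anyone wanting to collect such examples, here is a minimal Python sketch (not something posted in this thread) that picks out records whose FLAG still has the duplicate bit (0x400 in the SAM specification) set. It assumes the input is SAM text, for example the output of `samtools view -h`; the function name `flagged_duplicates` and the example records are made up for illustration.

```python
DUPLICATE_FLAG = 0x400  # SAM FLAG bit: read is a PCR or optical duplicate


def flagged_duplicates(sam_lines):
    """Yield SAM records whose FLAG field has the duplicate bit set."""
    for line in sam_lines:
        if line.startswith("@"):  # skip header lines
            continue
        fields = line.rstrip("\n").split("\t")
        if int(fields[1]) & DUPLICATE_FLAG:  # column 2 is FLAG
            yield line.rstrip("\n")


# Hypothetical records: r2 carries flag 1123 = 99 + 1024 (duplicate bit set).
example = [
    "@HD\tVN:1.6\tSO:coordinate",
    "r1\t99\tchr1\t100\t60\t50M\t=\t200\t150\t*\t*",
    "r2\t1123\tchr1\t100\t60\t50M\t=\t200\t150\t*\t*",
]
dups = list(flagged_duplicates(example))
```

If `dups` is non-empty for a file that was supposed to have its duplicates removed, those records are the ones worth posting.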

Define unique coordinates further. Only one read covering every base or a read mapped starting at each base position?

My unique coordinates: only the start position should be unique. Is there a program to get these reads from BAM files? I know I lose the paired-end information, but that is not important for my next step.

I think a very similar question was recently asked here. Let me see if I can find that thread.

Thank you! Later I'll post the Picard results where duplicate reads are not removed.

Thanks! I used awk to get unique start positions :)
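
The awk command itself was not posted, so the following is only a rough Python sketch of the same idea, under stated assumptions: the input is SAM text, "start position" means the leftmost mapping position (the POS column), strand and mate information are ignored, and the first read seen at each position is the one kept. The function name `unique_start_positions` and the example records are hypothetical.

```python
def unique_start_positions(sam_lines):
    """Keep header lines plus the first mapped record per (RNAME, POS)."""
    seen = set()
    kept = []
    for line in sam_lines:
        line = line.rstrip("\n")
        if line.startswith("@"):  # pass header lines through unchanged
            kept.append(line)
            continue
        fields = line.split("\t")
        flag, rname, pos = int(fields[1]), fields[2], fields[3]
        if flag & 0x4:  # skip unmapped reads, they have no start position
            continue
        key = (rname, pos)
        if key not in seen:  # first read at this reference/position wins
            seen.add(key)
            kept.append(line)
    return kept


# Hypothetical records: r1 and r2 share chr1:100, r4 is unmapped.
records = [
    "@SQ\tSN:chr1\tLN:1000",
    "r1\t0\tchr1\t100\t60\t50M\t*\t0\t0\t*\t*",
    "r2\t0\tchr1\t100\t60\t50M\t*\t0\t0\t*\t*",
    "r3\t16\tchr1\t150\t60\t50M\t*\t0\t0\t*\t*",
    "r4\t4\t*\t0\t0\t*\t*\t0\t0\t*\t*",
]
result = unique_start_positions(records)
```

Note that for reverse-strand reads POS is the leftmost aligned base rather than the 5' end of the read, so a strand-aware version would need to account for that.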

7.6 years ago
igor 13k

This previous thread about the exact differences between Samtools and Picard duplicate removal might be helpful: Picard MarkDuplicates and SamTools rmdup algorithm documentation

Also, this really old thread: http://seqanswers.com/forums/showthread.php?t=5424

7.6 years ago
xd_d ▴ 110

I'm starting a new thread, because the Picard issue is a separate topic.
