Question

Remove duplicates in featurecounts

0

Entering edit mode

11.1 years ago

lkmklsmn ▴ 980

Hi Biostars,

I used the featureCounts function from the Rsubread package in order to count reads. The first time I used my 'regular' bam files. The second time I used Picard Tools to mark duplicates in each bam file and used this data as substrate. I am getting the same number of counts in both cases using featureCounts. I was wondering if it is possible to tell featureCounts to exclude duplicates from the counting. Is there an option doing it within Rsubread or do I have to actually remove (as opposed to mark) the duplicate reads?

Thanks

featureCounts Picard RNA-Seq Rsubread • 5.0k views

ADD COMMENT • link updated 3.7 years ago by Ram 45k • written 11.1 years ago by lkmklsmn ▴ 980

Ram · Answer 1 · 2014-05-21

2

Entering edit mode

11.1 years ago

dbpzdbpz ▴ 220

The new version of subread (1.4.5) will provide a function to ignore reads or fragments that have the 0x400 flag (the duplicate read flag). This version should be released in days.

ADD COMMENT • link 11.1 years ago by dbpzdbpz ▴ 220

0

Entering edit mode

Sometimes in reserach "in days" can turn into weeks, months etc.

Do you know of any official release date for 1.4.5?

ADD REPLY • link 11.0 years ago by lkmklsmn ▴ 980

0

Entering edit mode

Sorry that it was delayed for weeks. The 1.4.5 version of subread (inc. featureCounts) was released yesterday on sf.net.

http://sourceforge.net/projects/subread/

ADD REPLY • link updated 5.7 years ago by Ram 45k • written 11.0 years ago by dbpzdbpz ▴ 220

Ram · Answer 2 · 2014-05-19

0

Entering edit mode

11.1 years ago

Chris Fields ★ 2.2k

It doesn't look as if this is supported, though technically it shouldn't be hard to add (they already parse read bit flags and attributes). Maybe ask the authors? Mailing list is here:

https://groups.google.com/forum/#!forum/subread

ADD COMMENT • link updated 5.4 years ago by Ram 45k • written 11.1 years ago by Chris Fields ★ 2.2k