Question

mistakenly ran featureCounts in paired-end mode on single-read data

0

5.6 years ago

clboozy • 0

I am looking through an old pipeline that was run over a year ago in preparation for submitting data to GEO. I discovered that although the sequencing in this experiment was single-read (vs. paired-end), I had run featureCounts in paired-end mode (with a parameter of -p). According to the featureCounts documentation, the -p flag has the following definition: "If specified, fragments (or templates) will be counted instead of reads. This option is only applicable for paired-end reads." Did adding this parameter by mistake affect the run at all? Or did it not matter as all samples were single-read anyway?

RNA-Seq featureCounts • 6.2k views

ADD COMMENT • link updated 17 months ago by DareDevil ★ 4.3k • written 5.6 years ago by clboozy • 0

2

Have you submitted the data to GEO?

ADD REPLY • link 17 months ago by DareDevil ★ 4.3k

1

As far as I know, using -p on SE data doesn't affect the results. But you could quickly check by running featureCounts on any BAM file you have, with and without -p.

ADD REPLY • link 5.6 years ago by GouthamAtla 12k

1

Indeed, the scientist within you should run it with and without, and then cross-compare results.

ADD REPLY • link 5.6 years ago by Kevin Blighe 88k

0

Agreed! That's what I would have done if I still had access to the bam files... unfortunately, I do not.

ADD REPLY • link 5.6 years ago by clboozy • 0

0

Cool. I would have hoped that featureCounts issued a warning message, at least (?). Keep in mind that these counting methods are fairly rudimentary: one can count reads per feature using BEDTools or custom scripts, if one wishes.

ADD REPLY • link 5.6 years ago by Kevin Blighe 88k

0

Except that featureCounts is blazingly fast and comes with tons of options.

ADD REPLY • link 5.6 years ago by Friederike 9.0k

0

Thanks! Unfortunately, I don't have access to any SE bam files currently.

ADD REPLY • link 5.6 years ago by clboozy • 0

score 1 · Answer 1 · 2019-05-08

As geek_y and Kevin Blighe said, test for yourself.

Even if you do not have access to the original BAM files, you can easily grab single-end FASTQ files (in which case you will have to align them) and/or BAM files, and run featureCounts twice, with and without -p. The result of this test will tell you whether your original counts are correct.
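To make the comparison concrete, here is a minimal sketch in Python of how the two output tables could be checked against each other. It assumes the standard featureCounts output layout (a leading `#` comment line, then a header with Geneid, Chr, Start, End, Strand, Length, followed by one count column per BAM file) and default integer counts. The file names in the comments and the helper names `read_counts` and `diff_counts` are hypothetical, chosen for illustration only.

```python
import csv
import io

# Hypothetical runs that would produce the two tables being compared:
#   featureCounts    -a annotation.gtf -o counts_no_p.txt   sample.bam
#   featureCounts -p -a annotation.gtf -o counts_with_p.txt sample.bam


def read_counts(handle):
    """Parse a featureCounts table into {gene_id: [count per sample]}."""
    # Drop the leading '#' line that records the program version and command.
    lines = (line for line in handle if not line.startswith("#"))
    rows = csv.reader(lines, delimiter="\t")
    next(rows)  # skip the header row
    # Columns 0-5 are annotation fields; the counts start at column 6.
    return {row[0]: [int(x) for x in row[6:]] for row in rows if row}


def diff_counts(a, b):
    """Return sorted gene IDs whose counts differ or that are missing from one table."""
    return sorted(g for g in a.keys() | b.keys() if a.get(g) != b.get(g))


# Tiny in-memory tables standing in for the two real output files.
HEADER = "Geneid\tChr\tStart\tEnd\tStrand\tLength\tsample.bam\n"
TABLE_NO_P = (
    "# Program:featureCounts\n" + HEADER
    + "geneA\tchr1\t1\t100\t+\t100\t12\n"
    + "geneB\tchr1\t200\t300\t-\t101\t7\n"
)
TABLE_WITH_P = TABLE_NO_P  # identical counts in this made-up example

without_p = read_counts(io.StringIO(TABLE_NO_P))
with_p = read_counts(io.StringIO(TABLE_WITH_P))
print(diff_counts(without_p, with_p))  # an empty list means the tables agree
```

With real data, replace the `io.StringIO` objects with `open("counts_no_p.txt")` and `open("counts_with_p.txt")`. Any gene ID in the returned list is one where -p changed the count.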

You don't want to submit potentially bogus results to GEO (with your name on it) based on "but this internet guy told me my counts were fine", do you?

score 1 · Answer 2 · 2019-05-08

I doubt it will matter. When you have paired-end reads, the software needs to understand that if it sees the same read name twice (once as read 1 and once as read 2) aligning to the same gene, it must not count those as two separate reads, since they came from one fragment.

That won't be a problem with a single-end dataset, where each read name appears only once.
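The reasoning above can be illustrated with a toy model. This is only a sketch of the read-versus-fragment idea, not featureCounts' actual implementation: alignments are represented as made-up `(read_name, gene)` pairs, and the functions `count_reads` and `count_fragments` are hypothetical names.

```python
from collections import defaultdict


def count_reads(alignments):
    """Count every alignment separately (read-level counting)."""
    counts = defaultdict(int)
    for _read_name, gene in alignments:
        counts[gene] += 1
    return dict(counts)


def count_fragments(alignments):
    """Count each distinct read name once per gene (fragment-level counting)."""
    names = defaultdict(set)
    for read_name, gene in alignments:
        names[gene].add(read_name)
    return {gene: len(seen) for gene, seen in names.items()}


# Paired-end: each fragment contributes two alignments sharing one read name.
paired = [
    ("frag1", "geneA"), ("frag1", "geneA"),
    ("frag2", "geneA"), ("frag2", "geneA"),
]

# Single-end: every read name occurs exactly once.
single = [("read1", "geneA"), ("read2", "geneA"), ("read3", "geneB")]

print(count_reads(paired), count_fragments(paired))
print(count_reads(single), count_fragments(single))
```

In this toy model the two counting modes disagree for the paired-end input (four reads, two fragments) but give identical results for the single-end input, which is the intuition behind the answer. It does not prove how a particular featureCounts version handles -p on single-end BAM files, so the empirical check is still worth doing.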