Rna-Seq Transcript Length Bias - Is There A 'Standard' Approach To Accounting For It
2
6
Entering edit mode
13.1 years ago
Travis ★ 2.8k

The tendency for longer genes (and their GO categories etc) to have a greater likelihood to be classified as differentially expressed in RNA-Seq is well known but I'm wondering what people commonly do to account for it? I am aware of a few methods out there (GOSeq, NOISeq etc) but I get the feeling that the most common approach is still to ignore it. Am I wrong in thinking this? One of the most commonly used RNA-Seq toolkits is still the 'Tuxedo Suite' and unless I am mistaken it does not account for length bias in any way.

I should point out that RPKM/FPKM as normalisation methods do not address this problem. Differential expression based on these values is more likely for longer transcripts/genes.

All comments welcome.

rna data next-gen sequencing • 5.2k views
ADD COMMENT
7
Entering edit mode
13.1 years ago
seidel 11k

FPKM is a normalized unit for describing a rate of observance, but it doesn't address the observation that in evaluating differential expression, longer genes are usually more likely to be classified as differentially expressed. FPKM alone doesn't address this. Bullard et al. (2010) put it this way:

"There is clearly a positive association between gene counts and length, an association that is not entirely removed via scaling by gene length, as in the RPKM" (from "Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments", http://www.biomedcentral.com/1471-2105/11/94)

I think they weight their t-statistics by gene length to try to correct for it. In my experience it is largely ignored at this point, and there is no standard approach taken by most people I talk to.

ADD COMMENT
2
Entering edit mode

so if it isnt accounted for, shouldnt it be? and what would be the ideally perfect method. Im not clear about why correcting for transcript length wouldnt fix this

ADD REPLY
2
Entering edit mode
13.1 years ago

I'm no expert... but I think that's the whole point of FPKM?

http://cufflinks.cbcb.umd.edu/howitworks.html#hqua

ADD COMMENT
0
Entering edit mode

Incorrect. See below...

ADD REPLY

Login before adding your answer.

Traffic: 1778 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6