How to summarize the expression of a gene when having expression data from different transcripts of that gene
21 months ago
jeni ▴ 90

Hi!

Imagine I have normalized expression values for two different transcripts of the same gene, but I am interested in studying expression at the gene level. What is the best way to get this? Would it be correct to take the mean of the expression levels of those two transcripts? What if one transcript is not expressed and the other is?

Thanks!

rnaseq expression • 1.0k views
21 months ago
ATpoint 85k

Typically gene-level summarization is done by summing the counts of the transcripts per gene. Check the tximport package over at Bioconductor.

tximport does something a bit more clever than simply summing up (it factors transcript length into consideration).

If you're given TPMs, you can simply sum those up.
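As a rough illustration of the summing approach, here is a minimal Python sketch. The transcript IDs, gene IDs, TPM values, and the `sum_by_gene` helper are all made up for the example; in practice the transcript-to-gene mapping would come from your annotation.

```python
from collections import defaultdict

# Hypothetical TPM values and transcript-to-gene mapping (illustration only).
tx_tpm = {"tx1": 12.5, "tx2": 0.0, "tx3": 4.0}
tx2gene = {"tx1": "geneA", "tx2": "geneA", "tx3": "geneB"}


def sum_by_gene(tx_values, tx2gene):
    """Sum transcript-level values (e.g. TPMs) into gene-level values."""
    gene_values = defaultdict(float)
    for tx, value in tx_values.items():
        gene_values[tx2gene[tx]] += value
    return dict(gene_values)


gene_tpm = sum_by_gene(tx_tpm, tx2gene)
print(gene_tpm)  # {'geneA': 12.5, 'geneB': 4.0}
```

Note that the unexpressed transcript (tx2) simply contributes zero to the sum, so geneA stays at 12.5. Taking the mean instead would give 6.25, halving the gene's apparent expression just because a second, silent isoform is annotated.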

tximport will calculate the weighted effective transcript length, but it doesn't use that in its calculation of gene-level expression. Both counts and TPMs are just summed. The effective gene length (the abundance-weighted average of the transcript lengths, i.e. sum(tx_expression*tx_length)/gene_expression) is then used as an offset in differential testing platforms.
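To make that formula concrete, here is a small Python sketch of the abundance-weighted length for a single gene in a single sample. It follows the formula as written in this comment, not tximport's actual source code; the function name, the numbers, and the fallback for a gene with zero total abundance are my own assumptions.

```python
def gene_effective_length(abundances, lengths):
    """Abundance-weighted average of transcript lengths for one gene.

    abundances: per-transcript expression (e.g. TPM) in one sample
    lengths:    per-transcript effective lengths, in the same order
    """
    total = sum(abundances)  # gene-level expression is the plain sum
    if total == 0:
        # Assumption for this sketch only: fall back to the unweighted
        # mean length when the gene has no expression in this sample.
        return sum(lengths) / len(lengths)
    return sum(a * l for a, l in zip(abundances, lengths)) / total


# Hypothetical gene with two transcripts.
print(gene_effective_length([30.0, 10.0], [1000.0, 2000.0]))  # 1250.0
```

Here the gene-level expression is still 30 + 10 = 40; the weighted length of 1250 is a separate quantity, pulled toward the 1000 bp transcript because it carries most of the abundance.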
