Question

Is computing TPM from the raw read counts in a DESeq2 count matrix necessary?

0

5.9 years ago

theodore.killian ▴ 50

I am building an RNAseq pipeline using DESeq2 to show transcriptional differences between a group of RNAseq samples starting from a raw count matrix. Is calculating TPM (from the raw count matrix) unnecessary in this context when I already have the normalized counts produced by the DESeq2 analysis?

Similar posts on this forum seem to suggest that calculating TPM is unnecessary, e.g. "Computing TPM from normalized read count from DESEQ".

RNA-Seq R gene • 3.7k views

0

I have obtained the transcript lengths of the genes from my count matrix. Do I use the mean transcript length to calculate TPM?

Reply • 5.9 years ago by theodore.killian ▴ 50

0

Why do you want to do it manually? Most modern quantification programs (RSEM, kallisto, salmon) will report TPM for you; featureCounts, as far as I know, only outputs counts and feature lengths.

It's more complicated than just transcript length (and what is mean transcript length?). The length that goes into the TPM calculation is the effective length:

transcript length - mean fragment length + 1

(and I'm not sure what happens for transcripts shorter than the mean fragment length)

Reply • 5.9 years ago by predeus ★ 2.1k
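
For anyone who still wants to see the arithmetic, here is a minimal sketch of a manual TPM calculation for a single sample, using the effective-length formula from the reply above. The function names, the example numbers, and the choice to clamp very short transcripts to an effective length of 1 are all assumptions made for illustration; real quantifiers handle short transcripts in their own ways, and which length to use per gene is a separate decision this sketch does not make for you.

```python
def effective_length(transcript_length, mean_fragment_length):
    """Effective length: transcript length - mean fragment length + 1."""
    eff = transcript_length - mean_fragment_length + 1
    # Assumption: clamp to 1 so that transcripts shorter than the mean
    # fragment length do not produce zero or negative lengths.
    return max(eff, 1.0)


def counts_to_tpm(counts, lengths, mean_fragment_length):
    """Convert the raw counts of ONE sample into TPM values."""
    eff = [effective_length(l, mean_fragment_length) for l in lengths]
    # Reads per base of effective length, for each feature
    rates = [c / e for c, e in zip(counts, eff)]
    total = sum(rates)
    if total == 0:
        return [0.0 for _ in rates]
    # Rescale so that the values of the sample sum to one million
    return [r / total * 1e6 for r in rates]


# Hypothetical toy data: three features in one sample
counts = [100, 200, 300]
lengths = [1200, 2200, 700]
tpm = counts_to_tpm(counts, lengths, mean_fragment_length=201)
print(tpm)
```

Because TPM is rescaled per sample to sum to one million, it is computed column by column on the count matrix, never across samples.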

score 2 · Answer 1 · 2019-05-29

2

5.9 years ago

predeus ★ 2.1k

That depends on what you want to do with it. TPM is not needed for differential gene expression, and in fact should not be used for DE: DESeq2 expects raw, un-normalized counts as input.

TPMs are used for other things, such as clustering or classifying genes by expression strength.

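
For contrast with TPM, the normalized counts mentioned in the question come from dividing each sample by a size factor. Below is a rough sketch of the median-of-ratios idea behind that step; it is not DESeq2's actual code, and the function names and toy matrix are made up for illustration. Note that no gene length enters the calculation, which is why these values are suited to comparing one gene across samples rather than comparing different genes within a sample.

```python
import math
from statistics import median


def size_factors(count_matrix):
    """count_matrix: list of genes, each a list of raw counts per sample."""
    n_samples = len(count_matrix[0])
    # Only genes with non-zero counts in every sample have a finite log geometric mean
    usable = [row for row in count_matrix if all(c > 0 for c in row)]
    log_geo_means = [sum(math.log(c) for c in row) / n_samples for row in usable]
    factors = []
    for j in range(n_samples):
        # Log ratio of each gene's count in sample j to that gene's geometric mean
        log_ratios = [math.log(row[j]) - g for row, g in zip(usable, log_geo_means)]
        factors.append(math.exp(median(log_ratios)))
    return factors


def normalize(count_matrix):
    """Divide every count by the size factor of its sample."""
    sf = size_factors(count_matrix)
    return [[c / s for c, s in zip(row, sf)] for row in count_matrix]


# Hypothetical toy matrix: 4 genes x 2 samples, sample 2 sequenced about twice as deep
matrix = [[10, 20], [100, 200], [5, 10], [0, 3]]
sf = size_factors(matrix)
normalized = normalize(matrix)
print(sf)
print(normalized)
```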