Question

transform RPM to TPM

8.0 years ago

universtiycam2015 • 0

Dear all, I am new to this and would like suggestions on how to transform RPM to TPM, or actually to log2(TPM+1), for every gene in my RNA expression file using R. Here are the header and first row of the data frame, which has 63677 rows and 70 columns (samples subjected to RNA-seq):

genename  PT1          PT2          PT3          PT4
MALAT1    53319.14365  25123.86039  30064.91672  62220.13144

Many thanks!

RNA-Seq R • 6.8k views

updated 8.0 years ago by liangjiao.xue ▴ 100 • written 8.0 years ago by universtiycam2015 • 0

Not an answer to your question, but a remark: TPM/RPKM/FPKM are often not the best normalization methods... (But I don't know what you are going to do with the data.)

8.0 years ago by WouterDeCoster 47k

If you have mapped read counts from htseq-count or any other tool, you can calculate RPKM directly in R using the following script.

https://github.com/sethiyap/CalculateTpmSingleSample/blob/master/CalculateTpmSingleSample.R

Cheers, Pooja

8.0 years ago by poojasethiya ▴ 120

score 2 · Answer 1 · 2017-01-12

TPM, FPKM, and RPKM are calculated from raw read counts. It is easier to start from the raw counts than to convert from one normalized measure to another.
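As a rough illustration of starting from raw counts, here is a minimal R sketch that computes TPM and then log2(TPM+1). It assumes you have a genes-by-samples matrix of raw counts plus a gene length (in bp) for every gene. The toy values, the gene names, and the helper name `counts_to_tpm` are all made up for the example, and using a single fixed length per gene is itself a simplification.

```r
# Hypothetical toy inputs: raw counts (genes x samples) and gene lengths in bp.
counts <- matrix(
  c(100, 200,
    300, 600,
    600, 200),
  nrow = 3, byrow = TRUE,
  dimnames = list(c("geneA", "geneB", "geneC"), c("PT1", "PT2"))
)
gene_length_bp <- c(geneA = 1000, geneB = 2000, geneC = 3000)

# Illustrative helper (not from any package): raw counts -> TPM.
counts_to_tpm <- function(counts, lengths_bp) {
  stopifnot(nrow(counts) == length(lengths_bp))
  # Reads per kilobase: each gene's row is divided by its length in kb
  # (the length vector is recycled down the rows of the matrix).
  rpk <- counts / (lengths_bp / 1000)
  # Rescale every sample (column) so that its values sum to one million.
  sweep(rpk, 2, colSums(rpk), "/") * 1e6
}

tpm <- counts_to_tpm(counts, gene_length_bp)
log_tpm <- log2(tpm + 1)
```

For a data frame laid out like the one in the question (a `genename` column followed by sample columns), something like `as.matrix(df[, -1])` with `rownames` set from `df$genename` would give the matrix form used here, provided the values are raw counts and the length vector is in the same gene order.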

If RPM means "reads per million total (or mapped) reads", some tools treat it as equivalent to TPM (transcripts per million) for single-end data, but the two agree only when no transcript-length normalization is applied; standard TPM divides each count by the transcript length before scaling to one million. For paired-end reads, singletons (only one end mapped) and fully mapped read pairs may be counted differently, depending on how the software handles the numbers. Some tools may treat RPM the same as TPM; others may not. So starting from the raw data is necessary.
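To make the comparison concrete, the commonly used definitions can be written out as below. The symbols are assumptions introduced for illustration: c_i is the raw read (or fragment) count for gene i, and \ell_i is its length (or effective length). Individual tools may define RPM differently, which is the reason for checking against raw counts.

```latex
\mathrm{RPM}_i = \frac{c_i}{\sum_j c_j} \times 10^{6},
\qquad
\mathrm{TPM}_i = \frac{c_i / \ell_i}{\sum_j \left( c_j / \ell_j \right)} \times 10^{6}
```

Under these definitions the two quantities coincide only when every \ell_i is the same, that is, when length plays no role in the normalization.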