Salmon TPM calculation constant
22 months ago
shinyjj ▴ 50

Hi all, salmon seems to calculate TPM using the equation below, and the constant appears to be 26.1 for every calculated TPM. Does anybody know what this constant means and how it is derived?

TPM = constant * NumReads / EffectiveLength
22 months ago
Rob 6.9k

The salmon TPM calculation is

TPM_i = 10^6 * [ (NumReads_i / EffectiveLength_i) / (sum_j (NumReads_j / EffectiveLength_j)) ]

where NumReads_i is the expected number of reads arising from transcript i, EffectiveLength_i is its effective length, and likewise for j. Can you say how you are deriving this constant of 26.1? Are you looking over _all_ transcripts in the resulting quant.sf file? The TPM values across all transcripts should (barring machine precision) sum to 1,000,000.
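
To make the formula concrete, here is a minimal Python sketch of the same calculation. The function name `tpm` and the three-transcript input are made up for illustration; this is not salmon's own code and the numbers are not real salmon output.

```python
def tpm(num_reads, eff_lengths):
    """Return TPM values for parallel lists of NumReads and EffectiveLength."""
    # Reads per effective base for each transcript.
    rates = [n / l for n, l in zip(num_reads, eff_lengths)]
    total = sum(rates)
    # Normalize each rate by the total, then scale to one million.
    return [1e6 * r / total for r in rates]


# Hypothetical values for three transcripts (illustration only).
num_reads = [100.0, 250.0, 50.0]
eff_lengths = [1000.0, 500.0, 2000.0]

values = tpm(num_reads, eff_lengths)
print(values)
print(sum(values))  # should be 1,000,000 up to floating-point error
```

The sum check only holds when every transcript in the file is included; summing over a subset gives a smaller total.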

My guess is that it's 10^6/(sum_j (NumReads_j/EffectiveLength_j))?
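
If that guess is right, the constant is the same for every transcript within one sample but changes from sample to sample, so 26.1 would simply be its value for this particular run. A small Python sketch of that idea, using a hypothetical helper `scaling_constant` and made-up numbers rather than real salmon output:

```python
def scaling_constant(num_reads, eff_lengths):
    """Return 10^6 / sum_j (NumReads_j / EffectiveLength_j)."""
    return 1e6 / sum(n / l for n, l in zip(num_reads, eff_lengths))


# Hypothetical values for three transcripts (illustration only).
num_reads = [100.0, 250.0, 50.0]
eff_lengths = [1000.0, 500.0, 2000.0]

c = scaling_constant(num_reads, eff_lengths)
# TPM = constant * NumReads / EffectiveLength, with one constant for all transcripts.
tpms = [c * n / l for n, l in zip(num_reads, eff_lengths)]
print(c)
print(tpms)
```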

Hi Rob, you are correct. It does sum to 1M. Thanks a lot. Since you are an expert in this tool, could you also help me with this question: Intron retention gene expression using salmon
