TPM for correlation between genes
2
0
Entering edit mode
4.4 years ago

Hi everyone,

I normally use DESeq2 for normalization for analyses such as regressions but have only access to a TPM dataset (no raw counts available).

Does it make sense to use TPM for correlation between two genes? Or should I quantile normalize first?

rnaseq • 2.7k views
ADD COMMENT
1
Entering edit mode
4.4 years ago

Well I would say both TPM or FPKM are valid measures of gene expression comparisons. Both are normalized by gene length and read depth, making them suited for relative expression comparison.

For more info: simple review.

ADD COMMENT
0
Entering edit mode
4.4 years ago
ben.kunfang ▴ 30

I don't think TPM data works for DESeq2, DESeq2 perform an internal normalization.

ADD COMMENT
0
Entering edit mode

Yes I'm aware. To clarify, I meant I would normally normalize my dataset with DESeq2, but because my dataset is TPM, I cannot do that so I was wondering if TPM is appropriate for correlation between two genes.

ADD REPLY
0
Entering edit mode

If you are looking for linear correlation such as Pearson then it should not matter too much which normalization you use since all these linear methods (per-million, RLE, TMM...) perform a linear scaling by a single factor. Quantile normalization is a different story as QN forces the distributions to be identical and this obviously changes the correlations between samples. It would of course be better to have a method such as tximport in your pipeline which corrects for the relative gene length depending on the isoforms that are being expressed but if you only have TPM then you do not have that choice.

ADD REPLY

Login before adding your answer.

Traffic: 2022 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6