Pan-cancer RNA-seq data normalization
2
0
Entering edit mode
2.2 years ago
Brisket ▴ 10

I have a set of FPKM data of several cancer cell lines and want to compare their gene expression.

Is it ok to transform them into logged numbers or TPM and calculate the fold change?

Or I have to calculate z-score?

normalization RNA-seq • 1.2k views
ADD COMMENT
0
Entering edit mode

Is this data from the same source or did you collect it from independent sources?

ADD REPLY
0
Entering edit mode

Sorry for replying after so long. The data was from CellMiner, it contains 60 cell lines' RNA-seq FPKM data in a single excel sheet. I couldn't find the raw count datao I have to do my analysis based on FPKM.

ADD REPLY
0
Entering edit mode

Is it an option to download the reads and pseudo-align them yourself with kallisto/salmon? Most laptops could do this overnight, as long as you have room to store the .fastq files temporarily (account for approx 6*60 = 360GB of storage).

ADD REPLY
1
Entering edit mode
2.1 years ago
bkleiboeker ▴ 370

Typically it's not good practice to use FPKM (or other such values) for any sort of downstream statistical analyses, including differential expression analysis (read: calculating fold change).

ADD COMMENT
1
Entering edit mode
2.1 years ago

Mathematically, FPKM is not appropriate for comparison between samples. No amount of mathematical transformation will change that.

ADD COMMENT

Login before adding your answer.

Traffic: 1529 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6