FPKM values for DE analysis - Why?
1
0
Entering edit mode
13 months ago
Thalita • 0

Hello everyone!

I believe my questions are quite naive, but I am super new to RNA-seq data analysis, so forgive me! I already saw some questions regarding this subject, but I still do not understand it well. I am currently performing (trying to, actually) an analysis to identify differently expressed genes between two groups (with cancer, no normal samples in any group) from TCGA dataset. I've downloaded TCGA transcriptome data with FPKM-unstranded values, as the data is already normalized (is it right?!). However, when looking for a pipeline to perform this analysis, I noticed that DESEq2 or edgeR/limma analysis only uses raw counts as input to identify these genes. Why I should not be performing DE with FPKM values? I mean, which kind of analysis are FPKM values okay to perform with? Many many thanks in advance.

fpkm rna-seq • 1.1k views
ADD COMMENT
1
Entering edit mode
13 months ago
bk11 ★ 3.0k

I've downloaded TCGA transcriptome data with FPKM-unstranded values, as the data is already normalized (is it right?!).

Yes, these data are normalized.

Why I should not be performing DE with FPKM values?

While FPKM normalization method accounts for sequencing depth and gene length, FPKM data are not recommended for DE-analysis. The reason is that the normalized count values output by the FPKM method are not comparable between samples.

For your reference, I would like you to check this link.

ADD COMMENT
0
Entering edit mode

Guess I'm gonna adventure myself with the raw counts then. The link was super helpful btw, thank you!

ADD REPLY

Login before adding your answer.

Traffic: 1867 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6