cBioPortal data with negative log2(FPKM + 1) values
1
0
Entering edit mode
17 months ago
LauferVA 4.5k

Hello, I am looking at data downloaded from cBioPortal directly.

The metadata for the mrna-seq files asserts the data are log2(FPKM + 1), but there values as small as -0.3 in the matrix.

I am thinking maybe they did log2(FPKM) + 1 instead of log2(FPKM + 1), or maybe they forgot the 1 altogether, who knows.

Has anyone dealt with this before, i.e. has it been reported for other datasets downloadable from cBioPortalData?

Thank you
VAL

FPKM cBioPortal • 1.1k views
ADD COMMENT
1
Entering edit mode

I would contact them. MSKCC is a reputable organisation but not immune to making grave errors.

ADD REPLY
0
Entering edit mode

Copy. Thank you, Kevin Blighe

ADD REPLY
0
Entering edit mode
17 months ago
Zhenyu Zhang ★ 1.2k

I don' know if they have updated their algorithm recently. A few years ago when I looked at cBio RNA-Seq data, I believe that was z-score normalization.

ADD COMMENT
0
Entering edit mode

they still use this, but it is in a separate table. the rna_seq_v2_mrna column is described as log2( FPKM + 1 ) in the metadata

ADD REPLY
0
Entering edit mode

Interesting. Anything called "rna_seq_v2" in the TCGA world usually means the raw input is legacy hg19 RSEM calls.

ADD REPLY
0
Entering edit mode

ya i think it is a mistake.

this is not the only dataset like this

ADD REPLY

Login before adding your answer.

Traffic: 1468 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6