Question

genefu for PAM50 prediction

1

Entering edit mode

6.2 years ago

silviajserrano ▴ 50

For the genefu package to do the PAM50 prediction from RNA-seq data, my question es, should I use raw data (read counts) or should I transform my data prior the prediction?

RNA-Seq breast cancer subtyping • 3.1k views

ADD COMMENT • link updated 7 months ago by hamarillo ▴ 80 • written 6.2 years ago by silviajserrano ▴ 50

score 2 · Answer 1 · 2019-07-14

2

Entering edit mode

5.4 years ago

darklings ▴ 580

Values that have been normalized and transformed for between sample comparison are appropriate.

I use the expression data that are median ratio normalized and variance stabilizing transformed (functions in DESeq2) from the read counts to do the PAM50 classification, then I compared the results predicted from TPM values, they are quite similar while there are still some samples (< 10) assigned to a different class.

ADD COMMENT • link 5.4 years ago by darklings ▴ 580

1

Entering edit mode

I use the expression data that are median ratio normalized and variance stabilizing transformed (functions in DESeq2) from the read counts to do the PAM50 classification

Hi. For running PAM50 analyses, do you 1) normalize your entire gene expression matrix or rather 2) select only the subset of genes that PAM50 uses for classification and then do the normalization?

ADD REPLY • link 2.8 years ago by wstfljs ▴ 100

1

Entering edit mode

I normalized the entire expression matrix first then used those 50 genes for doing PAM50

ADD REPLY • link 2.7 years ago by darklings ▴ 580

0

Entering edit mode

But why do you normalize? wouldn't each sample be treated independently to classify its pam50 status?

ADD REPLY • link 7 months ago by hamarillo ▴ 80