Question

Can I convert normalized counts from DESEQ2 into fpkm?

0

Entering edit mode

6.3 years ago

nanoide ▴ 120

Hi all, So I got gene expression values from different RNA-seq experiments. Some are FPKM and some are normalized counts from DESEQ2 I believe I cannot compare these directly. Is there any way to make them comparable? Maybe a way to convert normalized deseq counts to FPKM values?

Thanks

RNA-Seq deseq normalization fpkm • 5.3k views

ADD COMMENT • link 6.3 years ago by nanoide ▴ 120

score 1 · Answer 1 · 2018-08-07

1

Entering edit mode

6.3 years ago

ATpoint 85k

Do you have access to the raw counts? The point with DESeq2 counts is that the size factors change depending on which samples you process together. Should start with raw counts of all experiments if possible.

ADD COMMENT • link 6.3 years ago by ATpoint 85k

0

Entering edit mode

That's what I thought. Unfortunately, these are not my studies and I doubt I can get the raw counts. If i did, I should use DESeqDataSetFromHTSeqCount() and then fpkm(), rigth?

Thank you for your tiime

ADD REPLY • link 6.3 years ago by nanoide ▴ 120

3

Entering edit mode

Yes, but I would avoid FPKM as literature states on multiple occasions that it performs poorly on between-sample comparisons. Check literature on comparisons of normalization methods. I would go with the standard DESeq2 workflow, getting normalized counts from that pipeline.

ADD REPLY • link 6.3 years ago by ATpoint 85k

1

Entering edit mode

By default, DESeq2's fpkm() will return "robust" FPKMs, which use size factors to normalize instead of taking the column sums of the raw counts.

ADD REPLY • link 6.3 years ago by igor 13k

1

Entering edit mode

But this would mean that FPKMs are only comparable if all the samples were processed together and contributed to the size factors, right?

ADD REPLY • link 6.3 years ago by ATpoint 85k

1

Entering edit mode

Yes. But if the samples are processed separately, you will likely have batch differences. That's a much more complex issue.

ADD REPLY • link 6.3 years ago by igor 13k

0

Entering edit mode

Thanks for the useful discussion. Then this means I should use robust = F in fpkm() in deseq? So I'm able to directly compare with other fpkm values? Thanks

ADD REPLY • link 6.3 years ago by nanoide ▴ 120

1

Entering edit mode

Non-robust should be the classic FPKMs. If you check the column sums (colSums), they should all add to 1M then.

ADD REPLY • link 6.3 years ago by igor 13k