Entering edit mode
6.3 years ago
nanoide
▴
120
Hi all, So I got gene expression values from different RNA-seq experiments. Some are FPKM and some are normalized counts from DESEQ2 I believe I cannot compare these directly. Is there any way to make them comparable? Maybe a way to convert normalized deseq counts to FPKM values?
Thanks
That's what I thought. Unfortunately, these are not my studies and I doubt I can get the raw counts. If i did, I should use DESeqDataSetFromHTSeqCount() and then fpkm(), rigth?
Thank you for your tiime
Yes, but I would avoid FPKM as literature states on multiple occasions that it performs poorly on between-sample comparisons. Check literature on comparisons of normalization methods. I would go with the standard DESeq2 workflow, getting normalized counts from that pipeline.
By default, DESeq2's
fpkm()
will return "robust" FPKMs, which use size factors to normalize instead of taking the column sums of the raw counts.But this would mean that FPKMs are only comparable if all the samples were processed together and contributed to the size factors, right?
Yes. But if the samples are processed separately, you will likely have batch differences. That's a much more complex issue.
Thanks for the useful discussion. Then this means I should use robust = F in fpkm() in deseq? So I'm able to directly compare with other fpkm values? Thanks
Non-robust should be the classic FPKMs. If you check the column sums (
colSums
), they should all add to 1M then.