Hello,
I have been recently pointed to a lecture by Lior Pachter:
where he states that FPKM normalization method is not completely correct or at least not suitable for differential expression analysis.
Could somebody please elaborate on that? In my study, I would like to compare different experimental conditions (pairwise), all with replicates. Is FPKM inappropriate in such design?
Also, if I use Galaxy, how and at which step can I deal with it? How can I transform my FPKM values to TPMs e.g.?
Looking forward to learn,
Regards,
Monika
The part on FPKM of the video starts at about 30m. Unfortunately, a youtube video of a lecture is not a citable peer-reviewed article, but there is a review in Brief. in Bioinformatics, I remember.
See also Does FPKM scale incorrectly in case of unequal mapping rates?
An update (6th October 2018):
You should abandon RPKM / FPKM. They are not ideal where cross-sample differential expression analysis is your aim; indeed, they render samples incomparable via differential expression analysis:
Please read this: A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis
Also, by Harold Pimental: What the FPKM? A review of RNA-Seq expression units