Hi Biostars,
I would like to get FPKM values from normalized read counts for that I am using this formula FPKM(Fragments Per Kilobase per Million) = [# of fragments]/([length of transcript]/1000)/([total reads]/10^6)
My problem is in gtf file most of the geneids having more than one transcript with different lengths. I am confused which one to consider for my calculation.
Any comments will be helpful for my analysis.
Thanks in advance.
Not what you are asking for, but are you sure FPKM is appropriate for your downstream application? It's often not the best normalization method.
An update (6th October 2018):
You should abandon RPKM / FPKM. They are not ideal where cross-sample differential expression analysis is your aim; indeed, they render samples incomparable via differential expression analysis:
Please read this: A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis
Also, by Harold Pimental: What the FPKM? A review of RNA-Seq expression units