Hi,
I have an de novo RNA-seq dataset that I have run RSEM on the gene-level and gotten out FPKM for all Trinity genes in the assembly. Unfortunately, the assembler split up a real gene (say, a transporter) into two separate Trinity genes in the assembly, so the first Trinity gene contains the first half of the sequences and the second Trinity gene contain the second half. Kind of like this:
-------------------------------------------------
|--------------------| |------------------------|
I have the FPKM values for both of these Trinity genes separate. How can I calculate the FPKM for both of these Trinity genes if I would like to get the combined FPKM for both of them? Can FPKM simply be added together so FPKM(tot) = FPKM(Trinity gene 1) + FPKM(Trinity gene 2)? Or does it require a more complicated procedure?
I ask since I am not sure if it matters in what order length and library size normalization and combination of expression data from two Trinity genes are done. Are these operations commutative? If not, is it better to first add the raw expression values from the two Trinity genes and then normalize?
https://www.mathsisfun.com/fractions_addition.html