Hi,
I have several questions after reading the manual for the CQN normalization method. I have also checked several related posts on Biostar, but I am still quite confused.
How do I get the gene length? It seems easy to calculate from the gene coordinates, which can be obtained directly from the Ensembl website (end bp - start bp + 1?). But it seems more appropriate to sum the lengths of all exonic regions for each gene.
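To make the two options concrete, here is a minimal sketch of what I mean. It assumes exon coordinates are 1-based and inclusive (as on Ensembl) and that exons from different transcripts of the same gene may overlap, so they are merged before summing. The function name `exonic_length` and the example coordinates are made up for illustration.

```python
def exonic_length(exons):
    """Total number of bases covered by the union of exon intervals.

    exons: iterable of (start, end) pairs, 1-based inclusive coordinates,
    all on the same chromosome and belonging to the same gene.
    """
    total = 0
    cur_start = cur_end = None
    for start, end in sorted(exons):
        if cur_end is None or start > cur_end + 1:
            # Gap before this exon: close the previous merged block.
            if cur_end is not None:
                total += cur_end - cur_start + 1
            cur_start, cur_end = start, end
        else:
            # Overlapping or adjacent exon: extend the current block.
            cur_end = max(cur_end, end)
    if cur_end is not None:
        total += cur_end - cur_start + 1
    return total


# Hypothetical gene with three exons, two of which overlap.
example_exons = [(100, 200), (150, 250), (400, 450)]

gene_span = 450 - 100 + 1                    # option 1: end - start + 1
exonic = exonic_length(example_exons)        # option 2: merged exon lengths

print(gene_span, exonic)
```

For this made-up gene the span is 351 bp, while the merged exonic length is 202 bp, so the two definitions can differ a lot for genes with long introns.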
How do I get the GC content (%)? Unlike gene length, the Ensembl website gives the GC content directly. But if the gene length should not be calculated from the gene coordinates as I assumed, those GC values cannot be used either, since they would refer to a different region.
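If the length is taken from the exons, I assume the GC content should be computed over the same exonic sequence. A minimal sketch of that calculation, where `gc_fraction` is a hypothetical helper and the input string is assumed to be the concatenated exon sequence of one gene (ambiguous bases such as N are ignored):

```python
def gc_fraction(sequence):
    """Fraction of G and C among the unambiguous bases (A, C, G, T)."""
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    acgt = gc + seq.count("A") + seq.count("T")
    # Return NaN when there are no unambiguous bases to count.
    return gc / acgt if acgt else float("nan")


# Made-up sequence for illustration only.
print(gc_fraction("ATGCGCNN"))
```

The result is a fraction between 0 and 1; multiply by 100 to compare it with the percentage shown on Ensembl.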
If the data have no GC-content bias or gene-length bias but the CQN normalization method is applied anyway, what effect will that have?
Are the residual values produced by CQN on a log2 RPM scale by default?
In short, I want to know the most accurate way to obtain gene length and GC content.
Thanks,