Hi all, I have some questions regarding RNA seq analysis if you can suggest anything it will help me a lot.
I am currently normalizing RNA seq data for comparing genes expression within and between samples. Which normalization method would you recommend for this type of analysis? FPKM, TPM, TMM? Also I want to make a heatmap to see genes expressed in different conditions. Do you think transforming normalized data (like log2, z-score) is a good idea for this?
Also I want to build a co expression network, I am just wondering if normalization like FPKM, TPM, TMM has any influence on building a coexpression network?
Another thing I want to do is use Pearson correlation but I am confused if it will only find linear relationships among the normally distributed data. But normalization methods do not assume that counts to be normally distributed. So, Is it a bad idea to find pairwise coexpression of genes using pearson correlation? If so, which method would you recommend is reliable for building coexpression networks with which normalization method?
Please help me with this, I will be very grateful to you.
Thanks in advance
For your first question - I recommend you to check edgeR or deseq2
The second question - you can use
WGCNA
R package for co-expression network analysis. you have to use normalized data like log2(counts+1) or FPKM, logCPM data as input.Thanks I will take a look at edgeR and deseq2. I have 16 samples, Is it a good idea to use WGCNA for my analysis? because author recommended more than 20 samples for WGCNA.