Hi guys, I have a simple question. I have RNA-Seq data from different batches. As suggested looking at many posts on-line I have pre-normalized my data (using the TMM from edgeR) then I have corrected them using Combat and then I have re-normalized them (for the library-size) using DESeq2. My question is: is it correct the second normalisation after Combat? Or at least is it not dramatically not-correct?
Thank you in advance
Best
Dear Kevin, the design of the experiment of the posted question you redirected me to, is al little bit different from my case because I don't have nested design/s. In any case, people suggest, generally, to pre-normalize data in order to remove some high-level variability and then perform batch-correction. I agree with you about the way to correct, i.e. basically using the batch as a covariate. My question is if the normalization after the correction that is basically a ratio of the genes by the library size of each sample is wrong or it is expected not to affect dramatically the identification of variable genes across conditions (i.e. DEGs). Thank you a lot for your help!
You have accepted that answer from rpolicastro; so, I will assume that the problem has been addressed and avoid responding further.