I used calcNormFactors
from edgeR
to normalize my RNA seq data matrix with raw counts. Here is the code I am using. d_des
is my design matrix and cm
is my contrast matrix
counts_matrix <- as.matrix(raw_counts_dt, rownames = TRUE)
de_List <- DGEList(counts_matrix)
de_List %<>% calcNormFactors
res_Voom <- voom(de_List, d_des, plot = TRUE)
lm_Fit <- lmFit(res_Voom, d_des)
eb_Fit <- eBayes(contrasts.fit(lm_Fit, cm), trend = F, robust = T)
I have the following questions:
- If I understand the document correctly,
TMM
is the default normalization method? Is this the correct method to be used for normalizing raw counts to be used for DE analysis? If not which one should be used?
what is it normalized for? It is normalizing for all genes in my list across all samples ie. replicates for each groups?
Is the
voom
using the normalized counts from myde_List
or is it still using the raw counts? If it is using raw counts, how could I use the normalized counts?I am using
res_Voom$E
matrix to create the PCA plot to compare my conditions. Is this using the normalized counts or the raw counts? how do I use normalized counts if it is not?I am using
topTable
fromeb_Fit
to list the top differentially expressed genes. Is this the correct table to use?
Thank you in advance for your answers!