Question

Limma Differential Expression with Duplicate Correlation

0

Entering edit mode

2.2 years ago

Bine ▴ 90

Dear all,

After some attempts with DESEQ2, I have now moved to Limma to solve my issue with 3 samples per individual in my Differential Expression Analysis. I have to adjust my model for age, sample site and diagnosis. I am interested in differentially expressed genes in sex (male versus female).

I followed this tutorial under "Limma Analysis": https://bioc.ism.ac.jp/packages/3.14/bioc/vignettes/variancePartition/inst/doc/dream.html

I wonder if this code makes sense now, I dont quite understand why sometimes people put "~0 + AGE.GROUP + ..." What is the +0 doing, is it fine without?

# apply duplicateCorrelation is two rounds
design = model.matrix( ~ AGE.GROUP + SAMPLE.SITE + DIAGNOSIS + SEX, metadata)

vobj_tmp = voom( geneExpr, design, plot=FALSE)
dupcor <- duplicateCorrelation(vobj_tmp,design,block=metadata$INDIVIDUAL)

# run voom considering the duplicateCorrelation results
# in order to compute more accurate precision weights
# Otherwise, use the results from the first voom run
vobj = voom( geneExpr, design, plot=FALSE, block=metadata$INDIVIDUAL, correlation =dupcor$consensus)

# Estimate linear mixed model with a single variance component
# Fit the model for each gene, 
#dupcor <- duplicateCorrelation(vobj, design, block=metadata$INDIVIDUAL)

# But this step uses only the genome-wide average for the random effect
fitDupCor <- lmFit(vobj, design, block=metadata$INDIVIDUAL, correlation=dupcor$consensus)

# Fit Empirical Bayes for moderated t-statistics
fitDupCor <- eBayes( fitDupCor )

Thank you so much, Bine

Duplicate Differential Analysis Limma Expression Correlation • 2.4k views

ADD COMMENT • link 2.2 years ago by Bine ▴ 90

score 3 · Answer 1 · 2022-09-25

The 0 in the design formula specifies whether to include an intercept term in the module or not. If included the interscept term generally represents the expression in the baseline condition, with other coefficients representing logfoldchanges from this baseline condition. With no intercept there is no baseline condition, and log2foldchanges are calcuated as the difference between coefficients for the two conditions.

So, as far as I'm aware, at least in standard fixed effect setting, to include a intercept or not is a question of how contrasts are encoded, not what the final results are.