Question

How to get normalized counts corrected for variables in the design matrix of DESeq2 as an input for WGCNA

0

Entering edit mode

21 months ago

Sara ▴ 40

Hi everyone,

I am analyzing Bulk RNA-Seq data, I generated the counts with HTSeqcount and I analyzed them with DESeq2.

Then, I want to get the normalized counts corrected for the variables in my design matrix (Like age, sex, RIN, ...) to use it as an input for WGCNA.

But I realized when I am using variance normalized counts (vst-counts) from DESeq2 they are not corrected for variables in my design matrix.

I was wondering which normalized counts I can use that can be corrected for variables in my design matrix, therefore it can be used as an input for WGCNA?

I also tried to use "limma::removeBatchEffect" but still, it didn't really correct my count for those variables.

I would be grateful if anyone could provide/recommend any solutions, please!

Here is the code that I used in my DESeq2:

dds <- DESeqDataSetFromHTSeqCount(sampleTable=sampleTable,
                                  directory=folder,
                                  design=~Plate+RIN+Sex+Age+condition+PC3+PC2+PC1)

dds <- estimateSizeFactors(dds)
keep <- rowSums( counts(dds) >= 10 ) >= 20 #filters out genes 
dds <- dds[keep,]

colData(dds)$condition <- relevel(colData(dds)$condition, ref = "Control")

dds<- DESeq(dds)

vsd <- vst(dds, blind = FALSE)

I want to get the normalized counts adjusted for sex, RIN, plate, age, PC1, PC2, PC3

Thank you in advance!

WGCNA vst DESeq2 • 1.9k views

ADD COMMENT • link 20 months ago by Sara ▴ 40

score 2 · Answer 1 · 2023-08-11

2

Entering edit mode

21 months ago

andres.firrincieli 3.9k

vst doesn't perform any batch correction. Check these posts on how to use vst to get the normalized transformed count data for WGCNA: link1;link2.

Regarding WGCNA, you can follow the module-trait relationship analysis to check which modules correlate with RIN, sex, age etc...

ADD COMMENT • link 21 months ago by andres.firrincieli 3.9k

0

Entering edit mode

Thanks for your comment. yes, that's my point the vst doesn't perform the batch correction; even after using limma::removeBatchEffect still, the counts are somehow the same. And in WGCNA I see modules that are highly correlated with sex, age, and cell type proportions, ... Are there other methods or normalized counts that can be corrected for the batches that I can use as input for WGCNA?

ADD REPLY • link 20 months ago by Sara ▴ 40

score 1 · Answer 2 · 2023-08-11

1

Entering edit mode

21 months ago

bk11 ★ 3.1k

Please correct your title there is typo WGCAN for WGCNA. You can get normalized counts as follows-

normalized_counts=counts(dds, normalized=TRUE)

ADD COMMENT • link 21 months ago by bk11 ★ 3.1k