Question

Combining different studies from recount for differential gene expression analysis

0

Entering edit mode

9 months ago

MieszkoL ▴ 10

Dear all,

I'd like to combine different datasets (TCGA-GBM, TCGA-LGG, SRP027383, GTEX brain) from recount-brain to perform differential gene expression analysis with DESeq2 (+ use normalized counts for visualization purposes).

So far I've studied following topics and workflows:

https://support.bioconductor.org/p/9151174/

https://support.bioconductor.org/p/9143498/

http://research.libd.org/recount-brain/example_multistudy/recount_brain_multistudy.html

http://research.libd.org/recount-brain/example_SRP027383/example_SRP027383.html

But I'm still not sure whether it's possible and if yes, how to do it. Do I need to separate glioblastoma from control tissues prior to normalizing the data to remove dataset specific effects? Separate tumors depending on grading?

How about normalizing for dataset and adjusting for PCs before feeding the data to DESeq2 as in: http://research.libd.org/recount-brain/example_multistudy/recount_brain_multistudy.html#normalize-for-dataset (Points 3.4 and 3.5)

Contrary to this topic: https://support.bioconductor.org/p/9151174/ some authors seemed to go through with it: https://www.nature.com/articles/s41598-023-31180-z

Would running ComBat-seq with passing dataset to covar_mod be sufficient prior to DESeq analysis? (after verifying with PCA)

I'd really apreciate your guidance with this!

ComBat-seq batch DESeq2 recount • 367 views

ADD COMMENT • link updated 9 months ago by ATpoint 87k • written 9 months ago by MieszkoL ▴ 10