Hello everyone, I am working on scRNA-Seq data analysis and I have a technical question. We can combine different scRNA-Seq experiments with batch correction methods such as MNN or CCA. As I know, while doing differential expression analysis we should consider batch effect like Scater/Scran package provided a block parameter to do analysis with batch.
But the point is, if out data sets comes from different conditions (let's say healthy and disease) and real source of batch effect is the condition and we want to compare the transcriptomes of specific cell types between conditions, what should we do? We cannot block or do correction for batch since we want to see the effect of batch to specific conditions.
Treating scRNA-Seq data as bulk RNA-Seq data and use raw-counts (after deletion of non-expressed genes of course) with methods such as DESeq2 or edgeR, would it be okay?
Thank you in advance.
Do you have a real batch effect or are you just concerned about how to match clusters of cells between samples?
I have real batch effect. I have 2 data sets for 2 conditions. Their experimental methods, platforms and tissues are same only conditions are different. I identified the cell types of clusters in data sets seperately now I want to compare specific cell types between conditions.
I don't think it is possible to differentiate between batch effects and biological effect in this case. If you performed the experiment with biological conditions side by side in two batches, then you can try to look for batch effects.