Hi all,
I'm currently doing differential expression analysis on several public RNAseq datasets coming from different studies.
Each one having samples with two conditions : "responders" and "non responders".
I'm wondering if I should analyse each dataset separately or if I should mix all samples in one meta-dataset, while correcting for batch effects.
When I analyse each dataset separately, I don't have any intersection of genes between each results. With that in mind, would it be possible to get differentially expressed genes if I mix all datasets? Or is it nonsense to expect that?
Thank for any input
Zero overlaps between sets is concerning. How consistent are the experimental designs?
The experimental designs are quite consistent on the paper, but as it is tumor biopsies samples, I'm afraid the different procedures associated with samples treatment can bring a lot of noise.