Question

Differential expression bias in Seurat

11 months ago

sleepystudent • 0

Hello everyone! I am working with an scRNA-seq dataset of cells (SKOV3 cell culture) in 4 different conditions (2 levels of treatment). There are some biases: one of the conditions has 10 times more cells than the others, whereas the other 3 had doublets, which have been filtered out.

Basically, clustering gives me the same result as the sample annotation. (Images: clustering results; samples.) My primary interest was detecting marker genes between clusters and/or conditions. However, running FindAllMarkers gives me a table where, for each cluster, there is a strong bias towards either upregulated or downregulated genes. (Images: one cluster markers; another cluster markers.) I tried downsampling the largest condition so that all conditions would be the same size, but it didn't fix my problem.

Is this a common problem with single-cell data? Is it possible to fix it? Any advice would be appreciated.

Thanks!!

markers seurat scRNA-seq single-cell • 535 views

updated 11 months ago by jared.andrews07 ★ 19k • written 11 months ago by sleepystudent • 0

score 1 · Answer 1 · 2024-07-02

This is likely due, at least in part, to double dipping (using the same data both to define the clusters and to test for differences between them), which results in deflated p-values. You can read more about this in the OSCA book. As such, it's often more effective and robust to focus on effect size metrics when identifying cluster markers.
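To make "focus on effect size" a bit more concrete, here is a minimal, library-free sketch that ranks genes for one cluster by two common effect sizes: a pooled-variance Cohen's d and the AUC (the probability that a random cell in the cluster has higher expression than a random cell outside it). The function names `cohens_d`, `auc` and `rank_markers`, the gene names and the toy values are all made up for illustration; this is not how Seurat computes its statistics, and in practice you would use an established marker-scoring implementation on log-normalized expression.

```python
import math
from statistics import mean, variance


def cohens_d(inside, outside):
    """Difference in mean expression, scaled by the pooled standard deviation.

    Assumes each group has at least two cells (needed for a sample variance).
    """
    n1, n2 = len(inside), len(outside)
    pooled_var = ((n1 - 1) * variance(inside) + (n2 - 1) * variance(outside)) / (n1 + n2 - 2)
    if pooled_var == 0:
        return 0.0  # no spread at all: treat as "no standardized effect" in this sketch
    return (mean(inside) - mean(outside)) / math.sqrt(pooled_var)


def auc(inside, outside):
    """Fraction of (inside, outside) cell pairs where the inside cell is higher.

    Ties count as 0.5, so 0.5 means no separation and 0 or 1 means perfect separation.
    """
    wins = sum(
        1.0 if x > y else 0.5 if x == y else 0.0
        for x in inside
        for y in outside
    )
    return wins / (len(inside) * len(outside))


def rank_markers(expr, labels, cluster):
    """Rank genes for one cluster by absolute Cohen's d.

    expr:   dict mapping gene -> list of log-normalized values, one per cell
    labels: list of cluster labels, in the same cell order as the values
    """
    rows = []
    for gene, values in expr.items():
        inside = [v for v, lab in zip(values, labels) if lab == cluster]
        outside = [v for v, lab in zip(values, labels) if lab != cluster]
        rows.append((gene, cohens_d(inside, outside), auc(inside, outside)))
    return sorted(rows, key=lambda row: abs(row[1]), reverse=True)


if __name__ == "__main__":
    # Hypothetical toy data: six cells, two clusters, two genes.
    toy_expr = {
        "GENE_A": [5.0, 4.0, 6.0, 1.0, 0.0, 2.0],
        "GENE_B": [1.0, 2.0, 3.0, 1.0, 2.0, 3.0],
    }
    toy_labels = ["c1", "c1", "c1", "c2", "c2", "c2"]
    for gene, d, a in rank_markers(toy_expr, toy_labels, "c1"):
        print(gene, round(d, 2), round(a, 2))
```

Unlike a p-value, neither score shrinks towards zero just because a cluster contains many cells, so the ranking is less sensitive to the tenfold difference in cell numbers.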

Your large sample could also be biasing things, and you should think about whether integration makes any sense in this case. Normally, I'd recommend pseudobulking and traditional bulk RNA-seq methods for cross-condition comparisons, but you need replicates for that.
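For reference, pseudobulking just means summing raw counts over all cells from the same sample, so that each replicate becomes a single bulk-like profile. A minimal sketch of the aggregation step is below; the function name `pseudobulk`, the cell, gene and sample IDs, and the input layout (nested dicts of raw counts) are assumptions for illustration only. The resulting per-sample count table would then go into a bulk differential expression tool, which is the step that needs replicate samples per condition.

```python
from collections import defaultdict


def pseudobulk(counts, cell_sample):
    """Sum raw counts per gene over all cells belonging to the same sample.

    counts:      dict mapping cell ID -> dict of gene -> raw (integer) count
    cell_sample: dict mapping cell ID -> sample (replicate) ID
    Returns a dict mapping sample ID -> dict of gene -> summed count.
    """
    totals = defaultdict(lambda: defaultdict(int))
    for cell, gene_counts in counts.items():
        sample = cell_sample[cell]
        for gene, n in gene_counts.items():
            totals[sample][gene] += n
    # Convert the nested defaultdicts to plain dicts for a clean return value.
    return {sample: dict(genes) for sample, genes in totals.items()}


if __name__ == "__main__":
    # Hypothetical toy input: three cells from two samples.
    toy_counts = {
        "cell1": {"GENE_A": 3, "GENE_B": 0},
        "cell2": {"GENE_A": 1, "GENE_B": 2},
        "cell3": {"GENE_A": 0, "GENE_B": 5},
    }
    toy_samples = {"cell1": "ctrl_rep1", "cell2": "ctrl_rep1", "cell3": "treated_rep1"}
    print(pseudobulk(toy_counts, toy_samples))
```

With a single sample per condition, this yields one column per condition and no estimate of between-replicate variability, which is why the approach does not apply directly here.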