Differential expression bias in Seurat
1
0
Entering edit mode
5 months ago

Hello everyone! I am dealing with a scRNA-seq dataset having cells (SKOV3 cell culture) in 4 different conditions (2 levels of treatment). There are some biases: one of the conditions has 10 times more cells than others, whereas 3 others had doublets which have been filtered out.

Basically, clustering gives me the same result as sample annotation. clustering results samples My primary interest was detecting marker genes between clusters and/or conditions. However, running FindAllMarkers gives me a table where for each cluster there is a strong bias towards either upregulated, either downregulated genes one cluster markers. another cluster markers I tried to sample the biggest condition so all of them would be of the same size, but it didn't fix my problem.

Is it a common problem with single-cell data? Is it possible to fix it? Any advices would be appreciated.

Thanks!!

markers seurat scRNA-seq single-cell • 355 views
ADD COMMENT
1
Entering edit mode
5 months ago

This is like due at least in part due to double dipping resulting in deflated p-values, which you can read more about in the OSCA book. As such, it's often more effective and robust to focus on effect size metrics when identifying cluster markers.

Your large sample could also be biasing things, and you should think about whether integration makes any sense in this case. Normally, I'd recommend pseudobulking and traditional bulk RNA-seq methods for cross-condition comparisons, but you need replicates for that.

ADD COMMENT

Login before adding your answer.

Traffic: 2313 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6