Question

Single-cell RNA differential expression with an unbalanced number of cells between conditions

8 weeks ago

Javi.villegas21 • 0

I have a new single-nucleus RNA-seq dataset from mouse brain tissue in two conditions (3 treated vs 3 untreated) that I'm analyzing with Seurat. After annotation, I am trying to run a differential expression analysis between conditions within each subtype, but some subtypes have very different numbers of cells in each condition (for example, 30 cells in the treated group vs 350 in the untreated group).

I usually pseudobulk before running DE with DESeq2, but I'm worried that the imbalance in cell numbers between conditions might be driving the results I am seeing, since the Seurat::AggregateExpression function just aggregates the expression without taking the number of cells into account.

What would people recommend in this situation? Should I use a different pseudobulking method? I followed the approach recommended in the Seurat vignette, but I'm the only one in my lab doing this kind of analysis and I'm not sure it is the most appropriate method. Thank you very much!

single-cell differential-expression • 382 views

score 1 · Answer 1 · 2024-10-08

8 weeks ago

jared.andrews07 ★ 18k

Generally, I'd just proceed and see if it actually seems to impact things downstream.
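One quick way to see what the cell-number imbalance does at the pseudobulk level is to sum counts per sample and then scale by library size. The sketch below is only an illustration in plain Python, not Seurat or DESeq2 code: the data layout, the function names, and the toy numbers are all assumptions, and counts-per-million is used as a simple stand-in for DESeq2's size-factor normalization.

```python
from collections import defaultdict


def pseudobulk_sum(cells):
    """Sum raw counts over all cells of each sample.

    `cells` is a list of (sample_id, {gene: count}) pairs (hypothetical layout).
    Returns the summed counts per sample and the number of cells per sample.
    """
    totals = defaultdict(lambda: defaultdict(int))
    n_cells = defaultdict(int)
    for sample_id, counts in cells:
        n_cells[sample_id] += 1
        for gene, count in counts.items():
            totals[sample_id][gene] += count
    return {s: dict(g) for s, g in totals.items()}, dict(n_cells)


def counts_per_million(gene_counts):
    """Scale one sample's summed counts by its library size."""
    library_size = sum(gene_counts.values())
    return {g: 1e6 * c / library_size for g, c in gene_counts.items()}


# Toy data: identical per-cell expression, but 3 cells vs 30 cells.
cell = {"GeneA": 2, "GeneB": 6}
cells = [("treated_1", cell)] * 3 + [("control_1", cell)] * 30

summed, n_cells = pseudobulk_sum(cells)
cpm = {s: counts_per_million(g) for s, g in summed.items()}

print(summed)  # raw sums differ tenfold between the two samples
print(cpm)     # library-size-scaled values are identical
```

In this toy case the raw sums scale with the number of cells, but the difference disappears after library-size scaling. What scaling cannot fix is precision: a pseudobulk profile built from few cells is noisier, which is worth keeping in mind when checking the downstream results.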

You could randomly downsample to get equal numbers of cells in each group, but throwing out data for no real reason is generally a bad idea.
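To make the cost of downsampling concrete, here is a minimal sketch in plain Python. The function name and the dictionary-of-cell-IDs layout are hypothetical, and the 30 vs 350 group sizes are taken from the question; in practice the same idea would be applied to cell barcodes within each subtype.

```python
import random


def downsample_to_smallest(cell_groups, seed=0):
    """Randomly keep the same number of cells from every group.

    `cell_groups` maps a group label to a list of cell IDs (hypothetical layout).
    Every group is reduced to the size of the smallest one.
    """
    rng = random.Random(seed)  # fixed seed so the subsample is reproducible
    target = min(len(ids) for ids in cell_groups.values())
    return {group: rng.sample(ids, target) for group, ids in cell_groups.items()}


groups = {
    "treated": [f"t{i}" for i in range(30)],
    "control": [f"c{i}" for i in range(350)],
}
kept = downsample_to_smallest(groups)

n_before = sum(len(ids) for ids in groups.values())
n_after = sum(len(ids) for ids in kept.values())
discarded = n_before - n_after
print(f"kept {n_after} of {n_before} cells, discarded {discarded}")
```

With these group sizes, equalizing means discarding 320 of the 350 control cells, which is the data loss being warned about here.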

In addition, if you're getting consistent swings in proportions, then you might test for differential abundance and come at it from that angle.
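A first look at differential abundance can be as simple as tabulating, for each sample, the fraction of cells assigned to each subtype. The sketch below is a plain-Python illustration under assumed inputs: the `(sample_id, subtype)` layout, the function name, and the totals of 1000 cells per sample are made up for the example. It only computes proportions; an actual test would compare them across the replicate samples of each condition with a dedicated differential-abundance method.

```python
from collections import Counter, defaultdict


def subtype_proportions(cells):
    """Per-sample fraction of cells assigned to each subtype.

    `cells` is a list of (sample_id, subtype) pairs (hypothetical layout).
    """
    per_sample = defaultdict(Counter)
    for sample_id, subtype in cells:
        per_sample[sample_id][subtype] += 1
    return {
        sample: {sub: n / sum(counts.values()) for sub, n in counts.items()}
        for sample, counts in per_sample.items()
    }


# Toy data: one subtype is much rarer in the treated sample.
cells = (
    [("treated_1", "subtypeX")] * 30 + [("treated_1", "other")] * 970
    + [("control_1", "subtypeX")] * 350 + [("control_1", "other")] * 650
)
props = subtype_proportions(cells)

for sample, fractions in props.items():
    print(sample, {sub: round(frac, 3) for sub, frac in fractions.items()})
```

If a shift like this shows up consistently across replicates, the change in composition may be a result in its own right rather than only a nuisance for the DE analysis.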

Thanks! I just realized that whether I group the cells as one major subtype or split them into the minor subtypes that make it up changes the number of differentially expressed genes between conditions in these groups.

Is there any way to determine which level of grouping is best for these comparisons?

7 weeks ago by Javi.villegas21 • 0