Question

Can I perform DE analysis with Seurat if I only have one sample per condition?

0

Entering edit mode

2.1 years ago

bioinfo ▴ 150

Hello,

I have single cell RNA seq data from 2 samples. One is control and the other one is treated. I am trying to analyze the data with Seurat. I did the QC analysis, normalized each sample and then I did the integration. I got the clusters and assigned cell types. Now I would like to do the DE analysis between the control and treated CD4 cells. Can I do DE analysis if I only have one sample per condition? Does it make sense statistically to do it? I am planning to do it anyway as exploratory data analysis but I was wondering how much can I trust the p values that come out of that test.

Thank you

single RNA Seq cell seurat • 1.3k views

ADD COMMENT • link updated 2.1 years ago by LChart 4.5k • written 2.1 years ago by bioinfo ▴ 150

score 2 · Answer 1 · 2022-10-19

2

Entering edit mode

2.1 years ago

swbarnes2 14k

It will work, because you can think of each cell as one "sample". You've got lots of cells to compare to lots of cells.

ADD COMMENT • link 2.1 years ago by swbarnes2 14k

2

Entering edit mode

This will tell you the confidence you have between the two samples. This will not tell you anything about the difference between the two populations (conditions) the samples are drawn from. There is not enough information here to understand the population variance of the cluster means.

Specifically you need enough information to infer the parameters of the hierarchical model:

condition_expr ~ N(mu_c, sd_c)
sample_offset ~ N(mu_s, sd_s)
cell_noise ~ N(0, sd_e)
expr_cell = condition_expr + sample_offset + cell_noise

so this means minimally > 1 cell per sample, and > 1 sample per condition.

ADD REPLY • link 2.1 years ago by LChart 4.5k

0

Entering edit mode

(+1) You say:

this means minimally > 1 cell per sample

This is correct if you are interested in the differences between samples. However, if one is interested only in the difference between conditions it's ok, in theory, to have 1 cell per sample and many samples per condition. Effectively differential expression on bulk RNAseq works in this way. Do I get it right?

ADD REPLY • link 2.1 years ago by dariober 15k

0

Entering edit mode

True, as far as it goes. Practically speaking you'll always have some replication at the cell level.

ADD REPLY • link 2.1 years ago by LChart 4.5k