Question

scRNAseq - DE

0

Entering edit mode

4.3 years ago

garciajessica0311 ▴ 10

Hi, I performed single-cell RNA sequencing of 26 samples. I have fold-change values for all samples, with my genes of interest (n=57). The data has been previously filtered and normalized, . I represented my data in an heatmap but I would like to show that genes are not differentially expressed between my samples. What is the best statistical methodology ? Thanks for your help, Best, Jessica

scRNAseq RNA-Seq DE • 1.4k views

ADD COMMENT • link 4.3 years ago by garciajessica0311 ▴ 10

0

Entering edit mode

When you say 26 samples, do you mean 26 cells, or many cells from 26 samples? When you say you want to show that htey are not DE, do you mean you want a test to show each each gene is not differentially expressed, or you want to test whether they, as a group, are not differentially expressed.

ADD REPLY • link 4.3 years ago by i.sudbery 20k

0

Entering edit mode

I have 26 cells. Yes, I looking for a test that shows that the gene is equally expressed in each cell.

ADD REPLY • link 4.3 years ago by garciajessica0311 ▴ 10

0

Entering edit mode

I have fold-change values for all samples, with my genes of interest (n=57). The data has been previously filtered and normalized, .

How was the data filtered and normalized? It sounds like you are cherry picking genes you want, is that correct?

ADD REPLY • link 4.3 years ago by GenoMax 147k

0

Entering edit mode

The data was filtered with CPM and normalized with VOOM. From normalization step, I have a table with normalized log values for all filtered genes. Among filtered genes, I have filtered the 57 genes.

ADD REPLY • link 4.3 years ago by garciajessica0311 ▴ 10

0

Entering edit mode

What type of single-cell sequencing was this? Chromium? Plate-based?

Where do you derive the fold change values from, i.e. what type of comparison have you done?

ADD REPLY • link 4.3 years ago by Friederike 9.0k

0

Entering edit mode

It's a template-switching method that enables to generate full-lenght cDNA. Sorry, it's log values

ADD REPLY • link 4.3 years ago by garciajessica0311 ▴ 10

2

Entering edit mode

typically, one identifies differentially expressed genes by comparing the values of multiple replicates for, say, two conditions. E.g. expression values from 6 MUT samples vs. 6 WT samples. The replicates are needed because they are assumed to capture the typical biological variability that is to be expected for the expression measurements of a single gene across multiple samples and the null hypothesis typically states that there is no systematic difference between the two groups.

As the many questions and comments illustrate, it is not entirely clear to us what the outcome of your analysis is supposed to be; we'll need more details around your experiment and the underlying question of interest. Do you wish to say that the gene expression values you're capturing across your 26 cells are consistent and in the realm of biological variability? Do you have a set of control genes where you know you should be seeing differences? Why do you not have different conditions that could be compared?

ADD REPLY • link 4.3 years ago by Friederike 9.0k

0

Entering edit mode

To clarify this experiment : I used cell line model to perform scRNAseq. My workflow is divided in 3 parts (time-point) and I recovered cells (at single-cell level or small bulk) at each time-point : - TP1 : triplicates - TP2 : triplicates - TP3 : 20 samples (3 bulks and 17 single-cells) The end-point is to evaluate the impact of the workflow on my cells by comparing transcriptomic data at each time point, including apoptosis pathway. I have filtered 57 genes involved in apoptosis pathway. I am looking for the best statistical method to evaluate whether these genes are differentially expressed or not. We can considered that TP1 is the reference.

ADD REPLY • link 4.3 years ago by garciajessica0311 ▴ 10

0

Entering edit mode

I'm sorry, you lost me; I cannot envision the experimental set up you're describing.

That being said, to test for differential gene expression, you can go several routes. With the few samples that you have, you could either use limma (since you already voom-transformed the data anyway) or you can follow the protocol described in the OSCA resource. Hope that helps!

ADD REPLY • link 4.3 years ago by Friederike 9.0k

0

Entering edit mode

I have fold-change values for all samples

Against what were those values compared to get fold changes?

ADD REPLY • link 4.3 years ago by WouterDeCoster 47k

0

Entering edit mode

Sorry, it's log values not fold-change values

ADD REPLY • link 4.3 years ago by garciajessica0311 ▴ 10