Two peak in histogram before single cell RNA-seq QC & Filtering
1
0
Entering edit mode
2.8 years ago

Hi all,

I am analyzing the scRNA-seq data on immune cells in brain and PBMC. First, before QC and Filtering, I checked the distribution of nFeature (nGene) and uCount (nUMI) with histogram. But, in PBMC data, two peaks were identified in the histogram of nFeature. One occurred at very low values, such as 100, and the other occurred at values above 1000.

I'm newbie, so I haven't been able to deal with a lot of data. Is this a common result? Or is there something wrong with the data? In order to remove low quality cells or empty droplets, cells with low expression are usually removed (ex nFeature < 100 ~ 500). However, I don't know what to do in this situation. How should the criteria be set for removal of low quality cells or empty droplets?

enter image description here

QC PBMC scRNAseq histogram • 1.6k views
ADD COMMENT
0
Entering edit mode

Bl is Blood (PBMC), Br is Brain, CT is Control, GT is Case.

ADD REPLY
0
Entering edit mode
2.8 years ago

Looks to me like your blood samples are just lower quality/have more empty droplets/dead cells. I'd remove those two lower peaks outright and use a MAD-based threshold approach for the rest.

ADD COMMENT
0
Entering edit mode

Thank you for reply! So, your answer means that the peaks at the low value is not due to wrong data or the characteristics of the blood sample, but just because there are many low quality cells (empty droplets/dead cells). Right?

ADD REPLY
0
Entering edit mode

That's my interpretation, though other QC metrics (mito %, # cells, nCount) would be helpful as well. Generally, if you fail one metric, you fail multiple for a given droplet.

ADD REPLY
0
Entering edit mode

Thanks a lot!!

ADD REPLY

Login before adding your answer.

Traffic: 2778 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6