How to filter scRNA-seq data by percentage of cells expressing a gene, counts, and other criteria
15 months ago
alwayshope ▴ 40

Dear experts,

How should I filter scRNA-seq data? If I keep only genes expressed in more than 10% of all cells, about 8,000 genes remain; if I keep only genes expressed in more than 50% of all cells, about 4,000 remain.

Which approach is better? Should I use the relatively loose threshold and keep genes expressed in more than 10% of all cells?

Thank you very much!

scRNA-seq • 734 views
15 months ago

These types of questions have been asked here probably a dozen times. In short, nobody can tell you specific numbers; you must let the data guide you.

Limiting to genes expressed in half of all cells is a very bad idea. I don't see a reason to filter genes beyond the absolute bare minimum. Even the 10% requirement is a bad idea: what if a gene is expressed by every cell of a population that makes up only a small fraction of all cells (say 5%)?

Don't throw out data without a reason.
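
To make the rare-population point concrete, here is a minimal sketch in plain Python on a made-up count matrix. The gene names, the counts, the `genes_passing` helper, and the "at least 3 cells" minimal cutoff are all assumptions chosen for illustration, not values taken from the question or from any real dataset.

```python
# Toy count matrix: each gene maps to its counts across 100 cells.
# Cells 0-4 stand in for a hypothetical rare population (5% of all cells).
n_cells = 100
counts = {
    "housekeeping": [3] * n_cells,                  # detected in every cell
    "rare_marker": [8] * 5 + [0] * (n_cells - 5),   # detected only in the rare population
    "noise": [1] + [0] * (n_cells - 1),             # detected in a single cell
}

def genes_passing(counts, min_fraction=0.0, min_cells=0):
    """Return genes detected (count > 0) in at least `min_cells` cells
    and in at least `min_fraction` of all cells."""
    kept = []
    for gene, row in counts.items():
        n_detected = sum(1 for c in row if c > 0)
        if n_detected >= min_cells and n_detected / len(row) >= min_fraction:
            kept.append(gene)
    return kept

# A 10% cutoff drops the marker of the 5% population.
strict = genes_passing(counts, min_fraction=0.10)

# A bare-minimum cutoff (here: detected in at least 3 cells, an
# illustrative choice) keeps it and still drops the single-cell gene.
minimal = genes_passing(counts, min_cells=3)

print("10% filter keeps:", strict)
print("min-cells filter keeps:", minimal)
```

In this toy case the 10% filter discards `rare_marker` even though every cell of the rare population expresses it, while the minimal filter keeps it.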


Thanks a lot for your kind guidance!

Yes, the caveat of this filter is that it may remove important genes that are expressed in only a small fraction of cells.

It seems better not to apply this extra filter. The tutorial only filters on nFeature, nCount, and the mitochondrial ratio as basic filtering.
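
For reference, the three per-cell metrics mentioned here can be sketched in plain Python as follows. This is only an illustration under stated assumptions: the cells and counts are invented, the `MT-` prefix assumes human gene symbols, and the cutoffs in `passes_qc` are hypothetical placeholders that should be chosen by looking at the distributions in your own data.

```python
# Toy data: per-cell counts keyed by gene symbol (all values invented).
cells = {
    "cell_ok":    {"ACTB": 500, "GAPDH": 400, "CD3E": 80, "MT-CO1": 20},
    "cell_empty": {"ACTB": 2,   "GAPDH": 0,   "CD3E": 0,  "MT-CO1": 0},
    "cell_dying": {"ACTB": 50,  "GAPDH": 30,  "CD3E": 0,  "MT-CO1": 120},
}

def qc_metrics(gene_counts):
    """Compute nCount (total counts), nFeature (genes detected), and
    the percentage of counts from mitochondrial genes for one cell."""
    n_count = sum(gene_counts.values())
    n_feature = sum(1 for c in gene_counts.values() if c > 0)
    mito = sum(c for g, c in gene_counts.items() if g.startswith("MT-"))
    percent_mt = 100.0 * mito / n_count if n_count else 0.0
    return {"n_count": n_count, "n_feature": n_feature, "percent_mt": percent_mt}

def passes_qc(m, min_features=3, min_counts=100, max_percent_mt=10.0):
    """Hypothetical thresholds, sized for this toy data only."""
    return (m["n_feature"] >= min_features
            and m["n_count"] >= min_counts
            and m["percent_mt"] <= max_percent_mt)

kept = [name for name, gc in cells.items() if passes_qc(qc_metrics(gc))]
print("cells kept:", kept)
```

Note that this filtering acts on cells, not genes, so it does not remove markers of rare populations the way a percent-of-cells gene filter can.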

Best!
