Question

Does scRNA-seq Tabula Sapiens store: counts or log2( 1+counts ) or most probably something else ?

0

Entering edit mode

3.3 years ago

Alexander ▴ 220

The project "Tabula Sapiens" ( https://tabula-sapiens-portal.ds.czbiohub.org/ ) provides scRNA-seq data for huge number of single cells transcripts ( nearly 500,000 cells from 24 organs of 15 normal human subjects ). One can download count matrices of scRNA-seq data here: https://figshare.com/articles/dataset/Tabula_Sapiens_release_1_0/14267219

I took a brief look on data (https://www.kaggle.com/alexandervc/look-on-tablasapiens-bone-marrow) and a bit puzzled by the following:

Question: What is the format for these count matrices - counts, log( 1+ counts ) or probably something else ?

Example: https://www.kaggle.com/alexandervc/look-on-tablasapiens-bone-marrow?scriptVersionId=69932358&cellId=10

The values are e.g. 1.6892005, 1.7121489

if I take summation by genes or summation of 2**( these expressions) or summation exp(these expressions) : neither of these correspond to 'n_counts_UMIs' - provided by separate column. So that seems to me that it is not neither counts, neither log(counts).

scRNA-seq scRNAseq TabulaSapiens • 1.5k views

ADD COMMENT • link 3.3 years ago by Alexander ▴ 220

score 1 · Answer 1 · 2021-08-05

1

Entering edit mode

3.3 years ago

fracarb8 ★ 1.7k

If you look at the top of the page you linked, you can see that the data you downloaded is from figShare. If you look in the portal you will find that they provide raw data, process data and cellxgene collection. figShare is the processed data. You can also find that the data was processed using scanpy, so the values are likely normalised/scaled counts from scanpy. You should read their publication, or have a better look at their portal, and find exactly how the data was processed.

ADD COMMENT • link 3.3 years ago by fracarb8 ★ 1.7k

0

Entering edit mode

Thank you ! As far as I understand publication is NOT yet available. I am missing mentioning scanpy - where do you see it ? Any way scanpy has several versions of normalizations .

ADD REPLY • link 3.3 years ago by Alexander ▴ 220

0

Entering edit mode

You find the reference to scnpy at the bottom of the page, below [Tabula Sapiens on figshare]1, That's exactly why you need to check how the processed the data. If you cannot find on the portal, you should contact them via email, and ask.

ADD REPLY • link 3.3 years ago by fracarb8 ★ 1.7k

0

Entering edit mode

It is written "to use with scanpy " not "normalized with scanpy". Any way thank you.

ADD REPLY • link 3.3 years ago by Alexander ▴ 220