Does scRNA-seq Tabula Sapiens store: counts or log2( 1+counts ) or most probably something else ?
1
0
Entering edit mode
3.3 years ago
Alexander ▴ 220

The project "Tabula Sapiens" ( https://tabula-sapiens-portal.ds.czbiohub.org/ ) provides scRNA-seq data for huge number of single cells transcripts ( nearly 500,000 cells from 24 organs of 15 normal human subjects ). One can download count matrices of scRNA-seq data here: https://figshare.com/articles/dataset/Tabula_Sapiens_release_1_0/14267219

I took a brief look on data (https://www.kaggle.com/alexandervc/look-on-tablasapiens-bone-marrow) and a bit puzzled by the following:

Question: What is the format for these count matrices - counts, log( 1+ counts ) or probably something else ?

Example: https://www.kaggle.com/alexandervc/look-on-tablasapiens-bone-marrow?scriptVersionId=69932358&cellId=10

The values are e.g. 1.6892005, 1.7121489

if I take summation by genes or summation of 2**( these expressions) or summation exp(these expressions) : neither of these correspond to 'n_counts_UMIs' - provided by separate column. So that seems to me that it is not neither counts, neither log(counts).

scRNA-seq scRNAseq TabulaSapiens • 1.5k views
ADD COMMENT
1
Entering edit mode
3.3 years ago
fracarb8 ★ 1.7k

If you look at the top of the page you linked, you can see that the data you downloaded is from figShare. If you look in the portal you will find that they provide raw data, process data and cellxgene collection. figShare is the processed data. You can also find that the data was processed using scanpy, so the values are likely normalised/scaled counts from scanpy. You should read their publication, or have a better look at their portal, and find exactly how the data was processed.

ADD COMMENT
0
Entering edit mode

Thank you ! As far as I understand publication is NOT yet available. I am missing mentioning scanpy - where do you see it ? Any way scanpy has several versions of normalizations .

ADD REPLY
0
Entering edit mode

You find the reference to scnpy at the bottom of the page, below [Tabula Sapiens on figshare]1, That's exactly why you need to check how the processed the data. If you cannot find on the portal, you should contact them via email, and ask.

ADD REPLY
0
Entering edit mode

It is written "to use with scanpy " not "normalized with scanpy". Any way thank you.

ADD REPLY

Login before adding your answer.

Traffic: 1810 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6