I don't understand the HDF5 file from 10x genomics scRNAseq
2.5 years ago

Hi!

I have recently started my PhD and I'm trying to open an HDF5 file in R. The file was generated by the 10x Genomics Cell Ranger pipeline.

Can someone please explain what i, p, Dim, Dimnames, x and factors mean, or link to somewhere I can read about them in more detail? I have tried Google but haven't found anything good :/

Dimnames[[1]] and Dimnames[[2]] are obviously the identified genes and the barcodes, but I don't understand what the others are.


Please help me! :)

best / Jonas

10x hdf5 • 2.0k views

You should read the Seurat manual; you may find explanations of the items you mentioned there.


Thanks for the tip! You are right, I need to start from the basics :)

2.5 years ago
ATpoint 85k

What you are looking at is a compressed sparse matrix format: rows are genes and columns are cells (barcodes). There is not much to "see" there for the human eye other than a lot of integers, and the underlying counts are mostly zeros. In any case, you are not going to run any analysis by eye. As already suggested, you should follow a guided tutorial such as the Seurat ones or, my favourite, the extensive Bioconductor single-cell book https://bioconductor.org/books/release/OSCA/ to get started. Basically, you need to turn this matrix into an analysis object, which in Bioconductor would be a SingleCellExperiment, and then start the analysis as described, for example, in that book. Does that make sense to you?
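
To make the slots from the question concrete, here is a minimal sketch using the Matrix package. It assumes the object you loaded is a dgCMatrix (the usual column-compressed sparse class in R); the toy genes, barcodes and counts are made up for illustration and are not from your file.

```r
library(Matrix)  # recommended package that ships with R

# Toy count matrix: 3 genes (rows) x 4 cells (columns). All names are made up.
dense <- matrix(
  c(0, 2, 0,   # cell 1
    0, 0, 0,   # cell 2 (no counts at all)
    5, 0, 1,   # cell 3
    0, 0, 3),  # cell 4
  nrow = 3,
  dimnames = list(c("geneA", "geneB", "geneC"),
                  c("AAAC-1", "AAAG-1", "AACT-1", "AAGG-1"))
)
m <- Matrix(dense, sparse = TRUE)  # stored as a dgCMatrix

m@x         # non-zero values, column by column:            2 5 1 3
m@i         # 0-based row index of each value in x:         1 0 2 2
m@p         # 0-based offsets where each column starts in x: 0 1 1 3 4
m@Dim       # number of rows and columns:                    3 4
m@Dimnames  # list(row names = genes, column names = barcodes)
m@factors   # cache for matrix factorizations, normally an empty list

# Non-zero entries per cell are the differences between column pointers.
nnz_per_cell <- diff(m@p)

# Rebuild one column by hand from i, p and x (here column 3).
j <- 3
idx <- m@p[j] + seq_len(m@p[j + 1] - m@p[j])  # positions in x for column j
col3 <- numeric(m@Dim[1])
col3[m@i[idx] + 1] <- m@x[idx]                # +1 because i is 0-based
```

So x holds the counts, i says which gene each count belongs to, and p says where each cell's block of counts starts and ends. A matrix like this can then be wrapped in an analysis object, for example with SingleCellExperiment(assays = list(counts = m)) if the SingleCellExperiment package is installed.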


Thanks, I will check this out!
