snRNA seq analysis - huge computational demand due to large number of entries
1
0
Entering edit mode
16 months ago
applepie ▴ 10

Hello Everyone:

I have been conducting single nuclear RNA sequencing (snRNA-seq) analysis of data downloaded from gene expression omnibus (GEO). (The accession number of the dataset is GSE200366, which is a mouse dataset)

I have used the Seurat Packages to load one of the feature-barcode matrix. However, when I tried to view the metadata, it turned out that there were 6,794,880 entries (as shown in the attached figure).

Regarding this, I have two questions:

  1. Would it be too computationally demanding to analyse the the snRNA-seq with such enormous number of entries?

  2. How can I smoothen and quicken the process of downstream analysis even in the presence of such large number of entries ?

Thanks! enter image description here

seq snRNA • 976 views
ADD COMMENT
0
Entering edit mode

Please follow a standard workflow such as (or Seurat): https://bioconductor.org/books/release/OSCA/ These account for this sort of thing.

ADD REPLY
0
Entering edit mode
16 months ago
predeus ★ 2.1k

You've added "raw" matrix, which has lots of barcodes that are pretty much empty droplets. What you need is a filtered output - using an algorithm like EmptyDrops, you can identify the barcodes that most probably contain cells (in your case, nuclei), and only load that part. Normally, Cell Ranger or other tools would make a filtered matrix for you. It seems like the authors has only shared the raw matrices, however. You can install R package named DropletUtils and run emptyDrops function on your raw matrix.

Hope this helps.

ADD COMMENT
0
Entering edit mode

Ok! I really appreciate your help.

One more question: Can emptyDrops be directly applied to seurat object?

ADD REPLY
0
Entering edit mode

I don't remember - I think you might have to convert it to SingleCellExperiment

Or, alternatively, you can read the Cell Ranger output you're looking at as SCE, run emptyDrops on it, and then save the filtered matrix.

ADD REPLY

Login before adding your answer.

Traffic: 1529 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6