I'm trying to look through a published dataset to check gene expression for my own project. I am having trouble finding material that teaches how to analyze single-cell RNA sequencing data from the NCBI GEO database with Seurat in R. The dataset contains multiple barcodes.tsv.gz, genes.tsv.gz and matrix.mtx.gz files. How should I analyze them? Your help is much appreciated. Thanks in advance!
Dataset
GSE128423, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE128423
The files in the dataset look like this:
GSM3674224_std1.barcodes.tsv.gz GSM3674231_ctrl_16May.matrix.mtx.gz GSM3674239_b1.genes.tsv.gz
GSM3674224_std1.genes.tsv.gz GSM3674232_ctrl_26May.barcodes.tsv.gz GSM3674239_b1.matrix.mtx.gz
GSM3674224_std1.matrix.mtx.gz GSM3674232_ctrl_26May.genes.tsv.gz GSM3674240_b2.barcodes.tsv.gz
GSM3674225_std2.barcodes.tsv.gz GSM3674232_ctrl_26May.matrix.mtx.gz GSM3674240_b2.genes.tsv.gz
GSM3674225_std2.genes.tsv.gz GSM3674233_ctrl_7Jun.barcodes.tsv.gz GSM3674240_b2.matrix.mtx.gz
GSM3674225_std2.matrix.mtx.gz GSM3674233_ctrl_7Jun.genes.tsv.gz GSM3674241_b3.barcodes.tsv.gz
GSM3674226_std3.barcodes.tsv.gz GSM3674233_ctrl_7Jun.matrix.mtx.gz GSM3674241_b3.genes.tsv.gz
GSM3674226_std3.genes.tsv.gz GSM3674234_ctrl_8May.barcodes.tsv.gz GSM3674241_b3.matrix.mtx.gz
GSM3674226_std3.matrix.mtx.gz GSM3674234_ctrl_8May.genes.tsv.gz GSM3674242_b4.barcodes.tsv.gz
This is the standard output of a cellranger count analysis. You can use these files with Seurat by following one of the tutorials below.
https://www.fimm.fi/sites/default/files/Seurat-guideline-10x.pdf
https://hbctraining.github.io/scRNA-seq/lessons/03_SC_quality_control-setup.html
@GenoMax,
Thank you very much for your reply. I know how to use Seurat with Cell Ranger output, but I'm not sure how to deal with multiple files (several barcodes.tsv.gz files, several matrix files, etc.).
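
One way to approach the multiple-files problem: in the listing above, every file name starts with a GSM accession and a sample label, so each sample appears to have its own barcodes/genes/matrix triplet. The sketch below only groups file names by that prefix so that each triplet can be loaded as a separate sample. It is a minimal illustration, not a tested pipeline. The file-name pattern is an assumption inferred from the listing, and the helper names group_by_sample and complete_samples are hypothetical.

```python
import re
from collections import defaultdict

# Assumed naming pattern, inferred from the listing in the question:
#   <GSM accession>_<sample label>.<barcodes.tsv | genes.tsv | matrix.mtx>.gz
FILE_PATTERN = re.compile(
    r"^(?P<sample>GSM\d+_.+)\.(?P<kind>barcodes\.tsv|genes\.tsv|matrix\.mtx)\.gz$"
)


def group_by_sample(filenames):
    """Map each sample prefix to its barcodes/genes/matrix file names."""
    samples = defaultdict(dict)
    for name in filenames:
        match = FILE_PATTERN.match(name)
        if match is None:
            continue  # skip anything that does not look like a 10x matrix file
        kind = match.group("kind").split(".")[0]  # "barcodes", "genes" or "matrix"
        samples[match.group("sample")][kind] = name
    return dict(samples)


def complete_samples(samples):
    """Keep only samples for which all three files are present."""
    required = {"barcodes", "genes", "matrix"}
    return {s: files for s, files in samples.items() if required.issubset(files)}


if __name__ == "__main__":
    listing = [
        "GSM3674224_std1.barcodes.tsv.gz",
        "GSM3674224_std1.genes.tsv.gz",
        "GSM3674224_std1.matrix.mtx.gz",
        "GSM3674225_std2.barcodes.tsv.gz",
        "GSM3674225_std2.genes.tsv.gz",
        "GSM3674225_std2.matrix.mtx.gz",
    ]
    for sample, files in complete_samples(group_by_sample(listing)).items():
        print(sample, files)
```

Once the triplets are known, each one can be placed in its own directory and read as an individual sample in Seurat (for example with Read10X and CreateSeuratObject), and the resulting objects can then be combined with merge. Read10X looks for specific file names inside a directory, so the GSM prefix will probably have to be stripped when copying the files; check the documentation of your Seurat version for the exact names it expects.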