Question

Intermixed clusters and cell types in UMAP for scRNA-seq analysis

0

Entering edit mode

4 months ago

bio_info ▴ 20

Hello all! I would like to preface this post by saying that I am not a bioinformatician and I need help with the clustering of a scRNA-seq PBMC dataset and generating a UMAP with different annotated cell types. I am running the the R script below and have annotated the cell-types with SingleR using the Monacco Immune Dataset to identify broad cell types (B cells, T cells, NK cells etc.). However in my UMAP all the cell types are intermixed and I would like them to form separate clusters.

# 1. Normalization -----
pbmc <- NormalizeData(pbmc)

# 2. Find variable features -----
pbmc <- FindVariableFeatures(pbmc, nfeatures = 5000)

# 3. Scale data -----
pbmc <- ScaleData(pbmc, vars.to.regress = "percent.mt")

# 4. Compute PCA -----
pbmc <- RunPCA(pbmc)

# 5. Determine dimentionality of the dataset (how many PCs to capture?) -----
ElbowPlot(pbmc, ndims = 50)

# 6. Compute neighbors and clusters -----
pbmc <- FindNeighbors(pbmc, dims = 1:20)

pbmc <- FindClusters(pbmc, resolution = 0.095, group.singletons = F, algorithm = 1)

# 7. Build Cluster tree -----
pbmc <- BuildClusterTree(pbmc, reorder = T, reorder.numeric = T)

# 8. Run UMAP -----
pbmc <- RunUMAP(pbmc, dims = 1:20, n.neighbors = 20)

# 9. Plot UMAP colored by SingleR labels -----
DimPlot(pbmc,
        reduction = "umap",
        raster = F,
        label = T,
        repel = T) | DimPlot(pbmc,
                             reduction = "umap",
                             raster = F,
                             repel = T,
                             group.by = "cell_type") + ggtitle("Cell type annotation")

Can someone please advice me what changes to make to the code to separate the clusters as it looks like there is B cell expression everywhere. I have tried playing around with the resolution and algorithm used in FIndClusters but so far nothing works for me. enter image description here

PBMC UMAP scRAN-seq • 287 views

ADD COMMENT • link updated 4 months ago by jaro.slamecka ▴ 270 • written 4 months ago by bio_info ▴ 20

0

Entering edit mode

Can you share the part where you ran SingleR on your data? You might also need to share some QC plots and details on the library prep/sequencing technology

ADD REPLY • link 4 months ago by jaro.slamecka ▴ 270