Hi,
I am analyzing single cell mouse data of a tissue with a lot of immune infiltration. When focusing on immune cells, clustering with Seurat gave me 5 subtypes using a coarse resolution level, or 8 subclusters using a finer resolution.
I have already manually annotated them into general immune cell types:
> table(droplevels(md[,c("REclustering_res0.5_ct_immune","cell_type_manual_res2_cluster")]))
cell_type_manual_res2_cluster
Immune - 3 Immune - 10 Immune - 11 Immune - 15 Immune - 16
Macrophages-0 0 355 0 0 0
Macrophages-3 0 266 0 0 0
Macrophages-6 0 0 0 0 64
Macrophages-7 57 1 0 0 0
B cells-2 0 0 0 0 305
T cells-1 0 0 0 319 0
T cells-4 0 0 0 196 0
T cells-5 0 0 163 0 0
This is a very rough analysis and I need to do a better classification and annotation of these.
I've found several automated immune cell annotation tools, but they seem to work only on human data, and not mouse. Tools like sc-ImmuCC
and ImmunIC
. Other similar tools like ImmuCellAI-mouse
showed potential, but they only work on immune cell abundance for bulk, not for single cell.
What are my options to annotate these MOUSE immune cells?
When annotating, I like to combine canonical markers with automated annotation systems - the go to for mouse immune populations I use is
1) SingleR with the ImmGen database in the CellDex bioconductor package
2) Azimuth with the Pan Sci Mouse data set
These two methods are orthogonal (SingleR is using correlations and Azimuth is mapping the single-cells to a reference atlas). If you are looking at bone marrow immune populations, I have found Haemopedia RNA-seq to be particularly useful with singleR. For both the Pan Sci Mouse and Haemopedia data, you will have to convert the Ensembl gene to actual gene symbols, here is my code to do it:
Thanks!
I've already used the SingleR method with some success, but I'm stuck trying Azimuth.
I have downloaded the Pansci mouse rds object, but R is crashing while doing the ScaleData / RunPCA/UMAP / SCTransform steps before it can be turned into an Azimuth reference. I have subseted the original object into keeping only immune cells, but even after using Seurat's SketchData to use only 25k cells, R is crashing in my 64Gb RAM computer.
How did you generate the Azimuth object?
I struggled with memory issue to on my laptop - this is the code that worked for me that I ran on my workstatsion
Look at the gene markers driving each cluster to see if you can identify specific cell population. You can also fetch from online databases or create yourself, lists of genes specific to each immune population you are looking for and generate a module score of each list for each cell using
AddModuleScore
from Seurat. Another option, would be to find published dataset with a wide range of annotated immune clusters in mouse and label transfer their annotation to your dataset to annotate your cluster.