The public breast cancer data I am using contains annotation in metadata(like epithelial, B cell , T cell etc)..but I want to annotate different subtypes of epithelial cells like luminal epithelial, basal epithelial cells....different subtypes of T cells & B cells....and so on.
While annotating breast cancer scRNA data using singleR (Human primary cell atlas as reference) I got clusters labelled as hepatocytes, chondrocytes etc cell type. But we are not supposed to find these cell types in breast cancer tissue.
1. What did I do wrong here?
library(Seurat)
library(celldex)
library(SingleR)
hpca.se <- celldex::HumanPrimaryCellAtlasData()
merged_seurat_objects_filtered_harmony_integrated_SingleR <- GetAssayData(merged_seurat_objects_filtered_harmony_integrated, slot="data")
hesc <- SingleR(test = merged_seurat_objects_filtered_harmony_integrated_SingleR, ref = hpca.se, labels = hpca.se$label.main)
merged_seurat_objects_filtered_harmony_integrated$celltype <- hesc$labels
2. Can I use marker genes from the differentially expressed gene of each cluster to annotate the cell subtypes?
3. What if no marker gene is present in the DEGs? How will I annotate?
4. Aren't the DEGs affected by cancer state?
5. Any tutorial/ paper to clear these doubts regarding sc RNA seq annotation?