Hi,
I'm trying to analyze my data set using the annotated data from the tabula muris repository.
I have downloaded the FACS-sorted data, read it into R, as well as my own data set.
After QC and filtering of both sets, I have tried to integrate them, but now I'm not sure how to proceed.
first pre-processing my data set
standard <- NormalizeData(standard, normalization.method ="LogNormalize", scale.factor = 10000)
standard <- FindVariableFeatures(standard, selection.method = "mean.var.plot",
mean.cutoff = c(0.0125, 3),
dispersion.cutoff = c(0.5, Inf))
standard <- ScaleData(standard, vars.to.regress = c("nCount_RNA", "percent.mt"), verbose = TRUE)
This data is not annotated and I would like to use the annotations from the tabula muris data to achieve that.
Next, I prepare the data from tabula muris. After creating the Seurat object.list from several tables I pre-process the list.
for (i in 1:length(object.list)) {
object.list[[i]] <- NormalizeData(object.list [[i]], verbose = FALSE)
object.list[[i]] <- FindVariableFeatures(
object.list[[i]], selection.method = "vst",
nfeatures = 2000, verbose = FALSE)
object.list[[i]] <- ScaleData(object.list [[i]], verbose = FALSE) # Scale the data using the selected features.
object.list[[i]]$batch <- paste0("Batch", i)
}
Followed by the integration
integration.features <- SelectIntegrationFeatures(object.list = object.list, nfeatures = 2000)
anchors <- FindIntegrationAnchors(
object.list = object.list,
anchor.features = integration.features
)
integrated <- IntegrateData(anchorset = anchors, normalization.method = "LogNormalize")
This results in another Seurat Object.
Now I need to combine them, but I'm not sure how to do so.
Do I need to use the Seurat object, Integrated
or better to use the anchors
object to annotate my standard
object?
Can I just do merge?
combined <- merge(integrated, y = standard)
combined <- ScaleData(combined)
combined <- RunPCA(combined)
combined <- RunUMAP(combined, dims = 1:30)
#DimPlot(combined, group.by = "batch", reduction = "umap") + ggtitle("Integration Quality by Batch")
What you are integrating at the moment, from your code, are the different batches of the tabula muris.
Which version of Seurat do you have, 4 or 5 ?
Also, why are you regressing
nCount_RNA
? This will be normalized atNormalizeData
stepThe
object.list
integrate different tissues files.batch*
is just a way for me t differentiate between their origin.Yes I'm using V5.
didn't see my mistake there, thanks for noticing.