I have gathered Single-cell RNA-seq data from various studies, and to address batch effects among them, I applied the scGen
method. This method was chosen because it resulted in improved clustering and provided a tabular output.
Now, prior to proceeding with the calculation of Expression levels (average expression for each gene across cells), I have the following questions:
- Is it appropriate to normalize the data
NormalizeData()
after batch correction? I believe we need to do it since batch correction is for unwanted variables/ technical variables, not library size. - The
scGen
method yields negative values (likely due to its requirement of log-normalized data for training the model). Should I be concerned about these negative values, or are they inconsequential given that my goal is to obtain a normalized count table?
I really appreciate any help you can provide!
nice find - but, feels bad man. miss sean davis being here.