Hello,
I am trying to follow this workflow for scRNA seq analysis. https://broadinstitute.github.io/2020_scWorkshop/data-wrangling-scrnaseq.html#filtering-low-quality-cells
I have loaded the data and computed the percentage of mitochondrial genes for each cell
However, the next step is to load a list of housekeeping genes. A file I do not have. Does anyone have any idea on where to locate this file.
Thanks!
Kind regards,
You don't need this. It's not standard to do this "housekeepers per cell", and generally the term "housekeeper" is poorly defined. Just do the normal "% mitochondrial reads", "total counts per cell" and "detected genes per cell", and then filter cells that are a certain cutoff away, like 3xMAD. If you have multiple celltypes, batches, or conditions, then do QC per batch/celltype/condition. As mentioned already, go easy on cutoffs, rather be lenient than too strict, as you can always go back and make it stricter if you realize downstream that you might still have poor-quality cells in there that cause problems.
a simply google search returns the file you are looking for.