Entering edit mode
3.0 years ago
smurph50
▴
50
I would like to use R to generate cluster my 20k cells from a single cell ATAC-seq experiment.
I ran PCA then selected the first 50 components, which were put into tSNE's normalize_input() then Rtsne(). This is the result I get.
I tried multiple perplexities from 5 to 50, number of components from 20 to 200, and UMAP. However the results were roughly the same.
Do you know what could cause this? I did not filter out peaks before running this because I am not sure what cutoff to use.
Show your full code or we're just going to be guessing at what you did. I'd recommend using a framework meant for scATAC analysis like ArchR or Signac.
Thanks Jared! I will use one of those to generate a tsne rather than troubleshooting. Here is my code.
I strongly recommend to follow a guided tutorial and use a dedicated package as mentioned above.
Points that may miss here:
For single-cell RNA-seq I follow https://bioconductor.org/books/release/OSCA/ but I have no hands-on with scATAC-seq so far. I guess just following
ArchR
orSignac
vignettes will save you a lot of trouble.Have you tried normalizing the data before PCA?