Hello,
I am currently analyzing a publically available dataset with ~200 samples. I want to cluster the samples based on their expression to identify disease endotypes. I already tried out hclust
and NMF
. I know that the Leiden
algorithm is often used in single cell analysis and performs quite well there, so my idea was to also try this out. Ultimately, I would simply pretend that my bulk RNAseq samples are "cells" so that I can use Seurat to perform the clustering steps.
However, I did not find any papers in the literature that used the Leiden algorithm to perform bulk RNA seq clustering. This makes me wonder, if I am overlooking something and that the Leiden algorithm or my approach (pretend samples are cells so I can use Seurat) is not suitable.
I would appreciate any insights or comments on this. Thanks!
Thank you for your comment! The resources you linked are very helpful and I have to say that I did not find them before.
hclust
worked okay for me, but I thought I might try out several algorithm to see if the identified clusters are "robust", i.e. insensitive to the algorithm I am using.