Hi!
I am working on the analysis of microbial data (relative abundances) coming from a large cohort. I have a matrix of Bray-Curtis distances of our samples, and I've ran PCOA on the distance matrix. Our samples seem to cluster into two main cluster, interestingly this clustering doesn't seem to be led by any metadata feature we are aware of. Our plot looks like this
I would like to define the two clusters using k-means (or some other clustering method, k-means just seems to be popular for this kind of analysis), draw the cluster borders on the plot and get a list for the samples belonging to each cluster. Currently I feel a bit stuck at this step. Is there an R or Python package for such analysis that would make my work easier?
Thanks in advance!