I'm hoping for some help on drug clustering by gene list. The data I have has not come through any particular workflow, it's all held in text files.
I have a 100 drugs and per drug I have a list up/down genes. Whether a gene is up/down regulated is purely by its presence in a text file (as two columns, "up" and "down"). There is no numerical expression data at all.
I have very little clustering experience. I was hoping to learn whether it would be possible cluster among the 100 drugs by their up/down gene lists similarity. As for a clustering cut-off, I'm not entirely sure and I am happy for that to be exploratory for the time being providing it finds some separation.
Anything available in R would be very helpful.
Thanks
It may be worth exploring the literature for some practical examples, if you have not done so already. Some random examples are below:
On the drug front, have a look at the Open Targets Platform and its batch search. It may useful to try it out for your list of genes and find which diseases are associated with your genes, any pathway (and GO) enrichment set in that list, and an overview of possible protein interactions among those.
Thanks for all of the information. I work down to the wire, but when I've looked into this wealth of information I'll reply.
Another idea may be to just perform a simple gene enrichment and then plot them, as I do here: Clustering of DAVID gene enrichment results from gene expression studies
Not 100% what you want, though.