Question

RNA-seq cluster analysis in R

0

Entering edit mode

6.6 years ago

MarjoryMollusc ▴ 50

So I have my time series RNA-seq data, which I conducted some k-means clustering on in R to have a look at how it clusters. I am wondering if there are any good tools in R to then analyse the genes within each cluster, or the next step from the k-means clustering? Each cluster has around 50 genes in them.

I've been thinking about investigating nearby transcription factors (mouse cells) however the closest R tool I can find for doing that is pwOmics. I know that the TFcheckpoint webtool is a thing, however I was hoping to avoid copy-pasting forty different sets of genes in there from text files. Are there any similar R tools?

The main thing is that I am a bit unsure where to go from here, as there are a lot of genes within each cluster, and I have about four different plots with about 10 clusters in each. Any suggestions would be super useful! Thanks

RNA-Seq R k-means clustering • 5.3k views

ADD COMMENT • link updated 4.7 years ago by Biostar 20 • written 6.6 years ago by MarjoryMollusc ▴ 50

1

Entering edit mode

Let's say each cluster is one gene set. Taking each of this gene set you could do GO and pathway analysis to understand the biology. Simplest way to do this is DAVID. Further downstream analysis depends on what question you want to address.

ADD REPLY • link 6.6 years ago by Chirag Parsania ★ 2.0k

0

Entering edit mode

Annotation enrichment analysis is a typical way of looking at clusters of genes. There are different R packages for this, e.g. Bioconductor topGO for GO terms enrichment or, in simple situations, you could do it with the fisher.test() function.

As an aside, you may want to have a look at this post about clustering time series.

ADD REPLY • link 6.6 years ago by Jean-Karim Heriche 27k

score 0 · Answer 1 · 2018-06-10

clusterProfiler. Visualization of profile comparison section from Bioconductor package clusterProfiler. A quick solution for your question. see .
co-expressed gene set enrichment analysis, cogena. cogena started with gene expression not clustered genes, While if you want to try serveral clustering methods, including kmeans, and enrichment analysis of each cluster, cogena is recommended. Just put the gene sets gmt file in the installation directory of cogena, R/x86_64-pc-linux-gnu- library/3.2/cogena/extdata, (see vignette). see and