RNA-seq cluster analysis in R
1
0
Entering edit mode
6.6 years ago

So I have my time series RNA-seq data, which I conducted some k-means clustering on in R to have a look at how it clusters. I am wondering if there are any good tools in R to then analyse the genes within each cluster, or the next step from the k-means clustering? Each cluster has around 50 genes in them.

I've been thinking about investigating nearby transcription factors (mouse cells) however the closest R tool I can find for doing that is pwOmics. I know that the TFcheckpoint webtool is a thing, however I was hoping to avoid copy-pasting forty different sets of genes in there from text files. Are there any similar R tools?

The main thing is that I am a bit unsure where to go from here, as there are a lot of genes within each cluster, and I have about four different plots with about 10 clusters in each. Any suggestions would be super useful! Thanks

RNA-Seq R k-means clustering • 5.3k views
ADD COMMENT
1
Entering edit mode

Let's say each cluster is one gene set. Taking each of this gene set you could do GO and pathway analysis to understand the biology. Simplest way to do this is DAVID. Further downstream analysis depends on what question you want to address.

ADD REPLY
0
Entering edit mode

Annotation enrichment analysis is a typical way of looking at clusters of genes. There are different R packages for this, e.g. Bioconductor topGO for GO terms enrichment or, in simple situations, you could do it with the fisher.test() function.

As an aside, you may want to have a look at this post about clustering time series.

ADD REPLY
0
Entering edit mode
6.5 years ago
Zhilong Jia ★ 2.2k
  1. clusterProfiler. Visualization of profile comparison section from Bioconductor package clusterProfiler. A quick solution for your question. see figure.

  2. co-expressed gene set enrichment analysis, cogena. cogena started with gene expression not clustered genes, While if you want to try serveral clustering methods, including kmeans, and enrichment analysis of each cluster, cogena is recommended. Just put the gene sets gmt file in the installation directory of cogena, R/x86_64-pc-linux-gnu- library/3.2/cogena/extdata, (see vignette). see figure and figure

ADD COMMENT

Login before adding your answer.

Traffic: 1931 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6