clustering from breast cancer RNA-seq data
3
0
Entering edit mode
8.6 years ago

Hello!! I work in breast cancer and I have RNA-seq data from 60 patients. I classified the samples into intrinsic subtypes by immunohistochemistry and I have been using DEseq2 to perform differentially expression analysis between subtypes. I would like to do a global analysis. I mean, I would like to see which samples cluster together according to the gene expression, something unsupervised but using the whole transcriptome. I already used PAM50 to assign samples into intrinsic subtypes according to gene expression but I would like to see what samples cluster together and what characterized each cluster and from there I would like to do the differentially expression analysis…what tools can I use?

RNA-Seq • 3.5k views
ADD COMMENT
0
Entering edit mode
8.6 years ago
Michael Love ★ 2.6k

Have you seen the clustering examples in the DESeq2 vignette or workflow?

vignette("DESeq2")

For 60 samples, I would apply the varianceStabilizingTransformation (see vignette), then explore with the plots we show in the vignette. You can use built in R tools to cluster, for example:

mat <- assay(vsd)
hc <- hclust(dist(t(mat)))
plot(hc)
cutree(hc, k=3)

If you get the latest version of R (3.3) and the latest version of DESeq2 (1.12) you can use an even faster version of VST I just implemented:

vsd <- vst(dds)
ADD COMMENT
0
Entering edit mode
8.6 years ago

Thank you!! and after clustering, How can I explore the gene expression of each cluster? I mean, what characterize each cluster...

ADD COMMENT
0
Entering edit mode
5.0 years ago
skjobs ▴ 190

For the RNA seq or higher dimensional matrix, I will recommend you to use the tSNE algorithm for unsupervised clustering i.e. PCA.

ADD COMMENT

Login before adding your answer.

Traffic: 1570 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6