What Tool/Package Would You Recommend For Clustering Large Gene Expression Data?
3
1
Entering edit mode
11.0 years ago
Diwan ▴ 650

Hi All,

I have a large gene expression dataset of 10K genes with 800 conditions. This is after preprocessing/filtering. I am able to use many R hierarchical clustering tools using correlation/euclidean distance etc. I noticed that packages like pvclust does not work for such large dataset. There are also many other packages out there. Will it be worth to check each package and see if any cluster is identified using most of the methods ?

What package would you use/recommend to cluster large gene expression data?

Thanks Diwan

expression clustering • 3.6k views
ADD COMMENT
2
Entering edit mode
11.0 years ago
Michael 55k

The amap package contains several clustering algorithms that should handle datasets of this size easily.

ADD COMMENT
0
Entering edit mode

Thanks Michael.

ADD REPLY
1
Entering edit mode
11.0 years ago
vj ▴ 520

If you are comfortable with R then you can try WGCNA

ADD COMMENT
0
Entering edit mode
11.0 years ago
5heikki 11k

What package would you use/recommend to cluster large gene expression data?

The same one(s) they use in high impact publications.

ADD COMMENT
8
Entering edit mode

If you can't provide a helpful answer, why bother posting one at all?

ADD REPLY

Login before adding your answer.

Traffic: 2437 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6