Hi All,
Can any one suggest which is the best clustering algorithm to check the coexpression of genes. I used the K-mean clustering algorithm but I suspect it does not cluster correctly.
I have microarray data for total 10 samples from different conditions/tissues. When I cluster data from all the 10 samples then it gives different results than if I cluster data for just one sample. Means, in both cases same genes are clustered in different clusters. Some variation I can expect but results are entirely different in both the cases.
Please help I am new to this.
Thanks, Ritu
@Ritu: how do you cluster one sample?
You need to add a little more info to your question before we can reasonably answer. Things like: What program/package are you using to do the clustering? Which distance metric? What value of K are you using and how did you choose it? Is is 10 samples per condition/tissue or 10 samples total? How many replications per condition? Why do you believe the clustering is 'incorrect'?
Also, getting different results when running 10 samples and when running 1 sample is not very informative. Microarrays have lots of noise and when clustering based on one array you may just be clustering noise.
"When I cluster data from all the 10 samples then it gives different results than if I cluster data for just one sample." Totally overlooked this sentence. But that statement seems odd. I think you need to explain better what your question is.