Rapid advances of microarray technologies are making it possible to analyze and manipulate large amounts of gene expression data. Clustering algorithms, such as hierarchical clustering, self-organizing maps, k-means and fuzzy k-means, have become important tools for expression analysis of microarray data. However, the need of prior knowledge of the number of clusters, k, and the fuzziness parameter, b, limits the usage of fuzzy clustering. Few approaches have been proposed for assigning the best possible values for such parameters. In this paper, we use simulated annealing and fuzzy k-means clustering to determine the optimal parameters, namely the number of clusters, k, and the fuzziness parameter, b. Our results show that a nearly-optimal pair of k and b can be obtained without exploring the entire search space.
Published in:
Chilean Computer Science Society, 2005. SCCC 2005. 25th International Conference of the
Date of Conference: 7-11 Nov. 2005