Distributed Peer-to-Peer Cooperative Partitional-Divisive Clustering for gene expression datasets

2 Author(s)
R. Kashef ; Department of Electrical and Computer Engineering, Pattern Analysis and Machine Intelligence Lab, University of Waterloo, Ontario, N2L 3G1, Canada ; M. S. Kamel

Clustering techniques are helpful in understanding gene regulation, cellular processes, and subtypes of cells. A major thrust of gene expression analysis over the last twenty years has been the acquisition of enormous amounts of gene expression data from various distributed sources. Thus, it is becoming increasingly important to cluster distributed data in place, without first pooling it into a central node. The general goal of distributed clustering is to achieve a speedup over centralized approaches. A recent study shows that centralized cooperative clustering outperforms non-cooperative centralized clustering approaches. In this paper, a novel distributed cooperative partitional-divisive clustering (CPDC) approach for peer-to-peer networks is presented. The distributed CPDC approach is based on intermediate cooperation between partitional k-means and divisive bisecting k-means in a distributed peer-to-peer network to produce better global solutions. Computational experiments were conducted to test the performance of the distributed CPDC approach on different gene expression datasets. The experimental results show that the distributed CPDC method performs better than non-cooperative distributed k-means and distributed bisecting k-means. Thus, a new cooperative technique for distributed gene expression repositories is presented for efficiently discovering regularities and genes that may span multiple nodes.
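For readers unfamiliar with the two algorithms the abstract names, the following is a minimal, centralized sketch of partitional k-means and divisive bisecting k-means. It is not the paper's distributed CPDC method: the abstract does not describe the cooperation step or the peer-to-peer protocol, so neither is shown. All function names, the largest-SSE splitting rule, and the toy data are assumptions made for illustration.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Component-wise mean of a non-empty list of tuples."""
    n = len(points)
    return tuple(sum(col) / n for col in zip(*points))


def sse(cluster):
    """Sum of squared distances from each point to the cluster mean."""
    if not cluster:
        return 0.0
    c = mean(cluster)
    return sum(sq_dist(p, c) for p in cluster)


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd-style k-means; returns k clusters as lists of points."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initial centroids drawn from the data
    clusters = []
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: sq_dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Keep the previous centroid if a cluster happens to be empty.
        updated = [mean(c) if c else centroids[i] for i, c in enumerate(clusters)]
        if updated == centroids:
            break
        centroids = updated
    return clusters


def bisecting_kmeans(points, k, seed=0):
    """Divisive clustering: repeatedly split one cluster with 2-means.

    Assumption: the cluster with the largest SSE is split next; other
    selection rules (for example, the largest cluster) are also common.
    """
    clusters = [list(points)]
    while len(clusters) < k:
        worst = max(range(len(clusters)), key=lambda i: sse(clusters[i]))
        target = clusters.pop(worst)
        if len(target) < 2:
            clusters.append(target)  # nothing left to split
            break
        halves = [h for h in kmeans(target, 2, seed=seed) if h]
        if len(halves) < 2:
            clusters.append(target)  # degenerate split (e.g. duplicate points)
            break
        clusters.extend(halves)
    return clusters


# Toy stand-in for expression profiles: three groups of 2-D points.
data = [
    (0.0, 0.0), (0.1, 0.2), (0.2, 0.1),
    (5.0, 5.0), (5.1, 5.2), (5.2, 5.1),
    (10.0, 0.0), (10.1, 0.2), (10.2, 0.1),
]
flat = kmeans(data, 3)
divisive = bisecting_kmeans(data, 3)
```

Both functions return a partition of the input into clusters. In a cooperative scheme such as the one the abstract describes, the two results would be combined in some way, but that step is left out here because the abstract does not specify it.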

Published in:

Computational Intelligence in Bioinformatics and Computational Biology, 2008. CIBCB '08. IEEE Symposium on

Date of Conference:

15-17 Sept. 2008