Application of Double Clustering to Gene Expression Data for Class Prediction

2 Author(s)
Al-Shalalfa, M. ; Dept. of Comput. Sci., Univ. of Calgary, Calgary, AB ; Alhajj, R.

Extracting significant features from gene expression data remains an active research topic. Many methods have been proposed in the literature, but all of them work with features obtained directly from the data. Because microarray data exhibit a high degree of noise, in this paper we try to reduce the noise by using a double clustering approach to identify a reduced set of features capable of distinguishing between two classes. We also show that the transformation of the data plays a significant role in classification. We used two forms of the data, with k-means and self-organizing maps for clustering, and support vector machines and binary decision trees for classification. In experiments on the AML/ALL data, we observed that CSVM correctly classifies all of the training and testing data when the data are log2 transformed, using only a few features.
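The abstract does not spell out the double clustering procedure, so the following is only a minimal sketch of the general idea it names: log2-transform the expression values, cluster the genes with k-means, and use one averaged feature per gene cluster. Everything here is an assumption for illustration, not the authors' method: the toy expression matrix, the `floor` threshold, the farthest-point initialization, and the function names `log2_transform`, `kmeans`, and `cluster_mean_features` are all hypothetical. The second clustering stage and the SVM and decision tree classifiers are not shown.

```python
import math


def log2_transform(matrix, floor=1.0):
    """Clamp each value to `floor` (an assumed threshold) and take log2."""
    return [[math.log2(max(v, floor)) for v in row] for row in matrix]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, max_iters=100):
    """Plain Lloyd's k-means; returns one cluster label per point."""
    # Deterministic farthest-point initialization (a choice made for this sketch).
    centroids = [list(points[0])]
    while len(centroids) < k:
        far = max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        centroids.append(list(far))
    labels = None
    for _ in range(max_iters):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: an empty cluster keeps its previous centroid.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def cluster_mean_features(genes_by_samples, labels, k):
    """Average the genes in each cluster; returns a samples x k matrix."""
    n_samples = len(genes_by_samples[0])
    features = [[0.0] * k for _ in range(n_samples)]
    for c in range(k):
        members = [g for g, lab in zip(genes_by_samples, labels) if lab == c]
        for s in range(n_samples):
            if members:
                features[s][c] = sum(g[s] for g in members) / len(members)
    return features


# Toy data (made up): 4 genes (rows) x 4 samples (columns).
# Samples 0-1 stand in for one class, samples 2-3 for the other.
expression = [
    [1024, 2048, 16, 32],
    [2048, 1024, 32, 16],
    [16, 32, 1024, 2048],
    [32, 16, 2048, 1024],
]

logged = log2_transform(expression)
labels = kmeans(logged, k=2)
features = cluster_mean_features(logged, labels, k=2)
```

In this toy case the four genes collapse into two cluster-level features, and the two groups of samples differ clearly on both of them. A classifier would then be trained on `features` rather than on the raw gene values.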

Published in:

Advanced Information Networking and Applications Workshops, 2007, AINAW '07. 21st International Conference on (Volume: 1)

Date of Conference:

21-23 May 2007