By Topic

Clustering of High-Dimensional Gene Expression Data with Feature Filtering Methods and Diffusion Maps

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Rui Xu ; Appl. Comput. Intell. Lab. Dept. of Electr. & Comput. Eng., Missouri Univ. of Sci. & Technol. Rolla, Rolla, MO ; Damelin, S. ; Nadler, B. ; Wunsch, D.C.

The importance of gene expression data in cancer diagnosis and treatment by now has been widely recognized by cancer researchers in recent years. However, one of the major challenges in the computational analysis of such data is the curse of dimensionality, due to the overwhelming number of measures of gene expression levels versus the small number of samples. Here, we use a two-step method to reduce the dimension of gene expression data. At first, we extract a subset of genes based on the statistical characteristics of their corresponding gene expression measurements. For further dimensionality reduction, we then apply diffusion maps, which interpret the eigenfunctions of Markov matrices as a system of coordinates on the original data set in order to obtain efficient representation of data geometric descriptions, to the reduced data. A neural network clustering theory, Fuzzy ART, is applied to the resulting data to generate clusters of cancer samples. Experimental results on the small round blue-cell tumor (SRBCT) data set, compared with other widely-used clustering algorithms, demonstrate the effectiveness of our proposed method in addressing multidimensional gene expression data.

Published in:

BioMedical Engineering and Informatics, 2008. BMEI 2008. International Conference on  (Volume:1 )

Date of Conference:

27-30 May 2008