Scheduled System Maintenance:
On May 6th, single article purchases and IEEE account management will be unavailable from 8:00 AM - 12:00 PM ET (12:00 - 16:00 UTC). We apologize for the inconvenience.
By Topic

An integrated system for class prediction using gene expression profiling

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Dechang Chen ; Preventive Medicine & Biometrics, Uniformed Services Univ. of the Health Sci., Bethesda, MD, USA ; Hua, D.D. ; Zhenqiu Liu ; Zhi-Fu Cheng

Gene expression profiles have been successfully applied to class prediction. Due to a large number of genes (features) and a small number of samples in gene expression data, feature selection is essential when performing the prediction task. Many methods have been proposed to select features in microarray data analysis, but there is no unique method which performs uniformly well for all the learning algorithms. It is then practical to find a feature selection method and a learning algorithm that give superior performance. In this paper, we present an integrated scheme to perform the task of class prediction based on gene expression profiles. The scheme incorporates a simple novel feature selection procedure into naive Bayes models. Each selected gene has a high score of discriminatory power determined by the Brown-Forsythe test statistics. Any pair of selected genes have a low correlation. This facilitates the use of the conditional independence among genes assumed by the naive Bayes models. To demonstrate the effectiveness, the proposed scheme was applied to three commonly used expression data sets COLON, OVARIAN, and LEUKEMIA. The results show that the numbers of misclassified samples are 0, and 4, respectively.

Published in:

Control, Automation, Robotics and Vision Conference, 2004. ICARCV 2004 8th  (Volume:2 )

Date of Conference:

6-9 Dec. 2004