By Topic

A Novel Multiclass Gene Selection Method based on SVM/MLP Cross Validation

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Junying Zhang ; School of Computer Science and Engineering, Xidian University, Xi'an, 710071, P.RChina. Email: ; Hongyi Zhang ; Shenling Liu ; Yue Joseph Wang

Gene selection is one of the major challenges of biochip technology for resolution of curse of dimensionality which occurs especially in DNA microarray dataset where there are more than thousands of genes and only a few experiments (samples), and for gene diagnosis where only a gene subset is enough for diagnosis of diseases. This paper presents a gene selection method by training linear SVM (support vector machine)/nonlinear MLP (multi-layer perceptron) classifiers and testing them with cross validation for finding gene subset which is optimal/suboptimal for diagnosis of binary/multiple disease classes. The process is to select genes with linear SVM classifier incrementally for the diagnosis of each binary disease class pair, by testing its generalization ability with leave-one-out cross validation; the union of them is used as initialized gene subset for the discrimination of all the disease classes, from which genes are deleted one by one decrementally by removing the gene which brings the greatest decrease of the generalization power after the removal, where generalization is measured by leave-one-out and leave-4-out cross validation. For real DNA microarray data with 2308 genes and only 64 labelled samples belonging to 4 disease classes, only 6 genes are selected to be diagnostic genes. The diagnostic genes are tested with 6-2-4 MLP with both leave-one-out and leave-4-out cross validation, resulting in no misclassification

Published in:

2006 International Conference on Mechatronics and Automation

Date of Conference:

25-28 June 2006