A criterion for choosing between full-sample and hold-out classifier design

Authors: Marcel Brun (Universidad Nacional de Mar del Plata, Bs.As., Argentina); Qian Xu; Edward R. Dougherty

Is it better to design a classifier and estimate its error on the full sample or to design a classifier on a training subset and estimate its error on the hold-out test subset? Full-sample design provides the better classifier; nevertheless, one might choose hold-out with the hope of better error estimation. A conservative criterion to decide the best course is to aim at a classifier whose error is less than a given bound. Then the choice between full-sample and hold-out design depends on which possesses the smaller expected bound. Using this criterion, we examine the choice between hold-out and several full-sample error estimators using covariance models. The relation between the two designs is revealed via a decomposition of the expected bound into the sum of the expected true error and the expected conditional standard deviation of the true error.
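The comparison described in the abstract can be explored with a small Monte Carlo experiment. The sketch below is an illustration under stated assumptions, not the authors' procedure: it assumes a one-dimensional model with two unit-variance Gaussian classes at -d and +d and equal priors, a midpoint-of-means threshold classifier, resubstitution as the only full-sample estimator, and a plain sum of the expected true error and the expected conditional standard deviation (any scaling constant in the paper's bound is not reproduced). All function names and parameters are hypothetical.

```python
import math
import random
from collections import defaultdict


def phi(z):
    """Standard normal CDF."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def design(x0, x1):
    """Threshold classifier: midpoint of the class means, oriented by their order."""
    m0 = sum(x0) / len(x0)
    m1 = sum(x1) / len(x1)
    return (m0 + m1) / 2.0, (1 if m1 >= m0 else -1)


def predict(clf, x):
    t, s = clf
    return 1 if s * (x - t) > 0 else 0


def true_error(clf, d):
    """Exact error under the assumed model N(-d, 1) vs N(+d, 1), equal priors."""
    t, s = clf
    err = 0.5 * (1.0 - phi(t + d)) + 0.5 * phi(t - d)
    return err if s == 1 else 1.0 - err


def counted_error(clf, x0, x1):
    """Fraction of the given labeled points that the classifier gets wrong."""
    wrong = sum(predict(clf, x) != 0 for x in x0)
    wrong += sum(predict(clf, x) != 1 for x in x1)
    return wrong / (len(x0) + len(x1))


def simulate(n, n_test, d, trials, seed=0):
    """Return (true error, estimated error) pairs for both designs.

    n is the sample size per class; hold-out reserves n_test points per class.
    """
    rng = random.Random(seed)
    runs = {"full": [], "holdout": []}
    k = n - n_test
    for _ in range(trials):
        x0 = [rng.gauss(-d, 1.0) for _ in range(n)]
        x1 = [rng.gauss(d, 1.0) for _ in range(n)]
        # Full-sample design, error estimated by resubstitution.
        full = design(x0, x1)
        runs["full"].append((true_error(full, d), counted_error(full, x0, x1)))
        # Hold-out design: train on the first k points, test on the rest.
        ho = design(x0[:k], x1[:k])
        runs["holdout"].append(
            (true_error(ho, d), counted_error(ho, x0[k:], x1[k:]))
        )
    return runs


def summarize(pairs):
    """Expected true error, expected SD of true error given the estimate, and their sum."""
    mean_true = sum(e for e, _ in pairs) / len(pairs)
    groups = defaultdict(list)
    for e, est in pairs:
        groups[est].append(e)  # estimates are discrete, so group on exact value
    exp_cond_sd = 0.0
    for vals in groups.values():
        mu = sum(vals) / len(vals)
        var = sum((v - mu) ** 2 for v in vals) / len(vals)
        exp_cond_sd += (len(vals) / len(pairs)) * math.sqrt(var)
    return mean_true, exp_cond_sd, mean_true + exp_cond_sd


if __name__ == "__main__":
    runs = simulate(n=20, n_test=5, d=1.0, trials=2000)
    for name, pairs in runs.items():
        print(name, summarize(pairs))
```

In this toy setting the first component of each summary reflects the quality of the designed classifier and the second reflects how much uncertainty about the true error remains once the estimate is known, which is the trade-off the abstract describes. Which design yields the smaller sum depends on the model, the sample size, and the estimator, so the output here should not be read as a general conclusion.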

Published in: 2008 IEEE International Workshop on Genomic Signal Processing and Statistics

Date of Conference: 8-10 June 2008