A criterion for choosing between full-sample and hold-out classifier design

abstract

Is it better to design a classifier and estimate its error on the full sample or to design a classifier on a training subset and estimate its error on the hold-out test subset? Full-sample design provides the better classifier; nevertheless, one might choose hold-out with the hope of better error estimation. A conservative criterion to decide the best course is to aim at a classifier whose error is less than a given bound. Then the choice between full-sample and hold-out design depends on which possesses the smaller expected bound. Using this criterion, we examine the choice between hold-out and several full-sample error estimators using covariance models. The relation between the two designs is revealed via a decomposition of the expected bound into the sum of the expected true error and the expected conditional standard deviation of the true error. 2008 IEEE.

name of conference

2008 IEEE International Workshop on Genomic Signal Processing and Statistics

authors

Dougherty, Edward

published proceedings

2008 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS

author list (cited authors)

Brun, M., Xu, Q., & Dougherty, E. R.

citation count

2

complete list of authors

Brun, Marcel||Xu, Qian||Dougherty, Edward R

publication date

June 2008

publisher

Institute of Electrical and Electronics Engineers (IEEE) Publisher

published in

IEEE International Workshop on Genomic Signal Processing and Statistics : [proceedings]. IEEE International Workshop on Genomic Signal Processing and Statistics Journal

keywords

Emerging Infectious Diseases

Digital Object Identifier (DOI)

10.1109/gensips.2008.4555662

International Standard Book Number (ISBN) 13

978-1-4244-2371-2

start page

30

end page

+

URL

http%3A%2F%2Fdx.doi.org%2F10.1109%2Fgensips.2008.4555662

A criterion for choosing between full-sample and hold-out classifier design Conference Paper

Overview

abstract

name of conference

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 13

Additional Document Info

start page

end page

Other

URL