Classification with reject option in gene expression data.

abstract

MOTIVATION: The classification methods typically used in bioinformatics classify all examples, even if the classification is ambiguous, for instance, when the example is close to the separating hyperplane in linear classification. For medical applications, it may be better to classify an example only when there is a sufficiently high degree of accuracy, rather than classify all examples with decent accuracy. Moreover, when all examples are classified, the classification rule has no control over the accuracy of the classifier; the algorithm just aims to produce a classifier with the smallest error rate possible. In our approach, we fix the accuracy of the classifier and thereby choose a desired risk of error. RESULTS: Our method consists of defining a rejection region in the feature space. This region contains the examples for which classification is ambiguous. These are rejected by the classifier. The accuracy of the classifier becomes a user-defined parameter of the classification rule. The task of the classification rule is to minimize the rejection region with the constraint that the error rate of the classifier be bounded by the chosen target error. This approach is also used in the feature-selection step. The results computed on both synthetic and real data show that classifier accuracy is significantly improved. AVAILABILITY: Companion Website. http://gsp.tamu.edu/Publications/rejectoption/

authors

Dougherty, Edward

published proceedings

Bioinformatics

altmetric score

3

author list (cited authors)

Hanczar, B., & Dougherty, E. R.

citation count

46

complete list of authors

Hanczar, Blaise||Dougherty, Edward R

publication date

September 2008

publisher

Oxford University Press (OUP) Publisher

published in

Bioinformatics Journal

keywords

Algorithms
Artifacts
Artificial Intelligence
Gene Expression Profiling
Oligonucleotide Array Sequence Analysis
Pattern Recognition, Automated
Reproducibility Of Results
Sensitivity And Specificity

PubMed Central ID

18621758

Digital Object Identifier (DOI)

10.1093/bioinformatics/btn349

start page

1889

end page

1895

volume

24

issue

17

URL

http%3A%2F%2Fdx.doi.org%2F10.1093%2Fbioinformatics%2Fbtn349

Classification with reject option in gene expression data. Academic Article

Overview

abstract

authors

published proceedings

altmetric score

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

PubMed Central ID

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL