Performance of feature selection methods. - Texas A&M University (TAMU) Scholar

abstract

High-throughput biological technologies offer the promise of finding feature sets to serve as biomarkers for medical applications; however, the sheer number of potential features (genes, proteins, etc.) means that there needs to be massive feature selection, far greater than that envisioned in the classical literature. This paper considers performance analysis for feature-selection algorithms from two fundamental perspectives: How does the classification accuracy achieved with a selected feature set compare to the accuracy when the best feature set is used and what is the optimal number of features that should be used? The criteria manifest themselves in several issues that need to be considered when examining the efficacy of a feature-selection algorithm: (1) the correlation between the classifier errors for the selected feature set and the theoretically best feature set; (2) the regressions of the aforementioned errors upon one another; (3) the peaking phenomenon, that is, the effect of sample size on feature selection; and (4) the analysis of feature selection in the framework of high-dimensional models corresponding to high-throughput data.

authors

Dougherty, Edward

published proceedings

Curr Genomics

altmetric score

2.5

author list (cited authors)

Dougherty, E. R., Hua, J., & Sima, C.

citation count

25

complete list of authors

Dougherty, Edward R||Hua, Jianping||Sima, Chao

publication date

September 2009

publisher

BENTHAM SCIENCE PUBLISHERS Publisher

published in

Current Genomics Journal

keywords

Generic Health Relevance

PubMed Central ID

20190952

Digital Object Identifier (DOI)

10.2174/138920209789177629

start page

365

end page

374

volume

10

issue

6

URL

http%3A%2F%2Fdx.doi.org%2F10.2174%2F138920209789177629

Performance of feature selection methods. Academic Article

Overview

abstract

authors

published proceedings

altmetric score

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

PubMed Central ID

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL