Classifier Design Given an Uncertainty Class of Feature Distributions via Regularized Maximum Likelihood and the Incorporation of Biological Pathway Knowledge in Steady-State Phenotype Classification.

abstract

Contemporary high-throughput technologies provide measurements of very large numbers of variables but often with very small sample sizes. This paper proposes an optimization-based paradigm for utilizing prior knowledge to design better performing classifiers when sample sizes are limited. We derive approximate expressions for the first and second moments of the true error rate of the proposed classifier under the assumption of two widely-used models for the uncertainty classes; -contamination and p-point classes. The applicability of the approximate expressions is discussed by defining the problem of finding optimal regularization parameters through minimizing the expected true error. Simulation results using the Zipf model show that the proposed paradigm yields improved classifiers that outperform traditional classifiers that use only training data. Our application of interest involves discrete gene regulatory networks possessing labeled steady-state distributions. Given prior operational knowledge of the process, our goal is to build a classifier that can accurately label future observations obtained in the steady state by utilizing both the available prior knowledge and the training data. We examine the proposed paradigm on networks containing NF-B pathways, where it shows significant improvement in classifier performance over the classical data-only approach to classifier design. Companion website: http://gsp.tamu.edu/Publications/supplementary/shahrokh12a.

authors

published proceedings

Pattern Recognit

author list (cited authors)

Esfahani, M. S., Knight, J., Zollanvari, A., Yoon, B., & Dougherty, E. R.

citation count

9

complete list of authors

Esfahani, Mohammad Shahrokh||Knight, Jason||Zollanvari, Amin||Yoon, Byung-Jun||Dougherty, Edward R

publication date

January 2013

publisher

Elsevier Publisher

published in

Pattern Recognition Journal

keywords

Biological-pathway Knowledge
Regularized
Steady-state Classifier
Uncertainty Class

Digital Object Identifier (DOI)

10.1016/j.patcog.2013.02.017

start page

2783

end page

2797

volume

46

issue

10

URL

http%3A%2F%2Fdx.doi.org%2F10.1016%2Fj.patcog.2013.02.017

Classifier Design Given an Uncertainty Class of Feature Distributions via Regularized Maximum Likelihood and the Incorporation of Biological Pathway Knowledge in Steady-State Phenotype Classification. Academic Article

Overview

abstract

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL