Effect of Separate Sampling on Classification and the Minimax Criterion Conference Paper uri icon

abstract

  • It is commonplace in bioinformatics (and elsewhere) to build a classifier from sample data in which the sample sizes of the classes are not random; that is, they are selected prior to sampling. The result is that there is no estimate of the prior class probabilities available from the data. In this paper, we find an analytic result for the minimax solution for the class prior probabilities for a general Neyman-Pearson induced classifier. From that we derive Anderson's classical minimax prior probability 'estimate.' Using synthetic and real data, we demonstrate the degradation in classifier performance from using inaccurate values for the prior probabilities. 2013 IEEE.

name of conference

  • 2013 IEEE International Workshop on Genomic Signal Processing and Statistics

published proceedings

  • 2013 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2013)

author list (cited authors)

  • Esfahani, M. S., & Dougherty, E. R.

citation count

  • 1

complete list of authors

  • Esfahani, Mohammad Shahrokh||Dougherty, Edward R

publication date

  • November 2013