UNEXPECTED PROPERTIES OF BANDWIDTH CHOICE WHEN SMOOTHING DISCRETE DATA FOR CONSTRUCTING A FUNCTIONAL DATA CLASSIFIER.

abstract

The data functions that are studied in the course of functional data analysis are assembled from discrete data, and the level of smoothing that is used is generally that which is appropriate for accurate approximation of the conceptually smooth functions that were not actually observed. Existing literature shows that this approach is effective, and even optimal, when using functional data methods for prediction or hypothesis testing. However, in the present paper we show that this approach is not effective in classification problems. There a useful rule of thumb is that undersmoothing is often desirable, but there are several surprising qualifications to that approach. First, the effect of smoothing the training data can be more significant than that of smoothing the new data set to be classified; second, undersmoothing is not always the right approach, and in fact in some cases using a relatively large bandwidth can be more effective; and third, these perverse results are the consequence of very unusual properties of error rates, expressed as functions of smoothing parameters. For example, the orders of magnitude of optimal smoothing parameter choices depend on the signs and sizes of terms in an expansion of error rate, and those signs and sizes can vary dramatically from one setting to another, even for the same classifier.

authors

Carroll, Raymond

published proceedings

Ann Stat

altmetric score

1

author list (cited authors)

Carroll, R. J., Delaigle, A., & Hall, P.

citation count

9

complete list of authors

Carroll, Raymond J||Delaigle, Aurore||Hall, Peter

publication date

December 2013

publisher

Institute of Mathematical Statistics Publisher

keywords

Centroid Method
Discrimination
Kernel Smoothing
Quadratic Discrimination
Smoothing Parameter Choice
Training Data

Digital Object Identifier (DOI)

10.1214/13-AOS1158

URI

https://hdl.handle.net/1969.1/179367

start page

2739

end page

2767

volume

41

issue

6

URL

http%3A%2F%2Fdx.doi.org%2F10.1214%2F13-aos1158

UNEXPECTED PROPERTIES OF BANDWIDTH CHOICE WHEN SMOOTHING DISCRETE DATA FOR CONSTRUCTING A FUNCTIONAL DATA CLASSIFIER. Academic Article

Overview

abstract

authors

published proceedings

altmetric score

author list (cited authors)

citation count

complete list of authors

publication date

publisher

Research

keywords

Identity

Digital Object Identifier (DOI)

URI

Additional Document Info

start page

end page

volume

issue

Other

URL