Selecting the Number of Principal Components in Functional Data.

abstract

Functional principal component analysis (FPCA) has become the most widely used dimension reduction tool for functional data analysis. We consider functional data measured at random, subject-specific time points, contaminated with measurement error, allowing for both sparse and dense functional data, and propose novel information criteria to select the number of principal component in such data. We propose a Bayesian information criterion based on marginal modeling that can consistently select the number of principal components for both sparse and dense functional data. For dense functional data, we also developed an Akaike information criterion (AIC) based on the expected Kullback-Leibler information under a Gaussian assumption. In connecting with factor analysis in multivariate time series data, we also consider the information criteria by Bai & Ng (2002) and show that they are still consistent for dense functional data, if a prescribed undersmoothing scheme is undertaken in the FPCA algorithm. We perform intensive simulation studies and show that the proposed information criteria vastly outperform existing methods for this type of data. Surprisingly, our empirical evidence shows that our information criteria proposed for dense functional data also perform well for sparse functional data. An empirical example using colon carcinogenesis data is also provided to illustrate the results.

authors

Carroll, Raymond

published proceedings

J Am Stat Assoc

author list (cited authors)

Li, Y., Wang, N., & Carroll, R. J.

citation count

58

complete list of authors

Li, Yehua||Wang, Naisyin||Carroll, Raymond J

publication date

December 2013

publisher

Taylor & Francis Publisher

published in

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION Journal

keywords

Akaike Information Criterion
Bayesian Information Criterion
Functional Data Analysis
Kernel Smoothing
Principal Components

PubMed Central ID

24376287

Digital Object Identifier (DOI)

10.1080/01621459.2013.788980

start page

1284

end page

1294

volume

108

issue

504

URL

http://dx.doi.org/10.1080/01621459.2013.788980

Selecting the Number of Principal Components in Functional Data. Academic Article

Overview

abstract

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

PubMed Central ID

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL