Nonparametric and nonlinear models and data mining in time series: a case study on the Canadian lynx data


  • Northern Illinois University, De Kalb, USA

  • Summary. Nonparametric regression methods are used as exploratory tools for formulating, identifying and estimating non-linear models for the Canadian lynx data, which have attained benchmark status in the time series literature since the work of Moran in 1953. To avoid the curse of dimensionality in the nonparametric analysis of this short series with 114 observations, we confine attention to the restricted class of additive and projection pursuit regression (PPR) models and rely on the estimated prediction error variance to compare the predictive performance of various (non-)linear models. A PPR model is found to have the smallest (in-sample) estimated prediction error variance of all the models fitted to these data in the literature. We use a data perturbation procedure to assess and adjust for the effect of data mining on the estimated prediction error variances; this renders most models fitted to the lynx data comparable and nearly equivalent. However, on the basis of the out-of-sample mean-squared prediction error, the semiparametric model $X_t = 1.08 + 1.37 X_{t-1} + f(X_{t-2}) + e_t$ and Tong's self-exciting threshold autoregressive model perform much better than the PPR and other models known for the lynx data.
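As a rough illustration of the kind of comparison the summary describes, the sketch below fits a partially linear model of the same form as the semiparametric model above (a linear term in the first lag plus an unspecified smooth function of the second lag) and scores it by out-of-sample mean-squared one-step prediction error. Everything specific here is an assumption made for illustration and is not taken from the article: the series is synthetic (it is not the lynx data), the smooth term is estimated with a Gaussian-kernel Nadaraya-Watson smoother inside a simple backfitting loop, and the bandwidth, the 100/14 split and the function names `nw_smooth`, `fit_partially_linear` and `one_step_mse` are hypothetical choices.

```python
import numpy as np

def nw_smooth(x_train, r_train, x_eval, bandwidth):
    """Nadaraya-Watson regression estimate with a Gaussian kernel."""
    d = (x_eval[:, None] - x_train[None, :]) / bandwidth
    w = np.exp(-0.5 * d**2)
    return (w @ r_train) / w.sum(axis=1)

def fit_partially_linear(x, bandwidth=0.3, n_iter=20):
    """Backfit X_t = a + b*X_{t-1} + f(X_{t-2}) + e_t (illustrative only)."""
    y, x1, x2 = x[2:], x[1:-1], x[:-2]
    design = np.column_stack([np.ones_like(x1), x1])
    f_hat = np.zeros_like(y)
    for _ in range(n_iter):
        # Linear step: regress y minus the current smooth term on (1, X_{t-1}).
        coef, *_ = np.linalg.lstsq(design, y - f_hat, rcond=None)
        # Smooth step: smooth the partial residuals against X_{t-2}.
        resid = y - design @ coef
        smooth = nw_smooth(x2, resid, x2, bandwidth)
        centre = smooth.mean()      # centre f so the intercept is identifiable
        f_hat = smooth - centre
    return coef, x2, resid, centre

def one_step_mse(x, n_train, bandwidth=0.3):
    """Fit on the first n_train points; score one-step forecasts on the rest."""
    coef, x2_tr, r_tr, centre = fit_partially_linear(x[:n_train], bandwidth)
    y, x1, x2 = x[n_train:], x[n_train - 1:-1], x[n_train - 2:-2]
    f_new = nw_smooth(x2_tr, r_tr, x2, bandwidth) - centre
    pred = coef[0] + coef[1] * x1 + f_new
    return float(np.mean((y - pred) ** 2))

# Synthetic series of length 114 (an assumption; NOT the Canadian lynx data).
rng = np.random.default_rng(0)
x = np.full(314, 3.0)
for t in range(2, 314):
    x[t] = (1.05 + 1.1 * x[t - 1] - 0.45 * x[t - 2]
            - 0.2 * np.tanh(2 * (x[t - 2] - 3)) + 0.2 * rng.standard_normal())
x = x[200:]  # drop the burn-in, keeping 114 observations

mse = one_step_mse(x, n_train=100)
print(f"out-of-sample one-step MSE: {mse:.4f}")
```

The same `one_step_mse` scoring could be applied to any competing fitted model on the same hold-out points, which is the sense in which out-of-sample error gives a comparison that is not flattered by in-sample model searching.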

published proceedings

  • Journal of the Royal Statistical Society Series C (Applied Statistics)

author list (cited authors)

  • Lin, T. C., & Pourahmadi, M.

citation count

  • 21

complete list of authors

  • Lin, TC; Pourahmadi, M

publication date

  • January 1, 2008