Bayesian site selection for fast Gaussian process regression Academic Article uri icon

abstract

  • Gaussian Process (GP) regression is a popular method in the field of machine learning and computer experiment designs; however, its ability to handle large data sets is hindered by the computational difficulty in inverting a large covariance matrix. Likelihood approximation methods were developed as a fast GP approximation, thereby reducing the computation cost of GP regression by utilizing a much smaller set of unobserved latent variables called pseudo points. This article reports a further improvement to the likelihood approximation methods by simultaneously deciding both the number and locations of the pseudo points. The proposed approach is a Bayesian site selection method where both the number and locations of the pseudo inputs are parameters in the model, and the Bayesian model is solved using a reversible jump Markov chain Monte Carlo technique. Through a number of simulated and real data sets, it is demonstrated that with appropriate priors chosen, the Bayesian site selection method can produce a good balance between computation time and prediction accuracy: it is fast enough to handle large data sets that a full GP is unable to handle, and it improves, quite often remarkably, the prediction accuracy, compared with the existing likelihood approximations. 2014 Taylor and Francis Group, LLC.

published proceedings

  • IIE TRANSACTIONS

author list (cited authors)

  • Pourhabib, A., Liang, F., & Ding, Y. u.

citation count

  • 7

complete list of authors

  • Pourhabib, Arash||Liang, Faming||Ding, Yu

publication date

  • May 2014