Parallel inference for massive distributed spatial data using low-rank models

abstract

2016, Springer Science+Business Media New York. Due to rapid data growth, statistical analysis of massive datasets often has to be carried out in a distributed fashion, either because several datasets stored in separate physical locations are all relevant to a given problem, or simply to achieve faster (parallel) computation through a divide-and-conquer scheme. In both cases, the challenge is to obtain valid inference that does not require processing all data at a single central computing node. We show that for a very widely used class of spatial low-rank models, which can be written as a linear combination of spatial basis functions plus a fine-scale-variation component, parallel spatial inference and prediction for massive distributed data can be carried out exactly, meaning that the results are the same as for a traditional, non-distributed analysis. The communication cost of our distributed algorithms does not depend on the number of data points. After extending our results to the spatio-temporal case, we illustrate our methodology by carrying out distributed spatio-temporal particle filtering inference on total precipitable water measured by three different satellite sensor systems.

authors

Katzfuss, Matthias

published proceedings

Statistics and Computing

altmetric score

0.25

author list (cited authors)

Katzfuss, M., & Hammerling, D.

citation count

26

complete list of authors

Katzfuss, Matthias||Hammerling, Dorit

publication date

March 2017

publisher

Springer Nature Publisher

keywords

49 Mathematical Sciences
4901 Applied Mathematics
4903 Numerical And Computational Mathematics
4905 Statistics

Digital Object Identifier (DOI)

10.1007/s11222-016-9627-4

start page

363

end page

375

volume

27

issue

2

URL

http%3A%2F%2Fdx.doi.org%2F10.1007%2Fs11222-016-9627-4

Parallel inference for massive distributed spatial data using low-rank models Academic Article

Overview

abstract

authors

published proceedings

altmetric score

author list (cited authors)

citation count

complete list of authors

publication date

publisher

Research

keywords

Identity

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL