BRISC: bootstrap for rapid inference on spatial covariances

被引:18
作者
Saha, Arkajyoti [1 ]
Datta, Abhirup [1 ]
机构
[1] Johns Hopkins Bloomberg Sch Publ Hlth, Dept Biostat, 615 North Wolfe St, Baltimore, MD 21205 USA
关键词
algorithms; bootstrap; computationally intensive methods; geostatistics; spatial statistics; statistical computing; GAUSSIAN PROCESS MODELS; APPROXIMATION;
D O I
10.1002/sta4.184
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In geostatistics, inference on spatial covariance parameters of the Gaussian process is often critical to scientists for understanding structural dependence in data. Finite-sample inference customarily proceeds either using posterior distributions from fully a Bayesian approach or via resampling/subsampling techniques in a frequentist setting. Resampling methods, in particular, the bootstrap, have become more attractive in the modern age of big data as, unlike Bayesian models that require sequential sampling from Markov chain Monte Carlo, they naturally lend themselves to parallel computing resources. However, a spatial bootstrap involves an expensive Cholesky decomposition to decorrelate the data. In this manuscript, we develop a highly scalable parametric spatial bootstrap that uses sparse Cholesky factors for parameter estimation and decorrelation. The proposed bootstrap for rapid inference on spatial covariances (BRISC) algorithm requires linear memory and computations and is embarrassingly parallel, thereby delivering substantial scalability. Simulation studies highlight the accuracy and computational efficiency of our approach. Analysing large satellite temperature data, BRISC produces inference that closely matches that delivered from a state-of-the-art Bayesian approach, while being several times faster. The R package BRISC is now available for download from GitHub () and will be available on CRAN soon. Copyright (c) 2018 John Wiley & Sons, Ltd.
引用
收藏
页数:16
相关论文
共 29 条
[1]  
[Anonymous], GEOSTATISTICS MODELI
[2]  
[Anonymous], 2017, ARXIV171005013
[3]  
[Anonymous], 1997, CAMBRIDGE SERIES STA, DOI DOI 10.1017/CB09780511802843
[4]  
Banerjee, 2017, APPLYING NEAREST NEI
[5]   Stationary process approximation for the analysis of large spatial datasets [J].
Banerjee, Sudipto ;
Gelfand, Alan E. ;
Finley, Andrew O. ;
Sang, Huiyan .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :825-848
[6]  
Chen HF, 2002, STOCHASTIC APPROXIMA, DOI [10. 1016/B978-0-12-395651-4. 50030-1, DOI 10.1016/B978-0-12-395651-4.50030-1]
[7]   NONSEPARABLE DYNAMIC NEAREST NEIGHBOR GAUSSIAN PROCESS MODELS FOR LARGE SPATIO-TEMPORAL DATA WITH AN APPLICATION TO PARTICULATE MATTER ANALYSIS [J].
Datta, Abhirup ;
Banerjee, Sudipto ;
Finley, Andrew O. ;
Hamm, Nicholas A. S. ;
Schaap, Martijn .
ANNALS OF APPLIED STATISTICS, 2016, 10 (03) :1286-1316
[8]   Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets [J].
Datta, Abhirup ;
Banerjee, Sudipto ;
Finley, Andrew O. ;
Gelfand, Alan E. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (514) :800-812
[9]  
DAVIS MW, 1987, MATH GEOL, V19, P91
[10]   1977 RIETZ LECTURE - BOOTSTRAP METHODS - ANOTHER LOOK AT THE JACKKNIFE [J].
EFRON, B .
ANNALS OF STATISTICS, 1979, 7 (01) :1-26