Approximate Bayesian inference for large spatial datasets using predictive process models

Cited by: 43
Authors
Eidsvik, Jo [1]
Finley, Andrew O. [2]
Banerjee, Sudipto [3]
Rue, Havard [1]
Affiliations
[1] NTNU, Dept Math Sci, Trondheim, Norway
[2] Michigan State Univ, Dept Forestry, E Lansing, MI 48824 USA
[3] Univ Minnesota, Dept Biostat, Minneapolis, MN 55455 USA
Funding
US National Science Foundation; US National Institutes of Health
Keywords
Approximate Bayesian inference; Computational statistics; Gaussian processes; Geostatistics; Laplace approximation; Predictive process model; Large data sets; Likelihood
DOI
10.1016/j.csda.2011.10.022
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
The challenges of fitting hierarchical spatial models to large datasets are addressed. With the increasing availability of geocoded scientific data, hierarchical models involving spatial processes have become a popular method for carrying out spatial inference. Such models are customarily estimated using Markov chain Monte Carlo algorithms that, while immensely flexible, can become prohibitively expensive. In particular, fitting hierarchical spatial models often involves expensive decompositions of dense matrices whose computational cost grows cubically with the number of spatial locations. Such matrix computations are required in each iteration of the Markov chain Monte Carlo algorithm, rendering the algorithm infeasible for large spatial datasets. The computational challenges in analyzing large spatial datasets are addressed by merging two recent developments. First, the predictive process model is used as a reduced-rank spatial process to reduce the dimensionality of the model. Then a computational framework is developed for estimating predictive process models using the integrated nested Laplace approximation. Settings in which the first-stage likelihood is Gaussian or non-Gaussian are discussed, as are issues such as prediction and model comparison. Results are presented for synthetic data and several environmental datasets. © 2011 Elsevier B.V. All rights reserved.
Pages: 1362-1380
Page count: 19