Estimation and Prediction in Spatial Models With Block Composite Likelihoods

被引:90
作者
Eidsvik, Jo [1 ]
Shaby, Benjamin A. [2 ]
Reich, Brian J. [3 ]
Wheeler, Matthew [4 ]
Niemi, Jarad [5 ]
机构
[1] NTNU, Dept Math Sci, N-7491 Trondheim, Norway
[2] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[3] N Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA
[4] Univ Calif Santa Barbara, Dept Stat & Appl Probabil, Santa Barbara, CA 93106 USA
[5] Iowa State Univ, Dept Stat, Ames, IA 50011 USA
基金
美国国家科学基金会;
关键词
Gaussian process; GPU; Large datasets; Parallel computing; Spatial statistics; DATA SETS; COVARIANCE; DATASETS;
D O I
10.1080/10618600.2012.760460
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article develops a block composite likelihood for estimation and prediction in large spatial datasets. The composite likelihood (CL) is constructed from the joint densities of pairs of adjacent spatial blocks. This allows large datasets to be split into many smaller datasets, each of which can be evaluated separately, and combined through a simple summation. Estimates for unknown parameters are obtained by maximizing the block CL function. In addition, a new method for optimal spatial prediction under the block CL is presented. Asymptotic variances for both parameter estimates and predictions are computed using Godambe sandwich matrices. The approach considerably improves computational efficiency, and the composite structure obviates the need to load entire datasets into memory at once, completely avoiding memory limitations imposed by massive datasets. Moreover, computing time can be reduced even further by distributing the operations using parallel computing. A simulation study shows that CL estimates and predictions, as well as their corresponding asymptotic confidence intervals, are competitive with those based on the full likelihood. The procedure is demonstrated on one dataset from the mining industry and one dataset of satellite retrievals. The real-data examples show that the block composite results tend to outperform two competitors; the predictive process model and fixed-rank kriging. Supplementary materials for this article is available online on the journal web site.
引用
收藏
页码:295 / 315
页数:21
相关论文
共 33 条
[1]  
[Anonymous], 1993, J AGR BIOL ENVIR ST
[2]   Parameter estimation in high dimensional Gaussian distributions [J].
Aune, Erlend ;
Simpson, Daniel P. ;
Eidsvik, Jo .
STATISTICS AND COMPUTING, 2014, 24 (02) :247-263
[3]   Joint composite estimating functions in spatiotemporal models [J].
Bai, Yun ;
Song, Peter X. -K. ;
Raghunathan, T. E. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2012, 74 :799-824
[4]   Stationary process approximation for the analysis of large spatial datasets [J].
Banerjee, Sudipto ;
Gelfand, Alan E. ;
Finley, Andrew O. ;
Sang, Huiyan .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :825-848
[5]   Estimating Space and Space-Time Covariance Functions for Large Data Sets: A Weighted Composite Likelihood Approach [J].
Bevilacqua, Moreno ;
Gaetan, Carlo ;
Mateu, Jorge ;
Porcu, Emilio .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (497) :268-280
[6]   SPATIAL MODELS GENERATED BY NESTED STOCHASTIC PARTIAL DIFFERENTIAL EQUATIONS, WITH AN APPLICATION TO GLOBAL OZONE MAPPING [J].
Bolin, David ;
Lindgren, Finn .
ANNALS OF APPLIED STATISTICS, 2011, 5 (01) :523-550
[7]   Asymptotic properties of computationally efficient alternative estimators for a class of multivariate normal models [J].
Caragea, Petruta C. ;
Smith, Richard L. .
JOURNAL OF MULTIVARIATE ANALYSIS, 2007, 98 (07) :1417-1440
[8]   Fixed rank kriging for very large spatial data sets [J].
Cressie, Noel ;
Johannesson, Gardar .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :209-226
[9]   A composite likelihood approach to semivariogram estimation [J].
Curriero, FC ;
Lele, S .
JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 1999, 4 (01) :9-28
[10]   Bayesian geostatistical design [J].
Diggle, P ;
Lophaven, S .
SCANDINAVIAN JOURNAL OF STATISTICS, 2006, 33 (01) :53-64