Bayesian nonstationary spatial modeling for very large datasets

被引:50
作者
Katzfuss, Matthias [1 ]
机构
[1] Heidelberg Univ, Inst Angew Math, D-69120 Heidelberg, Germany
关键词
covariance tapering; full-scale approximation; low-rank models; massive datasets; model selection; reversible-jump MCMC; STATISTICAL-ANALYSIS; DATA SETS; SPACE; APPROXIMATION; PREDICTION; FIELDS;
D O I
10.1002/env.2200
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
With the proliferation of modern high-resolution measuring instruments mounted on satellites, planes, ground-based vehicles, and monitoring stations, a need has arisen for statistical methods suitable for the analysis of large spatial datasets observed on large spatial domains. Statistical analyses of such datasets provide two main challenges: first, traditional spatial-statistical techniques are often unable to handle large numbers of observations in a computationally feasible way; second, for large and heterogeneous spatial domains, it is often not appropriate to assume that a process of interest is stationary over the entire domain. We address the first challenge by using a model combining a low-rank component, which allows for flexible modeling of medium-to-long-range dependence via a set of spatial basis functions, with a tapered remainder component, which allows for modeling of local dependence using a compactly supported covariance function. Addressing the second challenge, we propose two extensions to this model that result in increased flexibility: first, the model is parameterized on the basis of a nonstationary Matern covariance, where the parameters vary smoothly across space; second, in our fully Bayesian model, all components and parameters are considered random, including the number, locations, and shapes of the basis functions used in the low-rank component. Using simulated data and a real-world dataset of high-resolution soil measurements, we show that both extensions can result in substantial improvements over the current state-of-the-art. Copyright (c) 2013 John Wiley & Sons, Ltd.
引用
收藏
页码:189 / 200
页数:12
相关论文
共 60 条
  • [1] [Anonymous], 1999, INTERPOLATION SPATIA
  • [2] [Anonymous], 2005, Nonstationary spatial covariance functions Technical Report No. 21 Center for Integrating Statistical and Environmental Science
  • [3] Banerjee S., 2003, Hierarchical modeling and analysis for spatial data
  • [4] Stationary process approximation for the analysis of large spatial datasets
    Banerjee, Sudipto
    Gelfand, Alan E.
    Finley, Andrew O.
    Sang, Huiyan
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 825 - 848
  • [5] Berliner LM, 2000, J CLIMATE, V13, P3953, DOI 10.1175/1520-0442(2001)013<3953:LLPOPS>2.0.CO
  • [6] 2
  • [7] Estimating Space and Space-Time Covariance Functions for Large Data Sets: A Weighted Composite Likelihood Approach
    Bevilacqua, Moreno
    Gaetan, Carlo
    Mateu, Jorge
    Porcu, Emilio
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (497) : 268 - 280
  • [8] Dynamic factor process convolution models for multivariate space - time data with application to air quality assessment
    Calder, Catherine A.
    [J]. ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2007, 14 (03) : 229 - 247
  • [9] High-Resolution Digital Soil Mapping: Kriging for Very Large Datasets
    Cressie, N.
    Kang, E. L.
    [J]. PROXIMAL SOIL SENSING, 2010, : 49 - 63
  • [10] Cressie N., 1993, Statistics for Spatial Data, DOI [10.1002/9781119115151, DOI 10.1002/9781119115151]