Space-Time Data fusion Under Error in Computer Model Output: An Application to Modeling Air Quality

被引:151
作者
Berrocal, Veronica J. [1 ]
Gelfand, Alan E. [2 ]
Holland, David M. [3 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] Duke Univ, Dept Stat Sci, Durham, NC 27708 USA
[3] US EPA, Natl Exposure Res Lab, Res Triangle Pk, NC 27711 USA
关键词
Change of support; Data fusion; Gaussian Markov random field; Numerical model calibration; Smoothing; Spatially varying random weights; DOWNSCALER;
D O I
10.1111/j.1541-0420.2011.01725.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We provide methods that can be used to obtain more accurate environmental exposure assessment. In particular, we propose two modeling approaches to combine monitoring data at point level with numerical model output at grid cell level, yielding improved prediction of ambient exposure at point level. Extending our earlier downscaler model (Berrocal, V. J., Gelfand, A. E., and Holland, D. M. (2010b). A spatio-temporal downscaler for outputs from numerical models. Journal of Agricultural, Biological and Environmental Statistics 15, 176197), these new models are intended to address two potential concerns with the model output. One recognizes that there may be useful information in the outputs for grid cells that are neighbors of the one in which the location lies. The second acknowledges potential spatial misalignment between a station and its putatively associated grid cell. The first model is a Gaussian Markov random field smoothed downscaler that relates monitoring station data and computer model output via the introduction of a latent Gaussian Markov random field linked to both sources of data. The second model is a smoothed downscaler with spatially varying random weights defined through a latent Gaussian process and an exponential kernel function, that yields, at each site, a new variable on which the monitoring station data is regressed with a spatial linear model. We applied both methods to daily ozone concentration data for the Eastern US during the summer months of June, July and August 2001, obtaining, respectively, a 5% and a 15% predictive gain in overall predictive mean square error over our earlier downscaler model (Berrocal et al., 2010b). Perhaps more importantly, the predictive gain is greater at hold-out sites that are far from monitoring sites.
引用
收藏
页码:837 / 848
页数:12
相关论文
共 23 条
  • [1] [Anonymous], 2006, Multivariate Geostatistics: An Introduction With Applications
  • [2] [Anonymous], 1993, J AGR BIOL ENVIR ST
  • [3] Banerjee S., 2003, Hierarchical modeling and analysis for spatial data
  • [4] Stationary process approximation for the analysis of large spatial datasets
    Banerjee, Sudipto
    Gelfand, Alan E.
    Finley, Andrew O.
    Sang, Huiyan
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 825 - 848
  • [5] A BIVARIATE SPACE-TIME DOWNSCALER UNDER SPACE AND TIME MISALIGNMENT
    Berrocal, Veronica J.
    Gelfand, Alan E.
    Holland, David M.
    [J]. ANNALS OF APPLIED STATISTICS, 2010, 4 (04) : 1942 - 1975
  • [6] A Spatio-Temporal Downscaler for Output From Numerical Models
    Berrocal, Veronica J.
    Gelfand, Alan E.
    Holland, David M.
    [J]. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2010, 15 (02) : 176 - 197
  • [7] BAYESIAN COMPUTATION AND STOCHASTIC-SYSTEMS
    BESAG, J
    GREEN, P
    HIGDON, D
    MENGERSEN, K
    [J]. STATISTICAL SCIENCE, 1995, 10 (01) : 3 - 41
  • [8] BESAG J, 1974, J ROY STAT SOC B MET, V36, P192
  • [9] Review of the governing equations, computational algorithms, and other components of the models-3 Community Multiscale Air Quality (CMAQ) modeling system
    Byun, Daewon
    Schere, Kenneth L.
    [J]. APPLIED MECHANICS REVIEWS, 2006, 59 (1-6) : 51 - 77
  • [10] CARTER CK, 1994, BIOMETRIKA, V81, P541