Marginally parameterized spatio-temporal models and stepwise maximum likelihood estimation

被引:5
作者
Edwards, Matthew [1 ,2 ]
Castruccio, Stefano [3 ]
Hammerling, Dorit [4 ]
机构
[1] Newcastle Univ, Sch Math Stat & Phys, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
[2] Natl Innovat Ctr Data, Newcastle Upon Tyne, Tyne & Wear, England
[3] Univ Notre Dame, Dept Appl & Computat Math & Stat, Notre Dame, IN 46556 USA
[4] Colorado Sch Mines, Dept Appl Math & Stat, Golden, CO 80401 USA
基金
英国工程与自然科学研究理事会;
关键词
Spatio-temporal model; Massive data; Stepwise estimation; Parallel computation; TIME COVARIANCE FUNCTIONS; GAUSSIAN PROCESS MODELS; SPACE; NONSTATIONARY;
D O I
10.1016/j.csda.2020.107018
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In order to learn the complex features of large spatio-temporal data, models with a large number of parameters are often required. However, inference is often infeasible due to the computational and memory costs of maximum likelihood estimation (MLE). The class of marginally parameterized (MP) models is introduced, where estimation can be performed efficiently with a sequence of marginal likelihood functions with stepwise maximum likelihood estimation (SMLE). The conditions under which the stepwise estimators are consistent are provided, and it is shown that this class of models includes the diagonal vector autoregressive moving average model. It is demonstrated that the parameters of this model can be obtained at least three orders of magnitude faster with SMLE compared to MLE, with only a small loss in statistical efficiency. A MP model is applied to a spatio-temporal global climate data set consisting of over five million data points, and it is demonstrated how estimation can be achieved in less than one hour on a laptop with a dual core at 2.9 Ghz. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 43 条
[1]  
[Anonymous], 2001, All Likelihood: Statistical Modelling and Inference Using Likelihood
[2]  
[Anonymous], 1991, Nonlinear Optimization: Complexity Issues
[3]  
[Anonymous], 1977, Matekon: Translations of Russian and East European Mathematical Economics
[4]  
[Anonymous], 2005, NEW INTRO MULTIPLE T
[5]   Stationary process approximation for the analysis of large spatial datasets [J].
Banerjee, Sudipto ;
Gelfand, Alan E. ;
Finley, Andrew O. ;
Sang, Huiyan .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :825-848
[6]   Estimating Space and Space-Time Covariance Functions for Large Data Sets: A Weighted Composite Likelihood Approach [J].
Bevilacqua, Moreno ;
Gaetan, Carlo ;
Mateu, Jorge ;
Porcu, Emilio .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (497) :268-280
[7]   Comparing spatio-temporal models for particulate matter in Piemonte [J].
Cameletti, Michela ;
Ignaccolo, Rosaria ;
Bande, Stefano .
ENVIRONMETRICS, 2011, 22 (08) :985-996
[8]   A scalable multi-resolution spatio-temporal model for brain activation and connectivity in fMRI data [J].
Castruccio, Stefano ;
Ombao, Hernando ;
Genton, Marc G. .
BIOMETRICS, 2018, 74 (03) :823-833
[9]   Principles for statistical inference on big spatio-temporal data from climate models [J].
Castruccio, Stefano ;
Genton, Marc G. .
STATISTICS & PROBABILITY LETTERS, 2018, 136 :92-96
[10]   An evolutionary spectrum approach to incorporate large-scale geographical descriptors on global processes [J].
Castruccio, Stefano ;
Guinness, Joseph .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2017, 66 (02) :329-344