MULTIVARIATE SPATIO-TEMPORAL MODELS FOR HIGH-DIMENSIONAL AREAL DATA WITH APPLICATION TO LONGITUDINAL EMPLOYER-HOUSEHOLD DYNAMICS

被引:62
作者
Bradley, Jonathan R. [1 ]
Holan, Scott H. [1 ]
Wikle, Christopher K. [1 ]
机构
[1] Univ Missouri, Dept Stat, 146 Middlebush Hall, Columbia, MO 65211 USA
基金
美国国家科学基金会;
关键词
Bayesian hierarchical model; Longitudinal Employer-Household Dynamics (LEHD) program; Kalman filter; Markov chain Monte Carlo; multivariate spatio-temporal data; Moran's I basis; SPATIAL-FILTERING SPECIFICATION;
D O I
10.1214/15-AOAS862
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Many data sources report related variables of interest that are also referenced over geographic regions and time; however, there are relatively few general statistical methods that one can readily use that incorporate these multivariate spatio-temporal dependencies. Additionally, many multivariate spatio-temporal areal data sets are extremely high dimensional, which leads to practical issues when formulating statistical models. For example, we analyze Quarterly Workforce Indicators (QWI) published by the US Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) program. QWIs are available by different variables, regions, and time points, resulting in millions of tabulations. Despite their already expansive coverage, by adopting a fully Bayesian framework, the scope of the QWIs can be extended to provide estimates of missing values along with associated measures of uncertainty. Motivated by the LEHD, and other applications in federal statistics, we introduce the multivariate spatio-temporal mixed effects model (MSTM), which can be used to efficiently model high-dimensional multivariate spatio-temporal areal data sets. The proposed MSTM extends the notion of Moran's I basis functions to the multivariate spatio-temporal setting. This extension leads to several methodological contributions, including extremely effective dimension reduction, a dynamic linear model for multivariate spatio-temporal areal processes, and the reduction of a high-dimensional parameter space using a novel parameter model.
引用
收藏
页码:1761 / 1791
页数:31
相关论文
共 55 条
[1]  
Abowd JM, 2009, STUD INCOME, V68, P149
[2]  
Aldworth J, 1999, STAT TEXTB MONOG, V159, P1
[3]  
ALLEGRETTO S., 2013, WORKING PAPER SERIES, V1-63
[4]  
[Anonymous], STAT SPATIOTEMPORAL
[5]  
[Anonymous], TIME SERIES ANAL ITS
[6]  
[Anonymous], 2013, J PRIVACY CONFIDENTI
[7]  
Banerjee S., 2004, HIERARCHICAL MODELIN
[8]   Stationary process approximation for the analysis of large spatial datasets [J].
Banerjee, Sudipto ;
Gelfand, Alan E. ;
Finley, Andrew O. ;
Sang, Huiyan .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :825-848
[9]   Hierarchical Spatial Process Models for Multiple Traits in Large Genetic Trials [J].
Banerjee, Sudipto ;
Finley, Andrew O. ;
Waldmann, Patrik ;
Ericsson, Tore .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (490) :506-521
[10]  
Bell W.R., 1990, Survey Methodology, V16, P195