HIGH-DIMENSIONAL LINEAR REGRESSION FOR DEPENDENT DATA WITH APPLICATIONS TO NOWCASTING

被引:14
作者
Han, Yuefeng [1 ]
Tsay, Ruey S. [2 ]
机构
[1] Univ Chicago, 5747 South Ellis Ave, Chicago, IL 60637 USA
[2] Univ Chicago, 5807 South Woodlawn Ave, Chicago, IL 60637 USA
关键词
Consistency; forecasting; high-dimensional time series; Lasso; mixed-frequency data; model selection; nowcasting; MODEL SELECTION; LASSO; FREEDOM;
D O I
10.5705/ss.202018.0044
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Recent research has focused on l(1) penalized least squares (Lasso) estimators for high-dimensional linear regressions in which the number of covariates p is considerably larger than the sample size n. However, few studies have examined the properties of the estimators when the errors and/or the covariates are serially dependent. In this study, we investigate the theoretical properties of the Lasso estimator for a linear regression with a random design and weak sparsity under serially dependent and/or nonsubGaussian errors and covariates. In contrast to the traditional case, in which the errors are independent and identically distributed and have finite exponential moments, we show that p can be at most a power of n if the errors have only finite polynomial moments. In addition, the rate of convergence becomes slower owing to the serial dependence in the errors and the covariates. We also consider the sign consistency of the model selection using the Lasso estimator when there are serial correlations in the errors or the covariates, or both. Adopting the framework of a functional dependence measure, we describe how the rates of convergence and the selection consistency of the estimators depend on the dependence measures and moment conditions of the errors and the covariates. Simulation results show that a Lasso regression can be significantly more powerful than a mixed-frequency data sampling regression (MIDAS) and a Dantzig selector in the presence of irrelevant variables. We apply the results obtained for the Lasso method to nowcasting with mixed-frequency data, in which serially correlated errors and a large number of covariates are common. The empirical results show that the Lasso procedure outperforms the MIDAS regression and the autoregressive model with exogenous variables in terms of both forecasting and nowcasting.
引用
收藏
页码:1797 / 1827
页数:31
相关论文
共 28 条
[1]  
[Anonymous], 2005, Analysis of financial time series', DOI DOI 10.1002/0471746193
[2]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[3]  
[Anonymous], 1990, Non-linear Time Series: A Dynamical System Approach
[4]  
[Anonymous], 1988, Nonlinear and Nonstationary Time Series Analysis
[5]   REGULARIZED ESTIMATION IN SPARSE HIGH-DIMENSIONAL TIME SERIES MODELS [J].
Basu, Sumanta ;
Michailidis, George .
ANNALS OF STATISTICS, 2015, 43 (04) :1535-1567
[6]   SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR [J].
Bickel, Peter J. ;
Ritov, Ya'acov ;
Tsybakov, Alexandre B. .
ANNALS OF STATISTICS, 2009, 37 (04) :1705-1732
[7]  
Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
[8]  
Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523
[9]   COVARIANCE AND PRECISION MATRIX ESTIMATION FOR HIGH-DIMENSIONAL TIME SERIES [J].
Chen, Xiaohui ;
Xu, Mengyu ;
Wu, Wei Biao .
ANNALS OF STATISTICS, 2013, 41 (06) :2994-3021
[10]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499