Detection and inference of changes in high-dimensional linear regression with nonsparse structures

被引:0
作者
Cho, Haeran [1 ]
Kley, Tobias [2 ]
Li, Housen [2 ]
机构
[1] Univ Bristol, Sch Math, Bristol, England
[2] Univ Gottingen, Inst Math Stochast, Gottingen, Germany
关键词
covariance scanning; data segmentation; differential parameter; post-segmentation inference; simultaneous confidence interval; CONFIDENCE-INTERVALS; SPARSE; LASSO; MODELS; SEGMENTATION; RATES;
D O I
10.1093/jrsssb/qkaf029
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
For data segmentation in high-dimensional linear regression settings, the regression parameters are often assumed to be exactly sparse segment-wise, which enables many existing methods to estimate the parameters locally via & ell;1-regularized maximum-likelihood-type estimation and then contrast them for change point detection. Contrary to this common practice, we show that the exact sparsity of neither regression parameters nor their differences, a.k.a. differential parameters, is necessary for consistency in multiple change point detection. In fact, both statistically and computationally, better efficiency is attained by a simple strategy that scans for large discrepancies in local covariance between the regressors and the response. We go a step further and propose a suite of tools for directly inferring about the differential parameters post-segmentation, which are applicable even when the regression parameters themselves are nonsparse. Theoretical investigations are conducted under general conditions permitting non-Gaussianity, temporal dependence, and ultra-high dimensionality. Numerical results from simulated and macroeconomic datasets demonstrate the competitiveness and efficacy of the proposed methods. Implementation of all methods is provided in the R package inferchange on GitHub.
引用
收藏
页数:25
相关论文
共 45 条
[1]   Estimating and testing linear models with multiple structural changes [J].
Bai, JS ;
Perron, P .
ECONOMETRICA, 1998, 66 (01) :47-78
[2]   Narrowest-over-threshold detection of multiple change points and change-point-like features [J].
Baranowski, Rafal ;
Chen, Yining ;
Fryzlewicz, Piotr .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2019, 81 (03) :649-672
[3]   SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR [J].
Bickel, Peter J. ;
Ritov, Ya'acov ;
Tsybakov, Alexandre B. .
ANNALS OF STATISTICS, 2009, 37 (04) :1705-1732
[4]  
Bradic J, 2022, ANN STAT, V50, P615, DOI [10.1214/19-AOS1932, 10.1214/19-aos1932]
[5]  
Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
[6]   Semisupervised inference for explained variance in high dimensional linear regression and its applications [J].
Cai, T. Tony ;
Guo, Zijian .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2020, 82 (02) :391-419
[7]   ESTIMATING SPARSE PRECISION MATRIX: OPTIMAL RATES OF CONVERGENCE AND ADAPTIVE ESTIMATION [J].
Cai, T. Tony ;
Liu, Weidong ;
Zhou, Harrison H. .
ANNALS OF STATISTICS, 2016, 44 (02) :455-488
[8]   A Direct Estimation Approach to Sparse Linear Discriminant Analysis [J].
Cai, Tony ;
Liu, Weidong .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (496) :1566-1577
[9]   A Constrained l1 Minimization Approach to Sparse Precision Matrix Estimation [J].
Cai, Tony ;
Liu, Weidong ;
Luo, Xi .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :594-607
[10]   NEARLY OPTIMAL CENTRAL LIMIT THEOREM AND BOOTSTRAP APPROXIMATIONS IN HIGH DIMENSIONS [J].
Chernozhukov, Victor ;
Chetverikov, Denis ;
Koike, Yuta .
ANNALS OF APPLIED PROBABILITY, 2023, 33 (03) :2374-2425