Statistically and computationally efficient change point localization in regression settings

被引:0
作者
Wang, Daren [1 ]
Zhao, Zifeng [2 ]
Lin, Kevin Z. [3 ]
Willett, Rebecca [4 ]
机构
[1] Department of ACMS, University of Notre Dame, Indiana, United States
[2] Mendoza College of Business, University of Notre Dame, Indiana, United States
[3] Wharton Department of Statistics and Data Science, University of Pennsylvania, Pennsylvania, United States
[4] Department of Statistics, University of Chicago, Illinois, United States
关键词
Time series analysis;
D O I
暂无
中图分类号
学科分类号
摘要
Detecting when the underlying distribution changes for the observed time series is a fundamental problem arising in a broad spectrum of applications. In this paper, we study multiple change-point localization in the high-dimensional regression setting, which is particularly challenging as no direct observations of the parameter of interest is available. Specifically, we assume we observe {xt, yt}nt=1 where {xt}nt=1 are p-dimensional covariates, {yt}nt=1 are the univariate responses satisfying E(yt) = x>t βt∗ for 1 ≤ t ≤ n and {βt∗}nt=1 are the unobserved regression coefficients that change over time in a piecewise constant manner. We propose a novel projection-based algorithm, Variance Projected Wild Binary Segmentation (VPWBS), which transforms the original (difficult) problem of change-point detection in p-dimensional regression to a simpler problem of change-point detection in mean of a one-dimensional time series. VPWBS is shown to achieve sharp localization rate Op(1/n) up to a log factor, a significant improvement from the best rate Op(1/√n) known in the existing literature for multiple change-point localization in high-dimensional regression. Extensive numerical experiments are conducted to demonstrate the robust and favorable performance of VPWBS over two state-of-the-art algorithms, especially when the size of change in the regression coefficients {βt∗}nt=1 is small. ©2021 Daren Wang, Zifeng Zhao, Kevin Z. Lin and Rebecca Willett.
引用
收藏
相关论文
empty
未找到相关数据