Multiscale change point inference

被引:228
作者
Frick, Klaus [1 ]
Munk, Axel [2 ,3 ]
Sieling, Hannes [2 ]
机构
[1] Interstate Univ Appl Sci Technol, Buchs, Switzerland
[2] Univ Gottingen, D-37077 Gottingen, Germany
[3] Max Planck Inst Biophys Chem, D-37077 Gottingen, Germany
基金
美国国家科学基金会; 英国工程与自然科学研究理事会;
关键词
Multiscale methods; Dynamic programming; Honest confidence sets; Change point regression; Exponential families; COMPARATIVE GENOMIC HYBRIDIZATION; LEAST-SQUARES ESTIMATION; FALSE DISCOVERY RATE; NONPARAMETRIC REGRESSION; CONFIDENCE-REGIONS; SEQUENTIAL DETECTION; BAYESIAN-INFERENCE; DANTZIG SELECTOR; LIKELIHOOD RATIO; LINEAR-MODELS;
D O I
10.1111/rssb.12047
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We introduce a new estimator, the simultaneous multiscale change point estimator SMUCE, for the change point problem in exponential family regression. An unknown step function is estimated by minimizing the number of change points over the acceptance region of a multiscale test at a level alpha. The probability of overestimating the true number of change points K is controlled by the asymptotic null distribution of the multiscale test statistic. Further, we derive exponential bounds for the probability of underestimating K. By balancing these quantities, alpha will be chosen such that the probability of correctly estimating K is maximized. All results are even non-asymptotic for the normal case. On the basis of these bounds, we construct (asymptotically) honest confidence sets for the unknown step function and its change points. At the same time, we obtain exponential bounds for estimating the change point locations which for example yield the minimax rate O(n-1) up to a log-term. Finally, the simultaneous multiscale change point estimator achieves the optimal detection rate of vanishing signals as n ->infinity, even for an unbounded number of change points. We illustrate how dynamic programming techniques can be employed for efficient computation of estimators and confidence regions. The performance of the multiscale approach proposed is illustrated by simulations and in two cutting edge applications from genetic engineering and photoemission spectroscopy.
引用
收藏
页码:495 / 580
页数:86
相关论文
共 175 条
[1]   ON COMBINATORIAL TESTING PROBLEMS [J].
Addario-Berry, Louigi ;
Broutin, Nicolas ;
Devroye, Luc ;
Lugosi, Gabor .
ANNALS OF STATISTICS, 2010, 38 (05) :3063-3092
[2]   APPROXIMATION OF FUNCTIONALS DEPENDING ON JUMPS BY ELLIPTIC FUNCTIONALS VIA GAMMA-CONVERGENCE [J].
AMBROSIO, L ;
TORTORELLI, VM .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 1990, 43 (08) :999-1036
[3]  
[Anonymous], 2001, Scan Statistics
[4]  
[Anonymous], 1995, Stochastic Geometry and its Applications
[5]  
[Anonymous], 1997, LIMIT THEOREMS CHANG
[6]  
[Anonymous], 2005, INFERENCE CHANGE POI
[7]   Bayesian-type estimators of change points [J].
Antoch, J ;
Husková, M .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2000, 91 (02) :195-208
[8]   Testing for multiple change points [J].
Antoch, Jaromir ;
Jaruskova, Daniela .
COMPUTATIONAL STATISTICS, 2013, 28 (05) :2161-2183
[9]   Adaptive multiscale detection of filamentary structures in a background of uniform random points [J].
Arias-Castro, Ery ;
Donoho, David L. ;
Huo, Xiaoming .
ANNALS OF STATISTICS, 2006, 34 (01) :326-349
[10]   DETECTION OF AN ANOMALOUS CLUSTER IN A NETWORK [J].
Arias-Castro, Ery ;
Candes, Emmanuel J. ;
Durand, Arnaud .
ANNALS OF STATISTICS, 2011, 39 (01) :278-304