Consistent change-point detection with kernels

Cited by: 42
Authors
Garreau, Damien [1]
Arlot, Sylvain [2]
Affiliations
[1] Inria Paris, Ctr rech, 2 Rue Simone Iff,CS 42112, F-75589 Paris 12, France
[2] Univ Paris Sud, CNRS, Univ Paris Saclay, Lab Math Orsay, F-91405 Orsay, France
Keywords
Change-point detection; kernel methods; penalized least-squares; LEAST-SQUARES ESTIMATION; APPROXIMATION; SEGMENTATION; NUMBER;
DOI
10.1214/18-EJS1513
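Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code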
020208 ; 070103 ; 0714 ;
Abstract
In this paper we study the kernel change-point algorithm (KCP) proposed by Arlot, Celisse and Harchaoui [5], which aims at locating an unknown number of change-points in the distribution of a sequence of independent data taking values in an arbitrary set. The change-points are selected by model selection with a penalized kernel empirical criterion. We provide a non-asymptotic result showing that, with high probability, the KCP procedure retrieves the correct number of change-points, provided that the constant in the penalty is well-chosen; in addition, KCP estimates the change-point locations at the optimal rate. As a consequence, when using a characteristic kernel, KCP detects all kinds of change in the distribution (not only changes in the mean or the variance), and it is able to do so for complex structured data (not necessarily in R^d). Most of the analysis is conducted assuming that the kernel is bounded; part of the results can be extended when we only assume a finite second-order moment. We also demonstrate KCP on both synthetic and real data.
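To make the idea of "model selection with a penalized kernel empirical criterion" more concrete, here is a minimal Python sketch, not the authors' implementation. It assumes a Gaussian kernel, a kernel least-squares cost per segment, an exact dynamic-programming search over segmentations, and a penalty that grows linearly with the number of segments. The function names (`gaussian_gram`, `segment_cost`, `kcp_sketch`) and the parameters `penalty_constant`, `max_segments` and `bandwidth` are hypothetical and chosen for illustration only; the exact penalty shape studied in the paper may differ.

```python
import numpy as np

def gaussian_gram(x, bandwidth=1.0):
    """Gram matrix of a Gaussian kernel (assumed kernel choice)."""
    x = np.asarray(x, dtype=float).reshape(len(x), -1)
    sq_dist = np.sum((x[:, None, :] - x[None, :, :]) ** 2, axis=-1)
    return np.exp(-sq_dist / (2.0 * bandwidth ** 2))

def segment_cost(K):
    """cost[s, e] = kernel least-squares cost of the segment x[s:e], s < e."""
    n = K.shape[0]
    # 2-D prefix sums so that any diagonal block sum is O(1) to evaluate.
    P = np.zeros((n + 1, n + 1))
    P[1:, 1:] = K.cumsum(axis=0).cumsum(axis=1)
    diag = np.concatenate(([0.0], np.cumsum(np.diag(K))))
    cost = np.full((n + 1, n + 1), np.inf)
    for s in range(n):
        for e in range(s + 1, n + 1):
            block = P[e, e] - P[s, e] - P[e, s] + P[s, s]
            cost[s, e] = (diag[e] - diag[s]) - block / (e - s)
    return cost

def kcp_sketch(x, penalty_constant, max_segments, bandwidth=1.0):
    """Return estimated change-point indices (assumes max_segments <= len(x))."""
    n = len(x)
    cost = segment_cost(gaussian_gram(x, bandwidth))
    # best[D, e] = minimal total cost of splitting x[:e] into D segments.
    best = np.full((max_segments + 1, n + 1), np.inf)
    arg = np.zeros((max_segments + 1, n + 1), dtype=int)
    best[0, 0] = 0.0
    for D in range(1, max_segments + 1):
        for e in range(D, n + 1):
            candidates = best[D - 1, :e] + cost[:e, e]
            arg[D, e] = int(np.argmin(candidates))
            best[D, e] = candidates[arg[D, e]]
    # Penalized criterion: empirical risk plus an (assumed) linear penalty.
    n_segments = np.arange(1, max_segments + 1)
    criterion = best[1:, n] / n + penalty_constant * n_segments / n
    d_hat = int(np.argmin(criterion)) + 1
    # Backtrack through the DP table to recover the segment boundaries.
    change_points, e = [], n
    for D in range(d_hat, 0, -1):
        s = int(arg[D, e])
        if s > 0:
            change_points.append(s)
        e = s
    return sorted(change_points)

# Usage: a piecewise-constant toy signal with one jump at index 50.
signal = np.concatenate([np.zeros(50), 3.0 * np.ones(50)])
print(kcp_sketch(signal, penalty_constant=1.0, max_segments=5))
```

The search is exact but naive: filling the cost table takes quadratic time and memory in the sample size, which is fine for a toy signal but not for long sequences. The returned indices are the positions at which a new segment starts.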
Pages: 4440-4486
Page count: 47
Related papers
62 entries in total
[41]   Least-squares estimation of an unknown number of shifts in a time series [J].
Lavielle, M ;
Moulines, E .
JOURNAL OF TIME SERIES ANALYSIS, 2000, 21 (01) :33-59
[42]  
Lavielle M, 2006, Lithuanian Mathematical Journal, V46, P287
[43]   Detecting multiple change-points in the mean of Gaussian process by model selection [J].
Lebarbier, E .
SIGNAL PROCESSING, 2005, 85 (04) :717-736
[44]  
LEDOUX M, 2013, PROBABILITY BANACH S, V23
[45]  
Li S., 2015, Adv. Neural Inf. Process. Syst., V28, P3366
[46]  
Liu J, 1997, STAT SINICA, V7, P497
[47]  
LIU S, 2017, ANN STAT
[48]   CONTROL CHARTS WITH WARNING LINES [J].
PAGE, ES .
BIOMETRIKA, 1955, 42 (1-2) :243-257
[49]   Detection of onset of neuronal activity by allowing for heterogeneity in the change points [J].
Ritov, Y ;
Raz, A ;
Bergman, H .
JOURNAL OF NEUROSCIENCE METHODS, 2002, 122 (01) :25-42
[50]  
SCHOLKOPF B, 2002, LEARNING KERNELS SUP