Consistent change-point detection with kernels

被引:42
作者
Garreau, Damien [1 ]
Arlot, Sylvain [2 ]
机构
[1] Inria Paris, Ctr rech, 2 Rue Simone Iff,CS 42112, F-75589 Paris 12, France
[2] Univ Paris Sud, CNRS, Univ Paris Saclay, Lab Math Orsay, F-91405 Orsay, France
关键词
Change-point detection; kernel methods; penalized least-squares; LEAST-SQUARES ESTIMATION; APPROXIMATION; SEGMENTATION; NUMBER;
D O I
10.1214/18-EJS1513
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper we study the kernel change-point algorithm (KCP) proposed by Arlot, Celisse and Harchaoui [5], which aims at locating an unknown number of change-points in the distribution of a sequence of independent data taking values in an arbitrary set. The change-points are selected by model selection with a penalized kernel empirical criterion. We provide a non-asymptotic result showing that, with high probability, the KCP procedure retrieves the correct number of change-points, provided that the constant in the penalty is well-chosen; in addition, KCP estimates the change-points location at the optimal rate. As a consequence, when using a characteristic kernel, KCP detects all kinds of change in the distribution (not only changes in the mean or the variance), and it is able to do so for complex structured data (not necessarily in R-d). Most of the analysis is conducted assuming that the kernel is bounded; part of the results can be extended when we only assume a finite second-order moment. We also demonstrate KCP on both synthetic and real data.
引用
收藏
页码:4440 / 4486
页数:47
相关论文
共 62 条
[1]  
Abou-Elailah Abdalbassir, 2015, Pattern Recognition Applications and Methods. 4th International Conference (ICPRAM 2015). Revised Selected Papers: LNCS 9493, P89, DOI 10.1007/978-3-319-27677-9_6
[2]   Adapting to unknown sparsity by controlling the false discovery rate [J].
Abramovich, Felix ;
Benjamini, Yoav ;
Donoho, David L. ;
Johnstone, Iain M. .
ANNALS OF STATISTICS, 2006, 34 (02) :584-653
[3]  
[Anonymous], 2013, P ESANN
[4]  
[Anonymous], 2012, P 25 INT C NEUR INF
[5]  
[Anonymous], 2011, Speech and Audio Signal Processing: Processing and Perception of Speech and Music, DOI 10.1002/9781118142882
[6]  
[Anonymous], 2007, Advances in Neural Information Processing Systems
[7]  
[Anonymous], 1955, Acta Mathematica Hungarica
[8]  
ARLOT S, 2012, PREPRINT
[9]   Segmentation of the mean of heteroscedastic data via cross-validation [J].
Arlot, Sylvain ;
Celisse, Alain .
STATISTICS AND COMPUTING, 2011, 21 (04) :613-632
[10]   THEORY OF REPRODUCING KERNELS [J].
ARONSZAJN, N .
TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1950, 68 (MAY) :337-404