A Nonparametric Approach for Multiple Change Point Analysis of Multivariate Data

被引:371
作者
Matteson, David S. [1 ]
James, Nicholas A. [2 ]
机构
[1] Cornell Univ, Dept Stat Sci, Ithaca, NY 14853 USA
[2] Cornell Univ, Sch Operat Res & Informat Engn, Ithaca, NY 14853 USA
基金
美国国家科学基金会;
关键词
Cluster analysis; Multivariate time series; Permutation tests; Signal processing; U-statistics;
D O I
10.1080/01621459.2013.849605
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Change point analysis has applications in a wide variety of fields. The general problem concerns the inference of a change in distribution for a set of time-ordered observations. Sequential detection is an online version in which new data are continually arriving and are analyzed adaptively. We are concerned with the related, but distinct, offline version, in which retrospective analysis of an entire sequence is performed. For a set of multivariate observations of arbitrary dimension, we consider nonparametric estimation of both the number of change points and the positions at which they occur. We do not make any assumptions regarding the nature of the change in distribution or any distribution assumptions beyond the existence of the alpha th absolute moment, for some alpha epsilon (0, 2). Estimation is based on hierarchical clustering and we propose both divisive and agglomerative algorithms. The divisive method is shown to provide consistent estimates of both the number and the location of change points under standard regularity assumptions. We compare the proposed approach with competing methods in a simulation study. Methods from cluster analysis are applied to assess performance and to allow simple comparisons of location estimates, even when the estimated number differs. We conclude with applications in genetics, finance, and spatio-temporal analysis. Supplementary materials for this article are available online.
引用
收藏
页码:334 / 345
页数:12
相关论文
共 39 条
[1]  
Akoglu Leman, 2010, ARM SCI C, V1
[2]  
[Anonymous], IEEE WORKSH STAT SIG
[3]  
[Anonymous], 2012, R LANG ENV STAT COMP
[4]  
[Anonymous], 2013, ARXIV13093295
[5]  
[Anonymous], 2011, ARXIV11071971
[6]  
Arlot Sylvain., 2012, Kernel change-point detection. arXiv
[7]   Optimal rate of convergence for nonparametric change-point estimators for nonstationary sequences [J].
Ben Hariz, Samir ;
Wylie, Jonathan J. ;
Zhang, Qiang .
ANNALS OF STATISTICS, 2007, 35 (04) :1802-1826
[8]  
Bleakley K., 2011, HAL00602121 CBIO
[9]  
Bolton RJ, 2002, STAT SCI, V17, P235
[10]   HIERARCHICAL BAYESIAN-ANALYSIS OF CHANGEPOINT PROBLEMS [J].
CARLIN, BP ;
GELFAND, AE ;
SMITH, AFM .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 1992, 41 (02) :389-405