CHANGE-POINT DETECTION IN MULTINOMIAL DATA WITH A LARGE NUMBER OF CATEGORIES

被引:23
|
作者
Wang, Guanghui [1 ,2 ]
Zou, Changliang [1 ,2 ]
Yin, Guosheng [3 ]
机构
[1] Nankai Univ, Inst Stat, Tianjin, Peoples R China
[2] Nankai Univ, LPMC, Tianjin, Peoples R China
[3] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Hong Kong, Peoples R China
关键词
Asymptotic normality; categorical data; high-dimensional homogeneity test; multiple change-point detection; sparse contingency table; TIME-SERIES; MULTIPLE; MODELS;
D O I
10.1214/17-AOS1610
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a sequence of multinomial data for which the probabilities associated with the categories are subject to abrupt changes of unknown magnitudes at unknown locations. When the number of categories is comparable to or even larger than the number of subjects allocated to these categories, conventional methods such as the classical Pearson's chi-squared test and the deviance test may not work well. Motivated by high-dimensional homogeneity tests, we propose a novel change-point detection procedure that allows the number of categories to tend to infinity. The null distribution of our test statistic is asymptotically normal and the test performs well with finite samples. The number of change-points is determined by minimizing a penalized objective function based on segmentation, and the locations of the change-points are estimated by minimizing the objective function with the dynamic programming algorithm. Under some mild conditions, the consistency of the estimators of multiple change-points is established. Simulation studies show that the proposed method performs satisfactorily for identifying change-points in terms of power and estimation accuracy, and it is illustrated with an analysis of a real data set.
引用
收藏
页码:2020 / 2044
页数:25
相关论文
共 50 条
  • [31] Water quality monitoring with online change-point detection methods
    Ba, Amadou
    McKenna, Sean A.
    JOURNAL OF HYDROINFORMATICS, 2015, 17 (01) : 7 - 19
  • [32] Epidemic change-point detection in general causal time series
    Diop, Mamadou Lamine
    Kengne, William
    STATISTICS & PROBABILITY LETTERS, 2022, 184
  • [33] Change-point testing for parallel data sets with FDR control
    Cui, Junfeng
    Wang, Guanghui
    Zou, Changliang
    Wang, Zhaojun
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 182
  • [34] On the isotonic change-point problem
    Shen, Gang
    Xu, Hui
    JOURNAL OF NONPARAMETRIC STATISTICS, 2013, 25 (04) : 923 - 937
  • [35] TAIL-GREEDY BOTTOM-UP DATA DECOMPOSITIONS AND FAST MULTIPLE CHANGE-POINT DETECTION
    Fryzlewicz, Piotr
    ANNALS OF STATISTICS, 2018, 46 (6B) : 3390 - 3421
  • [36] A NOVEL CHANGE-POINT APPROACH FOR THE DETECTION OF GAS EMISSION SOURCES USING REMOTELY CONTAINED CONCENTRATION DATA
    Eckley, Idris
    Kirch, Claudia
    Weber, Silke
    ANNALS OF APPLIED STATISTICS, 2020, 14 (03) : 1258 - 1284
  • [37] A Fast Screen and Shape Recognition Algorithm for Multiple Change-Point Detection
    Zhuang, Dan
    Liu, Youbo
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2018, 2018
  • [38] Generalized multiple change-point detection in the structure of multivariate, possibly high-dimensional, data sequences
    Anastasiou, Andreas
    Papanastasiou, Angelos
    STATISTICS AND COMPUTING, 2023, 33 (05)
  • [39] Bayesian Change-Point Detection via Context-Tree Weighting
    Lungu, Valentinian
    Papageorgiou, Ioannis
    Kontoyiannis, Ioannis
    2022 IEEE INFORMATION THEORY WORKSHOP (ITW), 2022, : 125 - 130
  • [40] Feature Extraction for Change-Point Detection Using Stationary Subspace Analysis
    Blythe, Duncan A. J.
    von Buenau, Paul
    Meinecke, Frank C.
    Mueller, Klaus-Robert
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (04) : 631 - 643