CHANGE-POINT DETECTION IN MULTINOMIAL DATA WITH A LARGE NUMBER OF CATEGORIES

被引:23
|
作者
Wang, Guanghui [1 ,2 ]
Zou, Changliang [1 ,2 ]
Yin, Guosheng [3 ]
机构
[1] Nankai Univ, Inst Stat, Tianjin, Peoples R China
[2] Nankai Univ, LPMC, Tianjin, Peoples R China
[3] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Hong Kong, Peoples R China
关键词
Asymptotic normality; categorical data; high-dimensional homogeneity test; multiple change-point detection; sparse contingency table; TIME-SERIES; MULTIPLE; MODELS;
D O I
10.1214/17-AOS1610
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a sequence of multinomial data for which the probabilities associated with the categories are subject to abrupt changes of unknown magnitudes at unknown locations. When the number of categories is comparable to or even larger than the number of subjects allocated to these categories, conventional methods such as the classical Pearson's chi-squared test and the deviance test may not work well. Motivated by high-dimensional homogeneity tests, we propose a novel change-point detection procedure that allows the number of categories to tend to infinity. The null distribution of our test statistic is asymptotically normal and the test performs well with finite samples. The number of change-points is determined by minimizing a penalized objective function based on segmentation, and the locations of the change-points are estimated by minimizing the objective function with the dynamic programming algorithm. Under some mild conditions, the consistency of the estimators of multiple change-points is established. Simulation studies show that the proposed method performs satisfactorily for identifying change-points in terms of power and estimation accuracy, and it is illustrated with an analysis of a real data set.
引用
收藏
页码:2020 / 2044
页数:25
相关论文
共 50 条
  • [1] Multiple Change-Point Detection in Spatiotemporal Seismicity Data
    Fiedler, Bernhard
    Zoeller, Gert
    Holschneider, Matthias
    Hainzl, Sebastian
    BULLETIN OF THE SEISMOLOGICAL SOCIETY OF AMERICA, 2018, 108 (3A) : 1147 - 1159
  • [2] The Bethe Hessian and Information Theoretic Approaches for Online Change-Point Detection in Network Data
    Hwang, Neil
    Xu, Jiarui
    Bhattacharyya, Sharmodeep
    Chatterjee, Shirshendu
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2022, 84 (01): : 283 - 320
  • [3] On the Performances of Trend and Change-Point Detection Methods for Remote Sensing Data
    Militino, Ana F.
    Moradi, Mehdi
    Dolores Ugarte, M.
    REMOTE SENSING, 2020, 12 (06)
  • [4] Change-point detection for shifts in control charts using fuzzy shift change-point algorithms
    Lu, Kang-Ping
    Chang, Shao-Tung
    Yang, Miin-Shen
    COMPUTERS & INDUSTRIAL ENGINEERING, 2016, 93 : 12 - 27
  • [5] Identifying localized changes in large systems: Change-point detection for biomolecular simulations
    Fan, Zhou
    Dror, Ron O.
    Mildorf, Thomas J.
    Piana, Stefano
    Shaw, David E.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (24) : 7454 - 7459
  • [6] Optimal Change-Point Detection With Training Sequences in the Large and Moderate Deviations Regimes
    He, Haiyun
    Zhang, Qiaosheng
    Tan, Vincent Y. F.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (10) : 6758 - 6784
  • [7] A general procedure for change-point detection in multivariate time series
    Diop, Mamadou Lamine
    Kengne, William
    TEST, 2023, 32 (01) : 1 - 33
  • [8] Change-point detection with deep learning: A review
    Xu, Ruiyu
    Song, Zheren
    Wu, Jianguo
    Wang, Chao
    Zhou, Shiyu
    FRONTIERS OF ENGINEERING MANAGEMENT, 2025, 12 (01) : 154 - 176
  • [9] Multiple change-point detection for regression curves
    Wang, Yunlong
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2024, 52 (04):
  • [10] Simultaneous change-point detection and curve estimation
    Lu, Zhaoying
    Hao, Ning
    Zhang, Hao Helen
    STATISTICS AND ITS INTERFACE, 2024, 17 (03) : 493 - 500