CHANGE-POINT DETECTION IN MULTINOMIAL DATA WITH A LARGE NUMBER OF CATEGORIES

被引:23
|
作者
Wang, Guanghui [1 ,2 ]
Zou, Changliang [1 ,2 ]
Yin, Guosheng [3 ]
机构
[1] Nankai Univ, Inst Stat, Tianjin, Peoples R China
[2] Nankai Univ, LPMC, Tianjin, Peoples R China
[3] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Hong Kong, Peoples R China
关键词
Asymptotic normality; categorical data; high-dimensional homogeneity test; multiple change-point detection; sparse contingency table; TIME-SERIES; MULTIPLE; MODELS;
D O I
10.1214/17-AOS1610
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a sequence of multinomial data for which the probabilities associated with the categories are subject to abrupt changes of unknown magnitudes at unknown locations. When the number of categories is comparable to or even larger than the number of subjects allocated to these categories, conventional methods such as the classical Pearson's chi-squared test and the deviance test may not work well. Motivated by high-dimensional homogeneity tests, we propose a novel change-point detection procedure that allows the number of categories to tend to infinity. The null distribution of our test statistic is asymptotically normal and the test performs well with finite samples. The number of change-points is determined by minimizing a penalized objective function based on segmentation, and the locations of the change-points are estimated by minimizing the objective function with the dynamic programming algorithm. Under some mild conditions, the consistency of the estimators of multiple change-points is established. Simulation studies show that the proposed method performs satisfactorily for identifying change-points in terms of power and estimation accuracy, and it is illustrated with an analysis of a real data set.
引用
收藏
页码:2020 / 2044
页数:25
相关论文
共 50 条
  • [41] Multiple change-point detection of multivariate mean vectors with the Bayesian approach
    Cheon, Sooyoung
    Kim, Jaehee
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (02) : 406 - 415
  • [42] Application of change-point analysis to the selection of representative data in creep experiments
    Zomorodpoosh, Setareh
    Volz, Nicklas
    Neumeier, Steffen
    Roslyakova, Irina
    JOURNAL OF PHYSICS COMMUNICATIONS, 2020, 4 (07): : 1 - 11
  • [43] Nonparametric Sequential Change-Point Detection by a Vertically Trimmed Box Method
    Rafajlowicz, Ewaryst
    Pawlak, Miroslaw
    Steland, Ansgar
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2010, 56 (07) : 3621 - 3634
  • [44] mosum: A Package for Moving Sums in Change-Point Analysis
    Meier, Alexander
    Kirch, Claudia
    Cho, Haeran
    JOURNAL OF STATISTICAL SOFTWARE, 2021, 97 (08): : 1 - 42
  • [45] Cloud Incident Data Analytics: Change-point Analysis and Text Visualization
    Chang, Hsia-Ching
    Wang, Chen-Ya
    2015 48TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2015, : 5320 - 5330
  • [46] Sequential Change-Point Detection via the Cross-Entropy Method
    Sofronov, Georgy
    Polushina, Tatiana
    Priyadarshana, Madawa
    ELEVENTH SYMPOSIUM ON NEURAL NETWORK APPLICATIONS IN ELECTRICAL ENGINEERING (NEUREL 2012), 2012,
  • [47] cpss: an package for change-point detection by sample-splitting methods
    Wang, Guanghui
    Zou, Changliang
    JOURNAL OF QUALITY TECHNOLOGY, 2023, 55 (01) : 43 - 56
  • [48] Change-Point Detection on Solar Panel Performance Using Thresholded LASSO
    Choe, Youngjun
    Guo, Weihong
    Byon, Eunshin
    Jin, Jionghua
    Li, Jingjing
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2016, 32 (08) : 2653 - 2665
  • [49] Change-point analysis in increasing dimension
    Jirak, Moritz
    JOURNAL OF MULTIVARIATE ANALYSIS, 2012, 111 : 136 - 159
  • [50] Some results on change-point detection in cross-sectional dependence of multivariate data with changes in marginal distributions
    Rohmer, Tom
    STATISTICS & PROBABILITY LETTERS, 2016, 119 : 45 - 54