FRECHET CHANGE-POINT DETECTION

被引:23
作者
Dubey, Paromita [1 ]
Mueller, Hans-Georg [1 ]
机构
[1] Univ Calif Davis, Dept Stat, Davis, CA 95616 USA
关键词
Bootstrap; Brownian bridge; dynamics of networks; empirical processes; graph Laplacians; metric space; object data; random densities; random objects; LIKELIHOOD RATIO TESTS; MULTIVARIATE; EMERGENCE; VARIANCE;
D O I
10.1214/19-AOS1930
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a method to infer the presence and location of change-points in the distribution of a sequence of independent data taking values in a general metric space, where change-points are viewed as locations at which the distribution of the data sequence changes abruptly in terms of either its Frechet mean, Frechet variance or both. The proposed method is based on comparisons of Frechet variances before and after putative change-point locations and does not require a tuning parameter, except for the specification of cut-off intervals near the endpoints where change-points are assumed not to occur. Our results include theoretical guarantees for consistency of the test under contiguous alternatives when a change-point exists and also for consistency of the estimated location of the change-point, if it exists, where, under the null hypothesis of no change-point, the limit distribution of the proposed scan function is the square of a standardized Brownian bridge. These consistency results are applicable for a broad class of metric spaces under mild entropy conditions. Examples include the space of univariate probability distributions and the space of graph Laplacians for networks. Simulation studies demonstrate the effectiveness of the proposed methods, both for inferring the presence of a change-point and estimating its location. We also develop theory that justifies bootstrap-based inference and illustrate the new approach with sequences of maternal fertility distributions and communication networks.
引用
收藏
页码:3312 / 3335
页数:24
相关论文
共 43 条
  • [1] [Anonymous], 2018, ARXIV180602740
  • [2] [Anonymous], 1948, ANN I HENRI POINCARE
  • [3] [Anonymous], 2015, 29 AAAI C ART INT
  • [4] A remark on approximate M-estimators
    Arcones, MA
    [J]. STATISTICS & PROBABILITY LETTERS, 1998, 38 (04) : 311 - 321
  • [5] Arlot S., 2012, ARXIV12023878
  • [6] Emergence of scaling in random networks
    Barabási, AL
    Albert, R
    [J]. SCIENCE, 1999, 286 (5439) : 509 - 512
  • [7] A comparison of normalization methods for high density oligonucleotide array data based on variance and bias
    Bolstad, BM
    Irizarry, RA
    Åstrand, M
    Speed, TP
    [J]. BIOINFORMATICS, 2003, 19 (02) : 185 - 193
  • [8] CARLSTEIN E., 1994, LECT NOTES MONOGRAPH
  • [9] GEODESIC PCA VERSUS LOG-PCA OF HISTOGRAMS IN THE WASSERSTEIN SPACE
    Cazelles, Elsa
    Seguy, Vivien
    Bigot, Jeremie
    Cuturi, Marco
    Papadakis, Nicolas
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2018, 40 (02) : B429 - B456
  • [10] A New Graph-Based Two-Sample Test for Multivariate and Object Data
    Chen, Hao
    Friedman, Jerome H.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (517) : 397 - 409