Change point detection in high dimensional data with U-statistics

被引:0
作者
Boniece, B. Cooper [1 ]
Horvath, Lajos [2 ]
Jacobs, Peter M. [3 ]
机构
[1] Drexel Univ, Dept Math, Philadelphia, PA 19104 USA
[2] Univ Utah, Dept Math, Salt Lake City, UT 84112 USA
[3] Univ Utah, Sch Comp, Salt Lake City, UT 84112 USA
基金
英国科研创新办公室;
关键词
Large dimensional vectors; U-statistics; Weak convergence; Change point; Twitter data; MULTIVARIATE; DISTANCE; METRICS;
D O I
10.1007/s11749-023-00900-y
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of detecting distributional changes in a sequence of high dimensional data. Our approach combines two separate statistics stemming from L-p norms whose behavior is similar under H-0 but potentially different under HA, leading to a testing procedure that that is flexible against a variety of alternatives. We establish the asymptotic distribution of our proposed test statistics separately in cases of weakly dependent and strongly dependent coordinates as min{N, d} -> infinity, where N denotes sample size and d is the dimension, and establish consistency of testing and estimation procedures in high dimensions under one-change alternative settings. Computational studies in single and multiple change point scenarios demonstrate our method can outperform other nonparametric approaches in the literature for certain alternatives in high dimensions. We illustrate our approach through an application to Twitter data concerning the mentions of U.S. governors.
引用
收藏
页码:400 / 452
页数:53
相关论文
共 41 条
  • [1] Abadir KM, 2005, Matrix Algebra, V1
  • [2] Hypervisor-based cloud intrusion detection through online multivariate statistical change tracking
    Aldribi, Abdulaziz
    Traore, Issa
    Moa, Belaid
    Nwamuo, Onyekachi
    [J]. COMPUTERS & SECURITY, 2020, 88
  • [3] High-Dimensional Data in Genomics
    Amaratunga, Dhammika
    Cabrera, Javier
    [J]. BIOPHARMACEUTICAL APPLIED STATISTICS SYMPOSIUM, VOL 3: PHARMACEUTICAL APPLICATIONS, 2018, : 65 - 73
  • [4] Permutation tests in change point analysis
    Antoch, J
    Husková, M
    [J]. STATISTICS & PROBABILITY LETTERS, 2001, 53 (01) : 37 - 46
  • [5] Arlot S, 2019, J MACH LEARN RES, V20
  • [6] JACKKNIFING U-STATISTICS
    ARVESEN, JN
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1969, 40 (06): : 2076 - &
  • [7] Long signal change-point detection
    Biau, Gerard
    Bleakley, Kevin
    Mason, David M.
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2016, 10 (02): : 2097 - 2123
  • [8] Billingsley P., 1999, Convergence of probability measures, DOI [10.1002/9780470316962, DOI 10.1002/9780470316962]
  • [9] Breiman L., 1968, Probability
  • [10] Chakraborty S., 2021, ARXIV