An adaptive approach for online monitoring of large-scale data streams

Authors
Cao, Shuchen [1]
Zhang, Ruizhi [2]
Affiliations
[1] Univ Nebraska Lincoln, Dept Stat, Lincoln, NE USA
[2] Univ Georgia, Dept Stat, Athens, GA USA
Keywords
False discovery rate; CUSUM; quickest change detection; process control; change-point detection; schemes
DOI
10.1080/24725854.2023.2281580
Chinese Library Classification
T [Industrial Technology]
Subject classification code
08
Abstract
In this article, we propose an adaptive top-r method for monitoring large-scale data streams in which a change may affect an unknown subset of the streams at some unknown time. Motivated by parallel and distributed computing, we develop global monitoring schemes by running local detection procedures in parallel and then using the Benjamini-Hochberg false discovery rate control procedure to estimate the number of changed data streams adaptively. Our approach is illustrated with two concrete examples. The first is a homogeneous case in which all data streams are independent and identically distributed with the same known pre-change and post-change distributions. The second is a case in which all data are normally distributed and the mean shifts are unknown and can be positive or negative. Theoretically, we show that when the pre-change and post-change distributions are completely specified, our proposed method can estimate the number of changed data streams in both the pre-change and post-change states. Moreover, we perform simulations and two case studies to show its detection efficiency.
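The idea in the abstract (local detection statistics computed in parallel, a Benjamini-Hochberg step to estimate how many streams have changed, then a top-r global statistic) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a one-sided CUSUM for a known mean shift from N(0, 1) to N(mu, 1) as the local procedure, empirical p-values taken from a reference sample of pre-change statistics, and a fallback to r = 1 when nothing is rejected. All function and parameter names (`cusum_update`, `bh_num_rejections`, `adaptive_top_r`, `null_stats`) are hypothetical.

```python
from bisect import bisect_left


def cusum_update(w, x, mu):
    """One step of the local CUSUM recursion for a N(0, 1) -> N(mu, 1) mean shift."""
    # The increment is the log-likelihood ratio of one observation x.
    return max(0.0, w + mu * x - 0.5 * mu * mu)


def bh_num_rejections(pvals, alpha):
    """Benjamini-Hochberg step-up rule: number of rejections at FDR level alpha."""
    m = len(pvals)
    r_hat = 0
    for i, p in enumerate(sorted(pvals), start=1):
        if p <= alpha * i / m:
            r_hat = i  # keep the largest index that passes its threshold
    return r_hat


def adaptive_top_r(stats, null_stats, alpha):
    """Estimate r with BH on empirical p-values, then sum the r largest statistics."""
    null_sorted = sorted(null_stats)
    n = len(null_sorted)
    # Empirical p-value: share of pre-change reference statistics >= the observed one.
    # Assumption: null_stats comes from simulating the local statistic before any change.
    pvals = [(n - bisect_left(null_sorted, w) + 1) / (n + 1) for w in stats]
    r_hat = bh_num_rejections(pvals, alpha)
    r_used = max(r_hat, 1)  # assumption: fall back to the single largest statistic
    return r_hat, sum(sorted(stats, reverse=True)[:r_used])
```

In use, each stream would update its own statistic with `cusum_update` at every time step, and a monitor would raise an alarm once the second value returned by `adaptive_top_r` exceeds a threshold calibrated to the desired false alarm rate. That threshold, and the way p-values are obtained in the article, are not specified in the abstract and are left out of this sketch.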
Pages: 119-130
Number of pages: 12