An adaptive approach for online monitoring of large-scale data streams

被引:1
作者
Cao, Shuchen [1 ]
Zhang, Ruizhi [2 ]
机构
[1] Univ Nebraska Lincoln, Dept Stat, Lincoln, NE USA
[2] Univ Georgia, Dept Stat, Athens, GA USA
关键词
False discovery rate; CUSUM; quickest change detection; process control; FALSE DISCOVERY RATE; CHANGE-POINT DETECTION; CHANGEPOINT DETECTION; SCHEMES;
D O I
10.1080/24725854.2023.2281580
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this article, we propose an adaptive top-r method to monitor large-scale data streams where the change may affect a set of unknown data streams at some unknown time. Motivated by parallel and distributed computing, we propose to develop global monitoring schemes by parallel running local detection procedures and then use the Benjamin-Hochberg false discovery rate control procedure to estimate the number of changed data streams adaptively. Our approach is illustrated in two concrete examples: one is a homogeneous case when all data streams are independent and identically distributed with the same known pre-change and post-change distributions. The other is when all data are normally distributed, and the mean shifts are unknown and can be positive or negative. Theoretically, we show that when the pre-change and post-change distributions are completely specified, our proposed method can estimate the number of changed data streams for both the pre-change and post-change status. Moreover, we perform simulations and two case studies to show its detection efficiency.
引用
收藏
页码:119 / 130
页数:12
相关论文
共 43 条
[1]  
[Anonymous], 2006, Statistical Methodology
[2]  
[Anonymous], 1963, Theory Probab. Appl., DOI [10.1137/1108002, DOI 10.1137/1108002]
[3]  
Apley DW, 2002, J QUAL TECHNOL, V34, P80
[4]   A step-down multiple hypotheses testing procedure that controls the false discovery rate under independence [J].
Benjamini, Y ;
Liu, W .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1999, 82 (1-2) :163-170
[5]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[6]   OPTIMAL SEQUENTIAL DETECTION IN MULTI-STREAM DATA [J].
Chan, Hock Peng .
ANNALS OF STATISTICS, 2017, 45 (06) :2736-2763
[7]   A False Discovery Rate Oriented Approach to Parallel Sequential Change Detection Problems [J].
Chen, Jie ;
Zhang, Wenyi ;
Poor, H. Vincent .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (68) :1823-1836
[8]   Second-Order Asymptotic Optimality in Multisensor Sequential Change Detection [J].
Fellouris, Georgios ;
Sokolov, Grigory .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2016, 62 (06) :3662-3675
[9]   Non-restarting cumulative sum charts and control of the false discovery rate [J].
Gandy, Axel ;
Lau, F. Din-Houn .
BIOMETRIKA, 2013, 100 (01) :261-268
[10]   AN ADAPTIVE STEP-DOWN PROCEDURE WITH PROVEN FDR CONTROL UNDER INDEPENDENCE [J].
Gavrilov, Yulia ;
Benjamini, Yoav ;
Sarkar, Sanat K. .
ANNALS OF STATISTICS, 2009, 37 (02) :619-629