Change Detection in Streaming Multivariate Data Using Likelihood Detectors

被引:79
作者
Kuncheva, Ludmila I. [1 ]
机构
[1] Bangor Univ, Sch Comp Sci, Bangor LL57 1UT, Gwynedd, Wales
关键词
Change detection; multidimensional data streams; Hotelling's T-square; log-likelihood detector; DRIFT;
D O I
10.1109/TKDE.2011.226
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Change detection in streaming data relies on a fast estimation of the probability that the data in two consecutive windows come from different distributions. Choosing the criterion is one of the multitude of questions that need to be addressed when designing a change detection procedure. This paper gives a log-likelihood justification for two well-known criteria for detecting change in streaming multidimensional data: Kullback-Leibler (K-L) distance and Hotelling's T-square test for equal means (H). We propose a semiparametric log-likelihood criterion (SPLL) for change detection. Compared to the existing log-likelihood change detectors, SPLL trades some theoretical rigor for computation simplicity. We examine SPLL together with K-L and H on detecting induced change on 30 real data sets. The criteria were compared using the area under the respective Receiver Operating Characteristic (ROC) curve (AUC). SPLL was found to be on the par with H and better than K-L for the nonnormalized data, and better than both on the normalized data.
引用
收藏
页码:1175 / 1180
页数:6
相关论文
共 21 条
[1]  
Adams RP, 2007, TECHNICAL REPORT
[2]  
Aggarwal Charu C, 2007, Data Streams: Models and Algorithms, V31
[3]  
[Anonymous], 2007, Uci machine learning repository
[4]  
Basseville M, 1993, DETECTION ABRUPT CHA
[5]  
Bifet A, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P443
[6]  
Dasu T., 2006, PROC 38TH SYMP INTER
[7]  
Everitt B.S., 2001, A Handbook of Statistical Analysis Using S-PLUS, V2nd
[8]  
Fawcett T., 2003, TECHNICAL REPORT HPL
[9]  
Gama J, 2004, LECT NOTES ARTIF INT, V3171, P286
[10]  
Ho S.-S., 2005, P 22 INT C MACH LEAR, P321, DOI DOI 10.1145/1102351.1102392