A comparative study on concept drift detectors

Cited by: 131
Authors
Goncalves, Paulo M., Jr. [1]
de Carvalho Santos, Silas G. T. [2]
Barros, Roberto S. M. [2]
Vieira, Davi C. L. [2]
Affiliations
[1] Inst Fed Educ Ciencia & Tecnol Pernambuco, Recife, PE, Brazil
[2] Univ Fed Pernambuco, Ctr Informat, Recife, PE, Brazil
Keywords
Data streams; Time-changing data; Concept drift detectors; Comparison; CLASSIFIERS; CHARTS;
DOI
10.1016/j.eswa.2014.07.019
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In data stream environments, drift detection methods are used to identify when the context has changed. This paper evaluates eight different concept drift detectors (DDM, EDDM, PHT, STEPD, DOF, ADWIN, Paired Learners, and ECDD) and performs tests using artificial datasets affected by abrupt and gradual concept drifts, with several rates of drift, with and without noise and irrelevant attributes, and also using real-world datasets. In addition, a 2^k factorial design was used to indicate the parameters that most influence performance, which is a novelty in the area. Also, a variation of the Friedman non-parametric statistical test was used to identify the best methods. Experiments compared accuracy, evaluation time, as well as false alarm and miss detection rates. Additionally, we used the Mahalanobis distance to measure how similar the methods are when compared to the best possible detection output. This work can, to some extent, also be seen as a research survey of existing drift detection methods. (C) 2014 Elsevier Ltd. All rights reserved.
Pages: 8144-8156
Page count: 13