A comparative study on concept drift detectors

被引:131
作者
Goncalves, Paulo M., Jr. [1 ]
de Carvalho Santos, Silas G. T. [2 ]
Barros, Roberto S. M. [2 ]
Vieira, Davi C. L. [2 ]
机构
[1] Inst Fed Educ Ciencia & Tecnol Pernambuco, Recife, PE, Brazil
[2] Univ Fed Pernambuco, Ctr Informat, Recife, PE, Brazil
关键词
Data streams; Time-changing data; Concept drift detectors; Comparison; CLASSIFIERS; CHARTS;
D O I
10.1016/j.eswa.2014.07.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In data stream environments, drift detection methods are used to identify when the context has changed. This paper evaluates eight different concept drift detectors (Dom, EDDM, PHT, STEPD, DOF, ADWIN, Paired Learners, and ECDD) and performs tests using artificial datasets affected by abrupt and gradual concept drifts, with several rates of drift, with and without noise and irrelevant attributes, and also using real-world datasets. In addition, a 2(k) factorial design was used to indicate the parameters that most influence performance which is a novelty in the area. Also, a variation of the Friedman non-parametric statistical test was used to identify the best methods. Experiments compared accuracy, evaluation time, as well as false alarm and miss detection rates. Additionally, we used the Mahalanobis distance to measure how similar the methods are when compared to the best possible detection output. This work can, to some extent, also be seen as a research survey of existing drift detection methods. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:8144 / 8156
页数:13
相关论文
共 33 条
[1]  
[Anonymous], 2004, LECT NOTES COMPUTER
[2]  
[Anonymous], 1963, DISTRIBUTION FREE MU
[3]  
[Anonymous], 1936, P NATL I SCI INDIA, DOI DOI 10.1007/S13171-019-00164-5
[4]   Paired Learners for Concept Drift [J].
Bach, Stephen H. ;
Maloof, Marcus A. .
ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, :23-32
[5]  
Baena-Garcia M, 2006, 4 INT WORKSH KNOWL D, V6, P77
[6]  
Bifet A, 2010, J MACH LEARN RES, V11, P1601
[7]   Fast Perceptron Decision Tree Learning from Evolving Data Streams [J].
Bifet, Albert ;
Holmes, Geoff ;
Pfahringer, Bernhard ;
Frank, Eibe .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II, PROCEEDINGS, 2010, 6119 :299-310
[8]  
Bifet A, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P139
[9]  
Bifet A, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P443
[10]   A case-based technique for tracking concept drift in spam filtering [J].
Delany, SJ ;
Cunningham, P ;
Tsymbal, A ;
Coyle, L .
KNOWLEDGE-BASED SYSTEMS, 2005, 18 (4-5) :187-195