EOCD: An ensemble optimization approach for concept drift applications q

被引:11
作者
Feitosa Neto, Antonino [1 ]
Canuto, Anne M. P. [2 ]
机构
[1] Univ Fed Rio Grande do Norte, Postgrad Program Computat & Syst, Natal, RN, Brazil
[2] Univ Fed Rio Grande do Norte, Dept Informat & Appl Math, Natal, RN, Brazil
关键词
Classifier ensembles; Data streams; Concept drifts; Optimization techniques; CLASSIFIERS;
D O I
10.1016/j.ins.2021.01.051
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses pattern classification problems in the data stream context. In this context, the learning concepts may change or evolve and this is called concept drift. These drifts are divided into real or virtual, depending on the variables in which a drift occurs. The real concept drifts consist of a change in the instance labels. In other words, an instance may have its label changed during its processing. Similarly, a virtual concept drift consists of changes in the attributes of instances over time, without changing their labels [10]. In this paper, we will work with real drifts. Independently of being real or virtual, drifts can also be divided into abrupt and gradual, depending on the speed that Data streams applications generate a continuous stream of data in a high rate that it is not possible to store all data in available memory. Hence, it is important to apply techniques that are capable of learning concepts according to data presentation, taking into consider-ation available time, processing and memory resources. This paper presents a new ensemble-based approach to detect concept drift in the data stream context. This approach uses an explicit mechanism to adapt to concept drifts using a genetic algorithm in order to define the best ensemble configuration for the current scenario (detected drifts). The main aim of this approach is to provide an efficient structure to detect and to adapt to concept drifts and, as a consequence, to improve the performance of this system. Our findings show that the proposed method delivers outstanding results, outperforming the results delivered by the most efficient methods presented in the literature. (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:81 / 100
页数:20
相关论文
共 30 条
[1]  
[Anonymous], 2012, ARXIV12066422
[2]   RDDM: Reactive drift detection method [J].
Barros, Roberto S. M. ;
Cabral, Danilo R. L. ;
Goncalves, Paulo M., Jr. ;
Santos, Silas G. T. C. .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 90 :344-355
[3]  
Bifet A, 2010, LECT NOTES ARTIF INT, V6321, P135, DOI 10.1007/978-3-642-15880-3_15
[4]  
Bifet A, 2010, J MACH LEARN RES, V11, P1601
[5]  
Bifet A, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P443
[6]   Combining block-based and online methods in learning ensembles from concept drifting data streams [J].
Brzezinski, Dariusz ;
Stefanowski, Jerzy .
INFORMATION SCIENCES, 2014, 265 :50-67
[7]   Kappa Updated Ensemble for drifting data stream mining [J].
Cano, Alberto ;
Krawczyk, Bartosz .
MACHINE LEARNING, 2020, 109 (01) :175-218
[8]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[9]  
Domingos P., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P71, DOI 10.1145/347090.347107
[10]   An adaptive distributed ensemble approach to mine concept-drifting data streams [J].
Folino, Gianluigi ;
Pizzuti, Clara ;
Spezzano, Giandomenico .
19TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL II, PROCEEDINGS, 2007, :183-187