Concept drift type identification based on multi-sliding windows

被引:33
作者
Guo, Husheng [1 ,2 ]
Li, Hai [1 ]
Ren, Qiaoyan [1 ]
Wang, Wenjian [1 ,2 ]
机构
[1] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Shanxi, Peoples R China
[2] Shanxi Univ, Key Lab Computat Intelligence & Chinese Informat, Minist Educ, Taiyuan 030006, Shanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Streaming data; Concept drift; Sliding window; Drift category; Drift subcategory; EVOLVING FUZZY; DATA STREAMS; CLASSIFICATION; ONLINE; REGRESSION; KNOWLEDGE;
D O I
10.1016/j.ins.2021.11.023
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Concept drift is a common and important issue in streaming data analysis and mining. Thus far, many concept drift detection methods have been proposed but may not be able to identify the type of concept drift, which will result in some difficulties, such as extracting the wrong key information, inadequate model learning and poor detection efficiency. To solve these problems, a concept drift type identification method is proposed based on multi sliding windows (CDT_MSW). This method consists of three processes. During the first detection process, the drift position is detected by sliding the basic window forward. Then, in the growth process, the drift length is detected using the growth of the adjoint window, and the drift category is identified according to the drift length. Finally, during tracking process, the drift subcategory can be accurately identified according to the different tracking flow ratio curves generated during window tracking. Experimental results show that the proposed method can effectively identify the type of concept drift, accurately analyze the key information during online learning and improve the efficiency and generalization performance of streaming data analysis and mining. (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:1 / 23
页数:23
相关论文
共 47 条
[1]  
Ali P, 2017, P 2018 INT JOINT C N
[2]  
[Anonymous], 2018, P AUSTR COMP SCI WEE
[3]  
[Anonymous], 2008, P JOINT CONV HOKK CH
[4]  
Bifet A, 2010, J MACH LEARN RES, V11, P1601
[5]   A cluster-based intelligence ensemble learning method for classification problems [J].
Cui, Shaoze ;
Wang, Yanzhang ;
Yin, Yunqiang ;
Cheng, T. C. E. ;
Wang, Dujuan ;
Zhai, Mingyu .
INFORMATION SCIENCES, 2021, 560 :386-409
[6]   Micro-MetaStream: Algorithm selection for time-changing data [J].
Debiaso Rossi, Andre Luis ;
Soares, Carlos ;
de Souza, Bruno Feres ;
Ponce de Leon Ferreira de Carvalho, Andre Carlos .
INFORMATION SCIENCES, 2021, 565 :262-277
[7]   Learning in Nonstationary Environments: A Survey [J].
Ditzler, Gregory ;
Roveri, Manuel ;
Alippi, Cesare ;
Polikar, Robi .
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2015, 10 (04) :12-25
[8]  
Dongre PB, 2014, IEEE INT ADV COMPUT, P533
[9]   Implementation of an Evolving Fuzzy Model (eFuMo) in a Monitoring System for a Waste-Water Treatment Process [J].
Dovzan, Dejan ;
Logar, Vito ;
Skrjanc, Igor .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2015, 23 (05) :1761-1776
[10]   Detecting concept drift: An information entropy based method using an adaptive sliding window [J].
Du, Lei ;
Song, Qinbao ;
Jia, Xiaolin .
INTELLIGENT DATA ANALYSIS, 2014, 18 (03) :337-364