Dynamic ensemble selection classification algorithm based on window over imbalanced drift data stream

被引:8
|
作者
Han, Meng [1 ]
Zhang, Xilong [1 ]
Chen, Zhiqiang [1 ]
Wu, Hongxin [1 ]
Li, Muhang [1 ]
机构
[1] North Minzu Univ, Sch Comp Sci & Engn, Yinchuan, Ningxia, Peoples R China
关键词
Data stream; Imbalance data; Concept drift; Window sampling; Ensemble classification;
D O I
10.1007/s10115-022-01791-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream classification is an important research direction in the field of data mining, but in many practical applications, it is impossible to collect the complete training set at one time, and the data may be in an imbalanced state and interspersed with concept drift, which will greatly affect the classification performance. To this end, an online dynamic ensemble selection classification algorithm based on window over imbalanced drift data stream (DESW-ID) is proposed. The algorithm employs various balancing measures, first resampling the data stream using Poisson distribution, and if it is in a highly imbalanced state then secondary sampling is performed using a window storing a minority class instances to achieve the current balanced state of the data. To improve the processing efficiency of the algorithm, a classifier selection ensemble is proposed to dynamically adjust the number of classifiers, and the algorithm runs with an ADWIN detector to detect the presence of concept drift. The experimental results show that the proposed algorithm ranks first on average in all five classification performance metrics compared to the state-of-the-art methods. Therefore, the proposed algorithm has better classification performance for imbalanced data streams with concept drift and also improves the operation efficiency of the algorithm.
引用
收藏
页码:1105 / 1128
页数:24
相关论文
共 50 条
  • [1] Dynamic ensemble selection classification algorithm based on window over imbalanced drift data stream
    Meng Han
    Xilong Zhang
    Zhiqiang Chen
    Hongxin Wu
    Muhang Li
    Knowledge and Information Systems, 2023, 65 : 1105 - 1128
  • [2] Data Preprocessing and Dynamic Ensemble Selection for Imbalanced Data Stream Classification
    Zyblewski, Pawel
    Sabourin, Robert
    Wozniak, Michal
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 1168 : 367 - 379
  • [3] Dynamic Ensemble Selection for Imbalanced Data Stream Classification with Limited Label Access
    Zyblewski, Pawel
    Wozniak, Michal
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2021), PT II, 2021, 12855 : 217 - 226
  • [4] Dynamic Ensemble Selection for Imbalanced Data Streams With Concept Drift
    Jiao, Botao
    Guo, Yinan
    Gong, Dunwei
    Chen, Qiuju
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 1278 - 1291
  • [5] Selection-based resampling ensemble algorithm for nonstationary imbalanced stream data learning
    Ren, Siqi
    Zhu, Wen
    Liao, Bo
    Li, Zeng
    Wang, Peng
    Li, Keqin
    Chen, Min
    Li, Zejun
    KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 705 - 722
  • [6] BASWE: Balanced Accuracy-Based Sliding Window Ensemble for Classification in Imbalanced Data Streams with Concept Drift
    de Oliveira, Douglas Amorim
    Delgado, Karina Valdivia
    Lauretto, Marcelo de Souza
    INTELLIGENT SYSTEMS, BRACIS 2024, PT I, 2025, 15412 : 231 - 246
  • [7] An online ensemble classification algorithm for multi-class imbalanced data stream
    Han, Meng
    Li, Chunpeng
    Meng, Fanxing
    He, Feifei
    Zhang, Ruihua
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (11) : 6845 - 6880
  • [8] Ensemble classification algorithm based improved SMOTE for imbalanced data
    Ning, Liu, 1600, Natsional'nyi Hirnychyi Universytet
  • [9] Handling imbalanced data with concept drift by applying dynamic sampling and ensemble classification model
    Ancy, S.
    Paulraj, D.
    COMPUTER COMMUNICATIONS, 2020, 153 : 553 - 560
  • [10] Online ensemble learning algorithm for imbalanced data stream
    Hongle, Du
    Yan, Zhang
    Gang, Ke
    Lin, Zhang
    Chen, Yeh-Cheng
    APPLIED SOFT COMPUTING, 2021, 107