Dynamic ensemble selection classification algorithm based on window over imbalanced drift data stream

Cited by: 8
Authors
Han, Meng [1]
Zhang, Xilong [1]
Chen, Zhiqiang [1]
Wu, Hongxin [1]
Li, Muhang [1]
Affiliation
[1] North Minzu Univ, Sch Comp Sci & Engn, Yinchuan, Ningxia, Peoples R China
Keywords
Data stream; Imbalance data; Concept drift; Window sampling; Ensemble classification;
DOI
10.1007/s10115-022-01791-5
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data stream classification is an important research direction in data mining. In many practical applications, however, the complete training set cannot be collected at one time, and the data may be imbalanced and interspersed with concept drift, both of which can greatly degrade classification performance. To this end, an online dynamic ensemble selection classification algorithm based on window over imbalanced drift data stream (DESW-ID) is proposed. The algorithm applies several balancing measures: it first resamples the data stream using a Poisson distribution and, if the stream is highly imbalanced, performs a secondary sampling from a window that stores minority class instances so that the current data reach a balanced state. To improve processing efficiency, a classifier selection ensemble is proposed that dynamically adjusts the number of classifiers, and the algorithm runs with an ADWIN detector to detect concept drift. Experimental results show that the proposed algorithm ranks first on average on all five classification performance metrics compared with state-of-the-art methods. The proposed algorithm therefore achieves better classification performance on imbalanced data streams with concept drift while also improving operating efficiency.
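The two-stage balancing described in the abstract can be illustrated with a minimal sketch. This is not the authors' DESW-ID implementation: the class name WindowBalancer, the fixed Poisson rate of 1.0, the imbalance threshold of 0.2, and the rule of replaying one stored minority instance per majority instance are all assumptions chosen for illustration, since the abstract does not give these details. Drift detection and classifier selection are omitted.

```python
import math
import random
from collections import deque


def poisson(lam, rng):
    """Draw one Poisson(lam) sample using Knuth's multiplication method."""
    limit = math.exp(-lam)
    k, p = 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k


class WindowBalancer:
    """Hypothetical two-stage resampler: Poisson copies plus a minority window."""

    def __init__(self, minority_label, window_size=50,
                 imbalance_threshold=0.2, seed=0):
        self.minority_label = minority_label
        self.imbalance_threshold = imbalance_threshold  # assumed cut-off
        self.window = deque(maxlen=window_size)  # recent minority instances
        self.counts = {}                         # instances seen per class
        self.rng = random.Random(seed)

    def minority_ratio(self):
        total = sum(self.counts.values())
        return self.counts.get(self.minority_label, 0) / total

    def resample(self, x, y):
        """Return the (x, y) pairs a base learner would be trained on."""
        self.counts[y] = self.counts.get(y, 0) + 1
        if y == self.minority_label:
            self.window.append((x, y))

        # Stage 1: Poisson resampling (rate 1.0 is an assumption).
        batch = [(x, y)] * poisson(1.0, self.rng)

        # Stage 2: if the stream is highly imbalanced, replay one stored
        # minority instance alongside each incoming majority instance.
        highly_imbalanced = self.minority_ratio() < self.imbalance_threshold
        if y != self.minority_label and highly_imbalanced and self.window:
            batch.append(self.rng.choice(list(self.window)))
        return batch
```

In this sketch, calling resample(x, y) once per arriving instance yields zero or more training pairs. The bounded deque means that only the most recent minority instances are replayed, which keeps the replayed data close to the current concept when the stream drifts.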
Pages: 1105 - 1128
Page count: 24
Related papers
50 in total
  • [41] G-mean Weighted Classification Method for Imbalanced Data Stream with Concept Drift
    Liang B.
    Li G.
    Dai C.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (12): 2844 - 2857
  • [42] Ensemble Approach for the Classification of Imbalanced Data
    Nikulin, Vladimir
    McLachlan, Geoffrey J.
    Ng, Shu Kay
    AI 2009: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5866 : 291 - +
  • [43] Employing One-Class SVM Classifier Ensemble for Imbalanced Data Stream Classification
    Klikowski, Jakub
    Wozniak, Michal
    COMPUTATIONAL SCIENCE - ICCS 2020, PT IV, 2020, 12140 : 117 - 127
  • [44] Dynamic weighted selective ensemble learning algorithm for imbalanced data streams
    Zhang Yan
    Du Hongle
    Ke Gang
    Zhang Lin
    Yeh-Cheng Chen
    The Journal of Supercomputing, 2022, 78 : 5394 - 5419
  • [45] Dynamic weighted selective ensemble learning algorithm for imbalanced data streams
    Yan, Zhang
    Du Hongle
    Gang, Ke
    Lin, Zhang
    Chen, Yeh-Cheng
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (04): 5394 - 5419
  • [46] A dynamic ensemble algorithm for anomaly detection in IoT imbalanced data streams
    Jiang, Jun
    Liu, Fagui
    Liu, Yongheng
    Tang, Quan
    Wang, Bin
    Zhong, Guoxiang
    Wang, Weizheng
    COMPUTER COMMUNICATIONS, 2022, 194 : 250 - 257
  • [47] Recurring Drift Detection and Model Selection-Based Ensemble Classification for Data Streams with Unlabeled Data
    Peipei Li
    Man Wu
    Junhong He
    Xuegang Hu
    New Generation Computing, 2021, 39 : 341 - 376
  • [48] Recurring Drift Detection and Model Selection-Based Ensemble Classification for Data Streams with Unlabeled Data
    Li, Peipei
    Wu, Man
    He, Junhong
    Hu, Xuegang
    NEW GENERATION COMPUTING, 2021, 39 (02) : 341 - 376
  • [49] An Ensemble Classification Model Based on Imbalanced Data for Aviation Safety
    NI Xiaomei
    WANG Huawei
    LV Shaolan
    XIONG Minglan
    Wuhan University Journal of Natural Sciences, 2021, 26 (05) : 437 - 443
  • [50] Spark-based ensemble learning for imbalanced data classification
    Ding J.
    Wang S.
    Jia L.
    You J.
    Jiang Y.
    International Journal of Performability Engineering, 2018, 14 (05) : 945 - 964