Dynamic ensemble selection classification algorithm based on window over imbalanced drift data stream

被引:8
|
作者
Han, Meng [1 ]
Zhang, Xilong [1 ]
Chen, Zhiqiang [1 ]
Wu, Hongxin [1 ]
Li, Muhang [1 ]
机构
[1] North Minzu Univ, Sch Comp Sci & Engn, Yinchuan, Ningxia, Peoples R China
关键词
Data stream; Imbalance data; Concept drift; Window sampling; Ensemble classification;
D O I
10.1007/s10115-022-01791-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream classification is an important research direction in the field of data mining, but in many practical applications, it is impossible to collect the complete training set at one time, and the data may be in an imbalanced state and interspersed with concept drift, which will greatly affect the classification performance. To this end, an online dynamic ensemble selection classification algorithm based on window over imbalanced drift data stream (DESW-ID) is proposed. The algorithm employs various balancing measures, first resampling the data stream using Poisson distribution, and if it is in a highly imbalanced state then secondary sampling is performed using a window storing a minority class instances to achieve the current balanced state of the data. To improve the processing efficiency of the algorithm, a classifier selection ensemble is proposed to dynamically adjust the number of classifiers, and the algorithm runs with an ADWIN detector to detect the presence of concept drift. The experimental results show that the proposed algorithm ranks first on average in all five classification performance metrics compared to the state-of-the-art methods. Therefore, the proposed algorithm has better classification performance for imbalanced data streams with concept drift and also improves the operation efficiency of the algorithm.
引用
收藏
页码:1105 / 1128
页数:24
相关论文
共 50 条
  • [21] An ensemble method for data stream classification in the presence of concept drift
    Omid Abbaszadeh
    Ali Amiri
    Ali Reza Khanteymoori
    Frontiers of Information Technology & Electronic Engineering, 2015, 16 : 1059 - 1068
  • [22] An ensemble method for data stream classification in the presence of concept drift
    Omid ABBASZADEH
    Ali AMIRI
    Ali Reza KHANTEYMOORI
    FrontiersofInformationTechnology&ElectronicEngineering, 2015, 16 (12) : 1059 - 1068
  • [23] Deterministic Concept Drift Detection in Ensemble Classifier Based Data Stream Classification Process
    Abdualrhman, Mohammed Ahmed Ali
    Padma, M. C.
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2019, 11 (01) : 29 - 48
  • [24] Adaptive Classification Algorithm for Concept Drift Data Stream
    Cai H.
    Lu K.
    Wu Q.
    Wu D.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (03): : 633 - 646
  • [25] A Dynamic Weighted Random Sampling Algorithm on Time-based Sliding Window over Data Stream
    Tang, Da
    Liu, Xiang
    Yue, Qianjin
    2011 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL, AND SYSTEMS SCIENCES, AND ENGINEERING (CESSE 2011), 2011, : 23 - +
  • [26] Application of Imbalanced Data Classification Quality Metrics as Weighting Methods of the Ensemble Data Stream Classification Algorithms
    Wegier, Weronika
    Ksieniewicz, Pawel
    ENTROPY, 2020, 22 (08)
  • [27] Imbalanced data classification based on improved EIWAPSO-AdaBoost-C ensemble algorithm
    Li, Xiao
    Li, Kewen
    APPLIED INTELLIGENCE, 2022, 52 (06) : 6477 - 6502
  • [28] Imbalanced Data Classification Method Based on Ensemble Learning
    Xiang, Yu
    Xie, Yongping
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 18 - 24
  • [29] Imbalanced data classification based on improved EIWAPSO-AdaBoost-C ensemble algorithm
    Xiao Li
    Kewen Li
    Applied Intelligence, 2022, 52 : 6477 - 6502
  • [30] DynED: Dynamic Ensemble Diversification in Data Stream Classification
    Abadifard, Soheil
    Bakhshi, Sepehr
    Gheibuni, Sanaz
    Can, Fazli
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 3707 - 3711