Changing Lineup Classifier Ensemble for Drifting Imbalanced Data Streams

被引:1
作者
Wegier, Weronika [1 ]
Maczynski, Maciej [1 ]
Wozniak, Michal [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Dept Syst & Comp Networks, Wroclaw, Poland
来源
2024 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW | 2024年
关键词
imbalanced data; nonstationary data streams; concept drift; classifier ensemble learning; SMOTE;
D O I
10.1109/ICDMW65004.2024.00037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Currently, most data requires stream processing. This kind of processing causes new challenges in classifier learning method, such as data difficulties for example data imbalance since we cannot observe the entire set of data simultaneously or concept drift, which is the possibility of changes in the probabilistic characteristics of the processed data. These challenges were addressed by proposing a new learning algorithm that maintains consistency with the current data distribution by training a classifier ensemble with the changing lineup and applying dedicated mechanisms to account for the imbalance of the incoming data stream. The proposed IMB-WAE (Imbalance Weighted Aging Ensemble) method has been evaluated on the basis of exhaustive computer experiments conducted on a large number of real and synthetic data streams assessing the impact of its parameters on classification quality and confirming its quality compared to the state-of-the-art algorithm.
引用
收藏
页码:238 / 245
页数:8
相关论文
共 28 条
[1]  
Bifet A., 2018, Machine learning for data streams: with practical examples in MOA
[2]  
Bifet A, 2010, LECT NOTES ARTIF INT, V6321, P135, DOI 10.1007/978-3-642-15880-3_15
[3]  
Bifet A, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P443
[4]  
Brzezinski D, 2011, LECT NOTES ARTIF INT, V6679, P155, DOI 10.1007/978-3-642-21222-2_19
[5]  
Bunkhumpornpat C, 2009, LECT NOTES ARTIF INT, V5476, P475, DOI 10.1007/978-3-642-01307-2_43
[6]   ROSE: robust online self-adjusting ensemble for continual learning on imbalanced drifting data streams [J].
Cano, Alberto ;
Krawczyk, Bartosz .
MACHINE LEARNING, 2022, 111 (07) :2561-2599
[7]   Kappa Updated Ensemble for drifting data stream mining [J].
Cano, Alberto ;
Krawczyk, Bartosz .
MACHINE LEARNING, 2020, 109 (01) :175-218
[8]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[9]   Towards incremental learning of nonstationary imbalanced data stream: a multiple selectively recursive approach [J].
Chen, Sheng ;
He, Haibo .
EVOLVING SYSTEMS, 2011, 2 (01) :35-50
[10]  
Cohen E., 2003, P 22 ACM SIGMOD SIGA, P223, DOI /10.1145/773153.773175