A comprehensive ensemble classification techniques detecting and managing concept drift in dynamic imbalanced data streams

被引:0
作者
Junaid, K. A. Mohamed [1 ]
Paulraj, D. [2 ]
Sethukarasi, T. [2 ]
机构
[1] R M K Engn Coll, Dept Elect & Commun Engn, Chennai, India
[2] R M K Engn Coll, Dept Comp Sci & Engn, Chennai, India
关键词
Machine learning; Ensemble classifier; Concept drift; Heterogeneous data stream;
D O I
10.1007/s11276-024-03742-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data stream mining is essential in various fields such as education, the Internet of Things (IoT), social media, entertainment, weather monitoring, and finance. This is due to the continuous and huge amount of data generated by applications in these sectors. Moreover, this data stream is prone to concept drift, in addition to showing characteristics of heterogeneity and imbalance. Contemporary methods for addressing unbalanced learning in data mining often employ classifiers that are tailored to the number of features required for categorization. The control of concept drift is an absolute necessity due to the ever-changing data distributions and the endless and rapid nature of the various data streams. Concept drift is an obstacle in heterogeneous stream data mining, marked by noticeable variations that can range from massive to more complex changes. When addressing drifts, conventional approaches often employ fixed-size blocks or windows, posing challenges in managing events that are in a continuous state of change. This paper introduces a novel approach called "Ensemble Classification Techniques Detecting and Managing Concept Drift in Dynamic and Imbalanced Data Streams" to address these issues. Our method aims to effectively adjust to different types of concept drift by providing a precise and flexible classification of distinct data streams. The suggested ensemble classifier is a valuable contribution to stream data mining, since it effectively addresses the intricate challenges associated with dynamic concept drifts. Experimental results proved that the proposed method has demonstrated superior performance compared to existing methods. According to the findings of the experiment, the proposed method obtains a precision of 69.28% and a recall rate of 69.54%, which gives it an advantage over other methods that produce results that are almost identical.
引用
收藏
页码:19 / 30
页数:12
相关论文
共 31 条
[11]  
Apache Kafka, 2018, "Apache Kafka
[12]  
archive.ics.uci, PIMA INDIANS DIABETE
[13]  
Bifet A, 2010, J MACH LEARN RES, V11, P1601
[14]   A stream processing architecture for heterogeneous data sources in the Internet of Things [J].
Corral-Plaza, David ;
Medina-Bulo, Inmaculada ;
Ortiz, Guadalupe ;
Boubeta-Puig, Juan .
COMPUTER STANDARDS & INTERFACES, 2020, 70
[15]   Trend Analysis and Prediction on Water Consumption in Southwestern Ethiopia [J].
Enbeyle, Wegayehu ;
Hamad, Abdulsttar Abdullah ;
Al-Obeidi, Ahmed S. ;
Abebaw, Solomon ;
Belay, Assaye ;
Markos, Admasu ;
Abate, Lema ;
Derebew, Bizuwork .
JOURNAL OF NANOMATERIALS, 2022, 2022
[16]   Transfer and online learning for IP maliciousness prediction in a concept drift scenario [J].
Garcia, David Escudero ;
DeCastro-Garcia, Noemi .
WIRELESS NETWORKS, 2024, 30 (09) :7423-7444
[17]   A Machine Learning Approach for Rainfall Estimation Integrating Heterogeneous Data Sources [J].
Guarascio, Massimo ;
Folino, Gianluigi ;
Chiaravalloti, Francesco ;
Gabriele, Salvatore ;
Procopio, Antonio ;
Sabatino, Pietro .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[18]   Learning Methods of Business Intelligence and Group Related Diagnostics on Patient Management by Using Artificial Dynamic System [J].
Hamad, Abdulsattar Abdullah ;
Abdulridha, Mustafa Mahdi ;
Kadhim, Noor Mohammed ;
Pushparaj, S. ;
Meenakshi, R. ;
Ibrahim, Abdelrahman Mohamed .
JOURNAL OF NANOMATERIALS, 2022, 2022
[19]   Adversarial defense method based on ensemble learning for modulation signal intelligent recognition [J].
Han, Chao ;
Qin, Ruoxi ;
Wang, Linyuan ;
Cui, Weijia ;
Chen, Jian ;
Yan, Bin .
WIRELESS NETWORKS, 2023, 29 (07) :2967-2980
[20]   A Metadata-Assisted Cascading Ensemble Classification Framework for Automatic Annotation of Open IoT Data [J].
Montori, Federico ;
Liao, Kewen ;
De Giosa, Matteo ;
Jayaraman, Prem Prakash ;
Bononi, Luciano ;
Sellis, Timos ;
Georgakopoulos, Dimitrios .
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (15) :13401-13413