A comprehensive ensemble classification techniques detecting and managing concept drift in dynamic imbalanced data streams

被引:0
作者
Junaid, K. A. Mohamed [1 ]
Paulraj, D. [2 ]
Sethukarasi, T. [2 ]
机构
[1] R M K Engn Coll, Dept Elect & Commun Engn, Chennai, India
[2] R M K Engn Coll, Dept Comp Sci & Engn, Chennai, India
关键词
Machine learning; Ensemble classifier; Concept drift; Heterogeneous data stream;
D O I
10.1007/s11276-024-03742-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data stream mining is essential in various fields such as education, the Internet of Things (IoT), social media, entertainment, weather monitoring, and finance. This is due to the continuous and huge amount of data generated by applications in these sectors. Moreover, this data stream is prone to concept drift, in addition to showing characteristics of heterogeneity and imbalance. Contemporary methods for addressing unbalanced learning in data mining often employ classifiers that are tailored to the number of features required for categorization. The control of concept drift is an absolute necessity due to the ever-changing data distributions and the endless and rapid nature of the various data streams. Concept drift is an obstacle in heterogeneous stream data mining, marked by noticeable variations that can range from massive to more complex changes. When addressing drifts, conventional approaches often employ fixed-size blocks or windows, posing challenges in managing events that are in a continuous state of change. This paper introduces a novel approach called "Ensemble Classification Techniques Detecting and Managing Concept Drift in Dynamic and Imbalanced Data Streams" to address these issues. Our method aims to effectively adjust to different types of concept drift by providing a precise and flexible classification of distinct data streams. The suggested ensemble classifier is a valuable contribution to stream data mining, since it effectively addresses the intricate challenges associated with dynamic concept drifts. Experimental results proved that the proposed method has demonstrated superior performance compared to existing methods. According to the findings of the experiment, the proposed method obtains a precision of 69.28% and a recall rate of 69.54%, which gives it an advantage over other methods that produce results that are almost identical.
引用
收藏
页码:19 / 30
页数:12
相关论文
共 31 条
[21]  
Nedham Wisal Bassim, 2022, 2022 INT C NATURAL A, P23, DOI DOI 10.1109/ICNAS55512.2022.9944716
[22]  
Oza NikunjC., 2001, Proceedings of the seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P359, DOI DOI 10.1145/502512.502565
[23]   Deep Ensemble-Based Classifier for Transfer Learning in Rotating Machinery Fault Diagnosis [J].
Pacheco, Fannia ;
Drimus, Alin ;
Duggen, Lars ;
Cerrada, Mariela ;
Cabrera, Diego ;
Sanchez, Rene-Vinicio .
IEEE ACCESS, 2022, 10 :29778-29787
[24]  
Pelossof R., 2008, ONLINE COORDINATE BO, P19
[25]  
Saeedi Iman Dakhil Idan, 2022, AIP Conference Proceedings, V2398, DOI 10.1063/5.0093971
[26]   Perceptually Important Points-Based Data Aggregation Method for Wireless Sensor Networks [J].
Saeedi, Iman Dakhil Idan ;
Al-Qurabat, Ali Kadhum M. .
BAGHDAD SCIENCE JOURNAL, 2022, 19 (04) :875-886
[27]   NMal-Droid: network-based android malware detection system using transfer learning and CNN-BiGRU ensemble [J].
Ullah, Farhan ;
Ullah, Shamsher ;
Srivastava, Gautam ;
Lin, Jerry Chun-Wei ;
Zhao, Yue .
WIRELESS NETWORKS, 2024, 30 (06) :6177-6198
[28]   IoT Assisted Machine Learning Model for Warehouse Management [J].
Wang, Lanjing ;
Hamad, Abdulsattar Abdullah ;
Sakthivel, V .
JOURNAL OF INTERCONNECTION NETWORKS, 2022, 22 (SUPP02)
[29]  
Wankhade K, 2012, 2012 2ND IEEE INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), P657, DOI 10.1109/PDGC.2012.6449898
[30]   A clustering and ensemble based classifier for data stream classification [J].
Wankhade, Kapil K. ;
Jondhale, Kalpana C. ;
Dongre, Snehlata S. .
APPLIED SOFT COMPUTING, 2021, 102