Hyper-Heuristic Framework for Sequential Semi-Supervised Classification Based on Core Clustering

被引:4
作者
Adnan, Ahmed [1 ]
Muhammed, Abdullah [1 ]
Abd Ghani, Abdul Azim [2 ]
Abdullah, Azizol [1 ]
Hakim, Fahrul [1 ]
机构
[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Dept Commun Technol & Networks, Serdang 43300, Malaysia
[2] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Dept Software Engn & Informat Syst, Serdang 43300, Malaysia
来源
SYMMETRY-BASEL | 2020年 / 12卷 / 08期
关键词
hyper-heuristic; extreme learning machine; genetic algorithm; online clustering; off-line clustering; evolving stream data; semi-supervised classification; EXTREME LEARNING MACHINES; BIG DATA; STREAM; ALGORITHM;
D O I
10.3390/sym12081292
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Existing stream data learning models with limited labeling have many limitations, most importantly, algorithms that suffer from a limited capability to adapt to the evolving nature of data, which is called concept drift. Hence, the algorithm must overcome the problem of dynamic update in the internal parameters or countering the concept drift. However, using neural network-based semi-supervised stream data learning is not adequate due to the need for capturing quickly the changes in the distribution and characteristics of various classes of the data whilst avoiding the effect of the outdated stored knowledge in neural networks (NN). This article presents a prominent framework that integrates each of the NN, a meta-heuristic based on evolutionary genetic algorithm (GA) and a core online-offline clustering (Core). The framework trains the NN on previously labeled data and its knowledge is used to calculate the error of the core online-offline clustering block. The genetic optimization is responsible for selecting the best parameters of the core model to minimize the error. This integration aims to handle the concept drift. We designated this model as hyper-heuristic framework for semi-supervised classification or HH-F. Experimental results of the application of HH-F on real datasets prove the superiority of the proposed framework over the existing state-of-the art approaches used in the literature for sequential classification data with evolving nature.
引用
收藏
页数:20
相关论文
共 37 条
[1]   High-Performance Extreme Learning Machines: A Complete Toolbox for Big Data Applications [J].
Akusok, Anton ;
Bjork, Kaj-Mikael ;
Miche, Yoan ;
Lendasse, Amaury .
IEEE ACCESS, 2015, 3 :1011-1025
[2]  
Albadra M.A.A., 2017, INT J APPL ENG RES, V12, P4610, DOI DOI 10.37622/000000
[3]   MuDi-Stream: A multi density clustering algorithm for evolving data stream [J].
Amini, Amineh ;
Saboohi, Hadi ;
Herawan, Tutut ;
Teh Ying Wah .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 59 :370-385
[4]  
[Anonymous], ADV NEURAL INFORM PR
[5]  
BAI L, 2016, IEEE T KNOWL DATA EN, V0028, P02871, DOI DOI 10.1109/TKDE.2016.2594068
[6]   Social big data: Recent achievements and new challenges [J].
Bello-Orgaz, Gema ;
Jung, Jason J. ;
Camacho, David .
INFORMATION FUSION, 2016, 28 :45-59
[7]   An iterative boosting-based ensemble for streaming data classification [J].
Bertini Junior, Joao Roberto ;
Nicoletti, Maria do Carmo .
INFORMATION FUSION, 2019, 45 :66-78
[8]   Consensus Classification Using Non-Optimized Classifiers [J].
Brownfield, Brett ;
Lemos, Tony ;
Kalivas, John H. .
ANALYTICAL CHEMISTRY, 2018, 90 (07) :4429-4437
[9]   Data Stream Classification by Dynamic Incremental Semi-Supervised Fuzzy Clustering [J].
Casalino, Gabriella ;
Castellano, Giovanna ;
Mencar, Corrado .
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2019, 28 (08)
[10]   Data Mining for the Internet of Things: Literature Review and Challenges [J].
Chen, Feng ;
Deng, Pan ;
Wan, Jiafu ;
Zhang, Daqiang ;
Vasilakos, Athanasios V. ;
Rong, Xiaohui .
INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2015,