Hyper-Heuristic Framework for Sequential Semi-Supervised Classification Based on Core Clustering

被引:4
作者
Adnan, Ahmed [1 ]
Muhammed, Abdullah [1 ]
Abd Ghani, Abdul Azim [2 ]
Abdullah, Azizol [1 ]
Hakim, Fahrul [1 ]
机构
[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Dept Commun Technol & Networks, Serdang 43300, Malaysia
[2] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Dept Software Engn & Informat Syst, Serdang 43300, Malaysia
来源
SYMMETRY-BASEL | 2020年 / 12卷 / 08期
关键词
hyper-heuristic; extreme learning machine; genetic algorithm; online clustering; off-line clustering; evolving stream data; semi-supervised classification; EXTREME LEARNING MACHINES; BIG DATA; STREAM; ALGORITHM;
D O I
10.3390/sym12081292
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Existing stream data learning models with limited labeling have many limitations, most importantly, algorithms that suffer from a limited capability to adapt to the evolving nature of data, which is called concept drift. Hence, the algorithm must overcome the problem of dynamic update in the internal parameters or countering the concept drift. However, using neural network-based semi-supervised stream data learning is not adequate due to the need for capturing quickly the changes in the distribution and characteristics of various classes of the data whilst avoiding the effect of the outdated stored knowledge in neural networks (NN). This article presents a prominent framework that integrates each of the NN, a meta-heuristic based on evolutionary genetic algorithm (GA) and a core online-offline clustering (Core). The framework trains the NN on previously labeled data and its knowledge is used to calculate the error of the core online-offline clustering block. The genetic optimization is responsible for selecting the best parameters of the core model to minimize the error. This integration aims to handle the concept drift. We designated this model as hyper-heuristic framework for semi-supervised classification or HH-F. Experimental results of the application of HH-F on real datasets prove the superiority of the proposed framework over the existing state-of-the art approaches used in the literature for sequential classification data with evolving nature.
引用
收藏
页数:20
相关论文
共 37 条
[31]  
Moustafa N, 2017, DATA ANAL DECISION S
[32]   An incremental intrusion detection system using a new semi-supervised stream classification method [J].
Noorbehbahani, Fakhroddin ;
Fanian, Ali ;
Mousavi, Rasoul ;
Hasannejad, Homa .
INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2017, 30 (04)
[33]   A Study on the Relationships of Classifier Performance Metrics [J].
Seliya, Naeem ;
Khoshgoftaar, Taghi M. ;
Van Hulse, Jason .
ICTAI: 2009 21ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, 2009, :59-+
[34]   A grid density based framework for classifying streaming data in the presence of concept drift [J].
Sethi, Tegjyot Singh ;
Kantardzic, Mehmed ;
Hu, Hanquing .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2016, 46 (01) :179-211
[35]   Large-scale cyber attacks monitoring using Evolving Cauchy Possibilistic Clustering [J].
Skrjanc, Igor ;
Ozawa, Seiichi ;
Ban, Tao ;
Dovzan, Dejan .
APPLIED SOFT COMPUTING, 2018, 62 :592-601
[36]  
Tavallaee M., 2009, P IEEE S COMP INT SE, P1, DOI DOI 10.1109/CISDA.2009.5356528
[37]   Electric Load Forecasting by Hybrid Self-Recurrent Support Vector Regression Model With Variational Mode Decomposition and Improved Cuckoo Search Algorithm [J].
Zhang, Zichen ;
Hong, Wei-Chiang ;
Li, Junchi .
IEEE ACCESS, 2020, 8 :14642-14658