Clustering-Based Ensembles as an Alternative to Stacking

被引:22
作者
Jurek, Anna [1 ]
Bi, Yaxin [1 ]
Wu, Shengli [1 ]
Nugent, Chris D. [1 ]
机构
[1] Univ Ulster, Sch Comp & Math, Newtownabbey BT37 0QB, Antrim, North Ireland
关键词
Combining classifiers; stacking; ensembles; clustering; meta-learning; semi-supervised classification; DYNAMIC CLASSIFIER SELECTION; COMBINING CLASSIFIERS; COMBINATION;
D O I
10.1109/TKDE.2013.49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the most popular techniques of generating classifier ensembles is known as stacking which is based on a meta-learning approach. In this paper, we introduce an alternative method to stacking which is based on cluster analysis. Similar to stacking, instances from a validation set are initially classified by all base classifiers. The output of each classifier is subsequently considered as a new attribute of the instance. Following this, a validation set is divided into clusters according to the new attributes and a small subset of the original attributes of the instances. For each cluster, we find its centroid and calculate its class label. The collection of centroids is considered as a meta-classifier. Experimental results show that the new method outperformed all benchmark methods, namely Majority Voting, Stacking J48, Stacking LR, AdaBoost J48, and Random Forest, in 12 out of 22 data sets. The proposed method has two advantageous properties: it is very robust to relatively small training sets and it can be applied in semi-supervised learning problems. We provide a theoretical investigation regarding the proposed method. This demonstrates that for the method to be successful, the base classifiers applied in the ensemble should have greater than 50% accuracy levels.
引用
收藏
页码:2120 / 2137
页数:18
相关论文
共 50 条
[31]   A clustering-based method for unsupervised intrusion detections [J].
Jiang, SY ;
Song, XY ;
Wang, H ;
Han, JJ ;
Li, QH .
PATTERN RECOGNITION LETTERS, 2006, 27 (07) :802-810
[32]   A Clustering-Based Ensemble Technique for Shape Decomposition [J].
Lewin, Sergej ;
Jiang, Xiaoyi ;
Clausing, Achim .
STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 :153-161
[33]   Clustering-based Communication Backbone for UAV Networks [J].
Yu, Hai ;
Huang, Hejiao ;
Jia, Xiaohua .
2018 14TH INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SENSOR NETWORKS (MSN 2018), 2018, :1-6
[34]   A Clustering-Based Approach to Reduce Feature Redundancy [J].
de Amorim, Renato Cordeiro ;
Mirkin, Boris .
KNOWLEDGE, INFORMATION AND CREATIVITY SUPPORT SYSTEMS: RECENT TRENDS, ADVANCES AND SOLUTIONS, KICSS 2013, 2016, 364 :465-475
[35]   Clustering-based hierarchical radiosity for dynamic environments [J].
Lee, WY ;
Chuang, JH .
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 1999, 15 (06) :815-832
[36]   Clustering-Based Compression for Population DNA Sequences [J].
Cheng, Kin-On ;
Law, Ngai-Fong ;
Siu, Wan-Chi .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (01) :208-221
[37]   Clustering-based selective neural network ensemble [J].
Fu Q. ;
Hu S.-X. ;
Zhao S.-Y. .
Journal of Zhejiang University-SCIENCE A, 2005, 6 (5) :387-392
[38]   Clustering-based Assignment within CoMP Systems [J].
Stancanelli, Elvis M. G. ;
Silva, Yuri C. B. ;
Maciel, Tarcisio F. ;
Freitas, Walter C., Jr. ;
Cavalcanti, Francisco R. P. .
2013 IEEE 24TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2013, :2270-2274
[39]   CLUSTERING-BASED METHODS FOR FAST EPITOME GENERATION [J].
Alain, Martin ;
Guillemot, Christine ;
Thoreau, Dominique ;
Guillotel, Philippe .
2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, :211-215
[40]   Spam Detection Using Clustering-Based SVM [J].
Pandya, Darshit .
PROCEEDINGS OF THE 2019 2ND INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND MACHINE INTELLIGENCE (MLMI 2019), 2019, :12-15