Clustering-Based Ensembles as an Alternative to Stacking

被引:22
作者
Jurek, Anna [1 ]
Bi, Yaxin [1 ]
Wu, Shengli [1 ]
Nugent, Chris D. [1 ]
机构
[1] Univ Ulster, Sch Comp & Math, Newtownabbey BT37 0QB, Antrim, North Ireland
关键词
Combining classifiers; stacking; ensembles; clustering; meta-learning; semi-supervised classification; DYNAMIC CLASSIFIER SELECTION; COMBINING CLASSIFIERS; COMBINATION;
D O I
10.1109/TKDE.2013.49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the most popular techniques of generating classifier ensembles is known as stacking which is based on a meta-learning approach. In this paper, we introduce an alternative method to stacking which is based on cluster analysis. Similar to stacking, instances from a validation set are initially classified by all base classifiers. The output of each classifier is subsequently considered as a new attribute of the instance. Following this, a validation set is divided into clusters according to the new attributes and a small subset of the original attributes of the instances. For each cluster, we find its centroid and calculate its class label. The collection of centroids is considered as a meta-classifier. Experimental results show that the new method outperformed all benchmark methods, namely Majority Voting, Stacking J48, Stacking LR, AdaBoost J48, and Random Forest, in 12 out of 22 data sets. The proposed method has two advantageous properties: it is very robust to relatively small training sets and it can be applied in semi-supervised learning problems. We provide a theoretical investigation regarding the proposed method. This demonstrates that for the method to be successful, the base classifiers applied in the ensemble should have greater than 50% accuracy levels.
引用
收藏
页码:2120 / 2137
页数:18
相关论文
共 50 条
  • [21] Automated clustering of ensembles of alternative models in protein structure databases
    Domingues, FS
    Rahnenführer, J
    Lengauer, T
    PROTEIN ENGINEERING DESIGN & SELECTION, 2004, 17 (06) : 537 - 543
  • [22] Clustering-based Automated Requirement Trace Retrieval
    Al-walidi, Nejood Hashim
    Azab, Shahira Shaaban
    Khamis, Abdelaziz
    Darwish, Nagy Ramadan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (12) : 783 - 792
  • [23] A Clustering-Based Approach to Kinetic Closest Pair
    Zahed Rahmati
    Timothy M. Chan
    Algorithmica, 2018, 80 : 2742 - 2756
  • [24] Survey on Clustering-Based Image Segmentation Techniques
    Zou, Yanni
    Liu, Bo
    2016 IEEE 20th International Conference on Computer Supported Cooperative Work in Design (CSCWD), 2016, : 106 - 110
  • [25] Clustering-Based Interpretation of Deep ReLU Network
    Picchiotti, Nicola
    Gori, Marco
    AIXIA 2021 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13196 : 403 - 412
  • [26] On hierarchical clustering-based approach for RDDBS design
    Hassan I. Abdalla
    Ali A. Amer
    Sri Devi Ravana
    Journal of Big Data, 10
  • [27] Fuzzy Clustering-Based Approach for Outlier Detection
    Al-Zoubi, Moh'd Belal
    Ali, Al-Dahoud
    Yahya, Abdelfatah A.
    RECENT ADVANCES AND APPLICATIONS OF COMPUTER ENGINEERING: PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE (ACE 10), 2010, : 192 - +
  • [28] A Clustering-Based Patient Grouper for Burn Care
    Onah, Chimdimma Noelyn
    Allmendinger, Richard
    Handl, Julia
    Yiapanis, Paraskevas
    Dunn, Ken W.
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING (IDEAL 2019), PT II, 2019, 11872 : 123 - 131
  • [29] A Clustering-Based Approach to Kinetic Closest Pair
    Rahmati, Zahed
    Chan, Timothy M.
    ALGORITHMICA, 2018, 80 (10) : 2742 - 2756
  • [30] Clustering-Based Denoising With Locally Learned Dictionaries
    Chatterjee, Priyam
    Milanfar, Peyman
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (07) : 1438 - 1451