Novel clustering-based pruning algorithms

被引:10
作者
Zyblewski, Pawel [1 ]
Wozniak, Michal [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Fac Elect, Dept Syst & Comp Networks, Wybrzeze Wyspianskiego 27, PL-50370 Wroclaw, Poland
关键词
Ensemble pruning; Classifier ensemble; Clustering; Multistage organization; ENSEMBLES; DIVERSITY;
D O I
10.1007/s10044-020-00867-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the crucial problems of designing a classifier ensemble is the proper choice of the base classifier line-up. Basically, such an ensemble is formed on the basis of individual classifiers, which are trained in such a way to ensure their high diversity or they are chosen on the basis of pruning which reduces the number of predictive models in order to improve efficiency and predictive performance of the ensemble. This work is focusing on clustering-based ensemble pruning, which looks for the group of similar classifiers which are replaced by their representatives. We propose a novel pruning criterion based on well-known diversity measures and describe three algorithms using classifier clustering. The first method selects the model with the best predictive performance from each cluster to form the final ensemble, the second one employs the multistage organization, where instead of removing the classifiers from the ensemble each classifier cluster makes the decision independently, while the third proposition combines multistage organization and sampling with replacement. The proposed approaches were evaluated using 30 datasets with different characteristics. Experimentation results validated through statistical tests confirmed the usefulness of the proposed approaches.
引用
收藏
页码:1049 / 1058
页数:10
相关论文
共 32 条
[1]  
Alcalá-Fdez J, 2011, J MULT-VALUED LOG S, V17, P255
[2]  
[Anonymous], 2005, J ZHEJIANG U SCI
[3]  
[Anonymous], 2001 IEEE INNS INT C
[4]  
[Anonymous], 15 INT C PATT REC IC
[5]   Clustering ensembles of neural network models [J].
Bakker, B ;
Heskes, T .
NEURAL NETWORKS, 2003, 16 (02) :261-269
[6]  
Bian S., 2007, INT J HYBRID INTELLI, V4, P103, DOI DOI 10.3233/HIS-2007-4204
[7]  
Cunningham P, 2000, LECT NOTES ARTIF INT, V1810, P109
[8]   A competitive ensemble pruning approach based on cross-validation technique [J].
Dai, Qun .
KNOWLEDGE-BASED SYSTEMS, 2013, 37 :394-414
[9]   An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization [J].
Dietterich, TG .
MACHINE LEARNING, 2000, 40 (02) :139-157
[10]  
Dua D., 2017, UCI machine learning repository