Considering diversity and accuracy simultaneously for ensemble pruning

被引:65
作者
Dai, Qun [1 ]
Ye, Rui [1 ]
Liu, Zhuan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Ensemble selection; Greedy ensemble pruning (GEP) algorithm; Simultaneous diversity & accuracy (SDAcc); Diversity-focused-two (DFTwo); Accuracy-reinforcement (AccRein); SELECTION; CLASSIFIERS; ALGORITHM; MACHINE; ERROR;
D O I
10.1016/j.asoc.2017.04.058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diversity among individual classifiers is widely recognized to be a key factor to successful ensemble selection, while the ultimate goal of ensemble pruning is to improve its predictive accuracy. Diversity and accuracy are two important properties of an ensemble. Existing ensemble pruning methods always consider diversity and accuracy separately. However, in contrast, the two closely interrelate with each other, and should be considered simultaneously. Accordingly, three new measures, i.e., Simultaneous Diversity & Accuracy, Diversity-Focused-Two and Accuracy-Reinforcement, are developed for pruning the ensemble by greedy algorithm. The motivation for Simultaneous Diversity & Accuracy is to consider the difference between the subensemble and the candidate classifier, and simultaneously, to consider the accuracy of both of them. With Simultaneous Diversity & Accuracy, those difficult samples are not given up so as to further improve the generalization performance of the ensemble. The inspiration of devising Diversity-Focused-Two stems from the cognition that ensemble diversity attaches more importance to the difference among the classifiers in an ensemble. Finally, the proposal of Accuracy-Reinforcement reinforces the concern about ensemble accuracy. Extensive experiments verified the effectiveness and efficiency of the proposed three pruning measures. Through the investigation of this work, it is found that by considering diversity and accuracy simultaneously for ensemble pruning, well-performed selective ensemble with superior generalization capability can be acquired, which is the scientific value of this paper. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:75 / 91
页数:17
相关论文
共 51 条
[1]  
[Anonymous], 18 NAT C ART INT
[2]  
[Anonymous], 6 INT C DAT MIN HONG
[3]  
[Anonymous], 11 EUR C MACH LEARN
[4]  
[Anonymous], INT C ART INT APPL
[5]  
[Anonymous], 15 INT C PATT REC
[6]  
Banfield R. E., 2005, Information Fusion, V6, P49, DOI 10.1016/j.inffus.2004.04.005
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]  
Caruana R., 2004, P 21 INT C MACH LEAR, P18, DOI DOI 10.1145/1015330.1015432
[9]  
Chandra A, 2006, STUD COMP INTELL, V16, P429
[10]  
CORMEN TH, 2001, INTRO ALGORITHMS