Accuracy/diversity and ensemble MLP classifier design

被引:116
作者
Windeatt, Terry [1 ]
机构
[1] Univ Surrey, Sch Elect & Phys Sci, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2006年 / 17卷 / 05期
关键词
Boolean; diversity; error-correcting output coding (ECOC); face identification; multiple classifiers;
D O I
10.1109/TNN.2006.875979
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The difficulties of tuning parameters of multilayer perceptrons (MLP) classifiers are well known. In this paper, a measure is described that is capable of predicting the number of classifier training epochs for achieving optimal performance in an ensemble of MLP classifiers. The measure is computed between pairs of patterns on the training data and is based on a spectral representation of a Boolean function. This representation characterizes; the mapping from classifier decisions to target label and allows accuracy and diversity to be incorporated within a single measure. Results on many benchmark problems, including the Olivetti Research Laboratory (ORL) face database demonstrate that the measure is well correlated with base-classifier test error, and may be used to predict the optimal number of training epochs. While correlation with ensemble test error is not quite as strong, it is shown in this paper that the measure may be used to predict number of epochs for optimal ensemble performance. Although the technique is only applicable to two-class problems, it is extended here to multiclass through output coding. For the output-coding technique, a random code matrix is shown to give better performance than one-per-class code, even when the base classifier is well-tuned.
引用
收藏
页码:1194 / 1211
页数:18
相关论文
共 37 条
[1]   Reducing multiclass to binary: A unifying approach for margin classifiers [J].
Allwein, EL ;
Schapire, RE ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :113-141
[2]  
[Anonymous], 1998, UCI REPOSITORY MACHI
[3]  
Breiman L, 1998, ANN STAT, V26, P801
[4]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[5]   Randomizing outputs to increase prediction accuracy [J].
Breiman, L .
MACHINE LEARNING, 2000, 40 (03) :229-242
[6]   Estimating generalization error on two-class datasets using out-of-bag estimates [J].
Bylander, T .
MACHINE LEARNING, 2002, 48 (1-3) :287-297
[7]  
CARUNA R, 2000, P NEUR INF PROC SYST
[8]   On the learnability and design of output codes for multiclass problems [J].
Crammer, K ;
Singer, Y .
MACHINE LEARNING, 2002, 47 (2-3) :201-233
[9]  
CRAMMER K, 2002, ADV NEURAL INFORMATI, V14
[10]  
Dietterich TG, 1994, J ARTIF INTELL RES, V2, P263