Input decimated ensembles

被引:51
作者
Tumer, K [1 ]
Oza, NC [1 ]
机构
[1] NASA, Ames Res Ctr, Moffett Field, CA 94035 USA
关键词
classification; combining classifier; correlation reduction; dimensionality reduction; ensembles; feature selection;
D O I
10.1007/s10044-002-0181-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Using an ensemble of classifiers instead of a single classifier has been shown to improve generalization performance in many pattern recognition problems. However, the extent of such improvement depends greatly on the amount of correlation among the errors of the base classifiers. Therefore, reducing those correlations while keeping the classifiers' performance levels high is an important area of research. In this article, we explore Input Decimation (ID), a method which selects feature subsets for their ability to discriminate among the classes and uses these subsets to decouple the base classifiers. We provide a summary of the theoretical benefits of correlation reduction, along with results of our method on two underwater sonar data sets, three benchmarks from the Probenl/UCI repositories, and two synthetic data sets. The results indicate that input decimated ensembles outperform ensembles whose base classifiers use all the input features; randomly selected subsets of features; and features created using principal components analysis, on a wide range of domains.
引用
收藏
页码:65 / 77
页数:13
相关论文
共 52 条
[1]  
[Anonymous], COMBINING ARTICIAL N
[2]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[3]  
BERGER J. O., 2013, Statistical Decision Theory and Bayesian Analysis, DOI [10.1007/978-1-4757-4286-2, DOI 10.1007/978-1-4757-4286-2]
[4]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[5]  
Blake C.L., 1998, UCI repository of machine learning databases
[6]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[7]  
BOLLACKER KD, 1996, P 13 INT C PATT REC, V4, P720
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]  
Cherkauer K. J., 1996, AAAI WORKSH INT MULT, P15
[10]   Sammon's mapping using neural networks: A comparison [J].
de Ridder, D ;
Duin, RPW .
PATTERN RECOGNITION LETTERS, 1997, 18 (11-13) :1307-1316