Ensembling neural networks: Many could be better than all

被引：1434

作者：

Zhou, ZH ^{[1
]}

Wu, JX ^{[1
]}

Tang, W ^{[1
]}

机构：

[1] Nanjing Univ, Natl Lab Novel Software Technol, Nanjing 210093, Peoples R China

来源：

ARTIFICIAL INTELLIGENCE | 2002年 / 137卷 / 1-2期

关键词：

neural networks; neural network ensemble; machine learning; selective ensemble; boosting; bagging; genetic algorithm; bias-variance decomposition;

D O I：

10.1016/S0004-3702(02)00190-X

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Neural network ensemble is a learning paradigm where many neural networks are jointly used to solve a problem. In this paper, the relationship between the ensemble and its component neural networks is analyzed from the context of both regression and classification, which reveals that it may be better to ensemble many instead of all of the neural networks at hand. This result is interesting because at present, most approaches ensemble all the available neural networks for prediction. Then, in order to show that the appropriate neural networks for composing an ensemble can be effectively selected from a set of available neural networks, an approach named GASEN is presented. GASEN trains a number of neural networks at first. Then it assigns random weights to those networks and employs genetic algorithm to evolve the weights so that they can characterize to some extent the fitness of the neural networks in constituting an ensemble. Finally it selects some neural networks based on the evolved weights to make up the ensemble. A large empirical study shows that, compared with some popular ensemble approaches such as Bagging and Boosting, GASEN can generate neural network ensembles with far smaller sizes but stronger generalization ability. Furthermore, in order to understand the working mechanism of GASEN, the bias-variance decomposition of the error is provided in this paper, which shows that the success of GASEN may lie in that it can significantly reduce the bias as well as the variance. (C) 2002 Elsevier Science B.V. All rights reserved.

引用

页码：239 / 263

页数：25

共 47 条

[1]

[Anonymous], 1989, GENETIC ALGORITHM SE

[2]

[Anonymous], 1999, Combining Artificial Neural Nets: Ensemble and Modular Multi-Net Systems

[3] An empirical comparison of voting classification algorithms: Bagging, boosting, and variants [J].

Bauer, E ;

Kohavi, R .

MACHINE LEARNING, 1999, 36 (1-2) :105-139

[4]

Blake C.L., 1998, UCI repository of machine learning databases

[5] Random forests [J].

Breiman, L .

MACHINE LEARNING, 2001, 45 (01) :5-32

[6]

Breiman L, 1996, rapport technique n 460

[7]

Cherkauer Kevin J., 1996, Working Notes of the AAAI Workshop on Integrating Multiple Learned Models, P15

[8] Stability problems with artificial neural networks and the ensemble solution [J].

Cunningham, P ;

Carney, J ;

Jacob, S .

ARTIFICIAL INTELLIGENCE IN MEDICINE, 2000, 20 (03) :217-225

[9]

Demuth H., 1998, NEURAL NETWORK TOOLB

[10]

Drucker H., 1993, ADV NEURAL INFORM PR, P42

← 1 2 3 4 5 →