Ensembling neural networks: Many could be better than all

被引:1399
|
作者
Zhou, ZH [1 ]
Wu, JX [1 ]
Tang, W [1 ]
机构
[1] Nanjing Univ, Natl Lab Novel Software Technol, Nanjing 210093, Peoples R China
关键词
neural networks; neural network ensemble; machine learning; selective ensemble; boosting; bagging; genetic algorithm; bias-variance decomposition;
D O I
10.1016/S0004-3702(02)00190-X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural network ensemble is a learning paradigm where many neural networks are jointly used to solve a problem. In this paper, the relationship between the ensemble and its component neural networks is analyzed from the context of both regression and classification, which reveals that it may be better to ensemble many instead of all of the neural networks at hand. This result is interesting because at present, most approaches ensemble all the available neural networks for prediction. Then, in order to show that the appropriate neural networks for composing an ensemble can be effectively selected from a set of available neural networks, an approach named GASEN is presented. GASEN trains a number of neural networks at first. Then it assigns random weights to those networks and employs genetic algorithm to evolve the weights so that they can characterize to some extent the fitness of the neural networks in constituting an ensemble. Finally it selects some neural networks based on the evolved weights to make up the ensemble. A large empirical study shows that, compared with some popular ensemble approaches such as Bagging and Boosting, GASEN can generate neural network ensembles with far smaller sizes but stronger generalization ability. Furthermore, in order to understand the working mechanism of GASEN, the bias-variance decomposition of the error is provided in this paper, which shows that the success of GASEN may lie in that it can significantly reduce the bias as well as the variance. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:239 / 263
页数:25
相关论文
共 50 条
  • [1] Ensemble modelling or selecting the best model: Many could be better than one
    Barai, SV
    Reich, Y
    AI EDAM-ARTIFICIAL INTELLIGENCE FOR ENGINEERING DESIGN ANALYSIS AND MANUFACTURING, 1999, 13 (05): : 377 - 386
  • [2] A novel ensembling method to boost performance of neural networks
    Chakraborty, Manomita
    Biswas, Saroj Kumar
    Purkayastha, Biswajit
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2020, 32 (01) : 17 - 29
  • [3] Two is better than one: A diploid genotype for neural networks
    Calabretta, R
    Galbiati, R
    Nolfi, S
    Parisi, D
    NEURAL PROCESSING LETTERS, 1996, 4 (03) : 149 - 155
  • [4] Are random forests better suited than neural networks to augment RANS turbulence models?
    Volpiani, Pedro Stefanin
    INTERNATIONAL JOURNAL OF HEAT AND FLUID FLOW, 2024, 107
  • [5] Forecasting hourly NO2 concentrations by ensembling neural networks and mesoscale models
    Valput, Damir
    Navares, Ricardo
    Aznarte, Jose L.
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13) : 9331 - 9342
  • [6] Structural Coverage Criteria for Neural Networks Could Be Misleading
    Li, Zenan
    Ma, Xiaoxing
    Xu, Chang
    Cao, Chun
    2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: NEW IDEAS AND EMERGING RESULTS (ICSE-NIER 2019), 2019, : 89 - 92
  • [7] Better output prediction in hog slaughtering with neural networks?
    Muller, B
    FLEISCHWIRTSCHAFT, 1997, 77 (01): : 80 - 83
  • [8] Collaborative is better than Adversarial: Generative Cooperative Networks for Topic Clustering
    Lenzi, Andrea
    Velardi, Paola
    37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 688 - 695
  • [9] On neural networks for generating better local optima in topology optimization
    Herrmann, Leon
    Sigmund, Ole
    Li, Viola Muning
    Vogl, Christian
    Kollmannsberger, Stefan
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2024, 67 (11)
  • [10] A Better Predictor of Marathon Race Times Based on Neural Networks
    Dracopoulos, Dimitris C.
    ARTIFICIAL INTELLIGENCE XXXIV, AI 2017, 2017, 10630 : 293 - 299