Bayesian nonlinear model selection and neural networks: A conjugate prior approach

Cited by: 34
Authors
Vila, JP [1]
Wagner, V [1]
Neveu, P [1]
Affiliation
[1] INRA, ENSAM, Lab Anal Syst & Biometrie, F-34060 Montpellier, France
Source
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2000 / Vol. 11 / No. 2
Keywords
Bayesian model selection; conjugate prior distribution; empirical Bayes methods; expected utility criterion; feedforward neural network; nonlinear regression;
DOI
10.1109/72.838999
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In order to select the best predictive neural-network architecture from a set of several candidate networks, we propose a general Bayesian nonlinear regression model comparison procedure, based on the maximization of an expected utility criterion. This criterion selects the model under which the training set achieves the highest level of internal consistency, through the predictive probability distribution of each model. The density of this distribution is computed as the model posterior predictive density and is asymptotically approximated from the assumed Gaussian likelihood of the data set and the related conjugate prior density of the parameters. The use of such a conjugate prior allows the analytic calculation of the parameter posterior and predictive posterior densities, in an empirical-Bayes-like approach. This Bayesian selection procedure allows us to compare general nonlinear regression models, and in particular feedforward neural networks, in addition to the embedded models usually handled by asymptotic comparison tests.
Pages: 265-278
Page count: 14