NETWORK INFORMATION CRITERION - DETERMINING THE NUMBER OF HIDDEN UNITS FOR AN ARTIFICIAL NEURAL-NETWORK MODEL

Cited by: 411
Authors
MURATA, N
YOSHIZAWA, S
AMARI, S
Affiliation
[1] Department of Mathematical Engineering and Information Physics, Faculty of Engineering, University of Tokyo, Bunkyo-ku
Source
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1994 / Vol. 5 / No. 6
DOI
10.1109/72.329683
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The problem of model selection, or determination of the number of hidden units, can be approached statistically by generalizing Akaike's Information Criterion (AIC) to be applicable to unfaithful (i.e., unrealizable) models with general loss criteria, including regularization terms. The relation between the training error and the generalization error is studied in terms of the number of training examples and the complexity of a network, which reduces to the number of parameters in the ordinary statistical theory of the AIC. This relation leads to a new Network Information Criterion (NIC), which is useful for selecting the optimal network model based on a given training set.
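For orientation, the criterion described in the abstract is commonly written in the form sketched here. The notation is an assumption and is not taken from this record: $t$ is the number of training examples, $d(x, y; \theta)$ is the per-example loss (possibly including a regularization term), $\hat{\theta}$ minimizes the empirical loss, $Q$ is the Hessian of the expected loss, and $G$ is the covariance of the loss gradient. In practice both matrices would be estimated from the training set at $\hat{\theta}$.

```latex
\mathrm{NIC} = \frac{1}{t}\sum_{i=1}^{t} d(x_i, y_i; \hat{\theta})
             + \frac{1}{t}\,\operatorname{tr}\!\left(G\,Q^{-1}\right),
\qquad
Q = \mathbb{E}\!\left[\nabla\nabla d(x, y; \theta)\right],
\quad
G = \mathbb{V}\!\left[\nabla d(x, y; \theta)\right]
```

Under these assumptions, when the model is faithful and the loss is the negative log-likelihood, $G = Q$, so the trace equals the number of parameters $m$ and the penalty becomes $m/t$, which agrees with the AIC up to an overall factor of $2t$. Candidate networks with different numbers of hidden units would then be compared by this value, with smaller values preferred.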
Pages: 865-872
Page count: 8