NETWORK INFORMATION CRITERION - DETERMINING THE NUMBER OF HIDDEN UNITS FOR AN ARTIFICIAL NEURAL-NETWORK MODEL

Cited by: 411

Authors:

MURATA, N

YOSHIZAWA, S

AMARI, S

Affiliation:

[1] Department of Mathematical Engineering and Information Physics, Faculty of Engineering, University of Tokyo, Bunkyo-ku

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS | 1994 / Vol. 5 / No. 06

DOI:

10.1109/72.329683

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The problem of model selection, or determination of the number of hidden units, can be approached statistically by generalizing Akaike's Information Criterion (AIC) to be applicable to unfaithful (i.e., unrealizable) models with general loss criteria, including regularization terms. The relation between the training error and the generalization error is studied in terms of the number of training examples and the complexity of the network, which reduces to the number of parameters in the ordinary statistical theory of the AIC. This relation leads to a new Network Information Criterion (NIC), which is useful for selecting the optimal network model based on a given training set.
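
The abstract describes the criterion without writing it down. As a rough sketch only, the NIC is commonly quoted in the form below. The notation is an assumption introduced here and does not come from this record: t is the number of training examples, d(x, θ) is the loss (possibly including a regularization term), and θ̂ is the parameter that minimizes the empirical loss.

```latex
% Assumed notation (not from the record): t training examples x_1, ..., x_t,
% loss d(x, \theta), and \hat{\theta} the minimizer of the empirical loss.
\[
  \mathrm{NIC}
    = \frac{1}{t}\sum_{i=1}^{t} d(x_i, \hat{\theta})
    + \frac{1}{t}\,\operatorname{tr}\!\left(G\,Q^{-1}\right),
\]
\[
  G = \operatorname{Var}\!\left[\nabla_{\theta}\, d(x, \theta)\right],
  \qquad
  Q = \operatorname{E}\!\left[\nabla_{\theta}\nabla_{\theta}^{\top}\, d(x, \theta)\right].
\]
% Special case: for a faithful (realizable) model with d = -\log p(x; \theta),
% G = Q, so \operatorname{tr}(G Q^{-1}) = m, the number of parameters,
% and the penalty term becomes m / t (the AIC up to the scale factor 2t).
```

In this form the first term is the training error and the second is the complexity penalty. In the faithful log-likelihood case the penalty is just the parameter count divided by t, which is consistent with the abstract's remark that the complexity reduces to the number of parameters in the ordinary AIC theory.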

Pages: 865-872

Page count: 8

References (16 in total):

[1] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].

AKAIKE, H .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723

[2] STATISTICAL-THEORY OF LEARNING-CURVES UNDER ENTROPIC LOSS CRITERION [J].

AMARI, S ;

MURATA, N .

NEURAL COMPUTATION, 1993, 5 (01) :140-153

[3] A THEORY OF ADAPTIVE PATTERN CLASSIFIERS [J].

AMARI, S .

IEEE TRANSACTIONS ON ELECTRONIC COMPUTERS, 1967, EC16 (03) :299-+

[4]

[Anonymous], 1987, LEARNING INTERNAL RE

[5]

FOGEL D, 1991, IEEE T NEURAL NETWOR, V2, P490

[6]

HAGIWARA K, 1993, T IEICE, V6, P2058

[7]

MOODY JE, 1992, ADV NEURAL INFORMATI, V4

[8]

MURATA N, 1991, ARTIFICIAL NEURAL NETWORKS, VOLS 1 AND 2, P9

[9]

MURATA N, 1993, ADV NEURAL INFORMATI, V5, P607

[10]

MURATA N, 1992, THESIS U TOKYO
