CREATING ARTIFICIAL NEURAL NETWORKS THAT GENERALIZE

Cited by: 346
Authors
SIETSMA, J
DOW, RJF
Keywords
NEURAL NETWORKS; BACK-PROPAGATION; PATTERN RECOGNITION; GENERALIZATION; HIDDEN UNITS; PRUNING
DOI
10.1016/0893-6080(91)90033-2
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
We develop a technique to test the hypothesis that multilayered, feed-forward networks with few units on the first hidden layer generalize better than networks with many units in the first layer. Large networks are trained to perform a classification task, and the redundant units are removed ("pruning") to produce the smallest network capable of performing the task. A technique for inserting layers where pruning has introduced linear inseparability is also described. Two tests of the ability to generalize are used: the ability to classify training inputs corrupted by noise, and the ability to classify new patterns from each class. The hypothesis is found to be false for networks trained with noisy inputs. Pruning to the minimum number of units in the first layer produces networks that correctly classify the training set but generalize poorly compared with larger networks.
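The pruning idea in the abstract lends itself to a short illustration. The sketch below is not the authors' procedure, only a minimal Python/NumPy interpretation of it: given the matrix of first-hidden-layer activations over the training set, it flags units whose output is nearly constant and units that nearly duplicate an already-kept unit, and it implements the first generalization test, classification of noise-corrupted training inputs. The function names, the thresholds var_tol and corr_tol, and the noise level sigma are assumptions for illustration, not values from the paper.

import numpy as np

def prune_redundant_units(H, var_tol=1e-3, corr_tol=0.98):
    # H: (n_patterns, n_units) activations of the first hidden layer
    # over the training set. Returns indices of units worth keeping.
    # var_tol and corr_tol are illustrative thresholds, not from the paper.
    keep = []
    for j in range(H.shape[1]):
        h = H[:, j]
        # A near-constant unit carries no pattern information; its mean
        # output can be folded into the biases of the next layer.
        if np.var(h) < var_tol:
            continue
        # A unit that (anti-)correlates almost perfectly with one already
        # kept is a duplicate; its outgoing weights can be merged instead.
        if any(abs(np.corrcoef(h, H[:, k])[0, 1]) > corr_tol for k in keep):
            continue
        keep.append(j)
    return keep

def noisy_input_accuracy(predict, X, y, sigma=0.1, seed=0):
    # First generalization test from the abstract: classify the training
    # inputs after corrupting them with additive Gaussian noise.
    rng = np.random.default_rng(seed)
    X_noisy = X + rng.normal(0.0, sigma, size=X.shape)
    return float(np.mean(predict(X_noisy) == y))

On this reading, a pruned network would be re-scored with noisy_input_accuracy against the unpruned one; the paper's finding is that the minimal network does worse on exactly this test.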
Pages: 67-79
Number of pages: 13