AN ANALYSIS OF PREMATURE SATURATION IN BACK-PROPAGATION LEARNING

被引:70
作者
LEE, Y
OH, SH
KIM, MW
机构
[1] Electronics and Telecommunications Research Institute, Daejeon
关键词
PREMATURE SATURATION; BACK PROPAGATION ALGORITHM; 1ST EPOCH; MULTILAYER PERCEPTRON;
D O I
10.1016/S0893-6080(05)80116-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The back propagation (BP) algorithm is widely used for finding optimum weights of multilayer neural networks in many pattern recognition applications. However, the critical drawbacks of the algorithm are its slow learning speed and convergence to local minima. One of the major reasons for these drawbacks is the ''premature saturation '' which is a phenomenon that the error of the neural network stays significantly high constant for some period of time during learning. It is known to be caused by an inappropriate set of initial weights. In this paper, the probability of premature saturation at the beginning epoch of learning procedure in the BP algorithm has been derived in terms of the maximum value of initial weights, the number of nodes in each layer, and the maximum slope of the sigmoidal activation function; it has been verified by the Monte Carlo simulation. Using this result, the premature saturation can be avoided with proper initial weight settings.
引用
收藏
页码:719 / 728
页数:10
相关论文
共 14 条
[2]  
BABA N, 1990, P INT JOINT C NEURAL, V1, P585
[3]  
CHEN JR, 1990, P INT JOINT C NEURAL, V1, P601
[4]  
CHEUNG RK, 1990, P INT JOINT C NEURAL, V1, P673
[5]  
Hecht-Nielsen R., 1992, NEURAL NETW PERCEPT, P65, DOI DOI 10.1016/B978-0-12-741252-8.50010-8
[6]  
HECHTNIELSEN, 1989, HNC NEUROSOFTWARE DO, P2
[7]   MULTILAYER FEEDFORWARD NETWORKS ARE UNIVERSAL APPROXIMATORS [J].
HORNIK, K ;
STINCHCOMBE, M ;
WHITE, H .
NEURAL NETWORKS, 1989, 2 (05) :359-366
[8]   INCREASED RATES OF CONVERGENCE THROUGH LEARNING RATE ADAPTATION [J].
JACOBS, RA .
NEURAL NETWORKS, 1988, 1 (04) :295-307
[9]   AN ADAPTIVE LEAST-SQUARES ALGORITHM FOR THE EFFICIENT TRAINING OF ARTIFICIAL NEURAL NETWORKS [J].
KOLLIAS, S ;
ANASTASSIOU, D .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1989, 36 (08) :1092-1101
[10]  
LI S, 1990, P INT JOINT C NEURAL, V1, P697