SPEED UP LEARNING AND NETWORK OPTIMIZATION WITH EXTENDED BACK PROPAGATION

被引:58
作者
SPERDUTI, A
STARITA, A
机构
[1] Univ of Pisa, Pisa, Italy
关键词
SUPERVISED LEARNING; BACK PROPAGATION; SPEED-UP LEARNING; NETWORK OPTIMIZATION;
D O I
10.1016/0893-6080(93)90004-G
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Methods to speed up learning in back propagation and to optimize the network architecture have been recently studied. This paper shows how adaptation of the steepness of the sigmoids during learning treats these two topics in a common framework. The adaptation of the steepness of the sigmoids is obtained by gradient descent. The resulting learning dynamics can be simulated by a standard network with fixed sigmoids and a learning rule whose main component is a gradient descent with adaptive learning parameters. A law linking variation on the weights to variation on the steepness of the sigmoids is discovered. Optimization of units is obtained by introducing a tendency to decay to zero in the steepness values. This decay corresponds to a decay of the sensitivity of the units. Units with low final sensitivity can be removed after a given transformation of the biases of the network. A decreasing initial distribution of the steepness values is suggested to obtain a good compromise between speed of learning and network optimization. Simulation of the proposed procedure has shown an improvement of the mean convergence rate with respect to the standard back propagation and good optimization performance. Several 4-3-1 networks for the four bits parity problem were discovered.
引用
收藏
页码:365 / 383
页数:19
相关论文
共 26 条
[1]  
[Anonymous], 1987, LEARNING INTERNAL RE
[2]  
BACHMANN CM, 1990, THESIS BROWN U PROVI
[3]  
CATER JP, 1987, 1ST IEEE INT C NEUR, V11, P645
[4]  
CHAUVIN Y, 1989, ADV NEURAL INFORMATI, V1, P519
[5]  
FAHLMAN SE, 1988, 1988 P CONN MOD SUMM, P38
[6]  
HANSON SJ, 1989, ADV NEURAL INFORMATI, V1, P177
[7]  
Hinton G.E., 1986, P 8 ANN C COGN SCI S, V1, P12
[8]   INCREASED RATES OF CONVERGENCE THROUGH LEARNING RATE ADAPTATION [J].
JACOBS, RA .
NEURAL NETWORKS, 1988, 1 (04) :295-307
[9]  
JUTTEN C, 1991, LECT NOTES COMPUT SC, V540, P54, DOI 10.1007/BFb0035877
[10]  
Karnin E D, 1990, IEEE Trans Neural Netw, V1, P239, DOI 10.1109/72.80236