ONLINE LEARNING IN SOFT COMMITTEE MACHINES

被引:168
作者
SAAD, D [1 ]
SOLLA, SA [1 ]
机构
[1] NIELS BOHR INST,CONNECT,DK-2100 COPENHAGEN,DENMARK
来源
PHYSICAL REVIEW E | 1995年 / 52卷 / 04期
关键词
D O I
10.1103/PhysRevE.52.4225
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
The problem of on-line learning in two-layer neural networks is studied within the framework of statistical mechanics. A fully connected committee machine with K hidden units is trained by gradient descent to perform a task defined by a teacher committee machine with M hidden units acting on randomly drawn inputs: The approach, based on a direct averaging over the activation of the hidden units, results in a set of first-order differential equations that describes the dynamical evolution of the overlaps among the various hidden units and allows for a computation of the generalization error. The equations of motion are obtained analytically for general K and M and provide a powerful tool used here to study a variety of realizable, overrealizable, and unrealizable learning scenarios and to analyze the role of the learning rate in controlling the evolution and convergence of the learning process.
引用
收藏
页码:4225 / 4243
页数:19
相关论文
共 17 条
[1]  
BARKAI N, 1995, ADV NEURAL INFORMATI, V7, P303
[2]   LEARNING BY ONLINE GRADIENT DESCENT [J].
BIEHL, M ;
SCHWARZE, H .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (03) :643-656
[3]  
BIEHL M, 1994, EUROPHYS LETT, V25, P525
[4]   ONLINE LEARNING IN THE COMMITTEE MACHINE [J].
COPELLI, M ;
CATICHA, N .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (06) :1615-1625
[5]  
CYBENKO G, 1988, MATH CONTROL SIGNAL, V2, P303
[6]  
Denker J., 1987, Complex Systems, V1, P877
[7]   LEARNING-PROCESSES IN NEURAL NETWORKS [J].
HESKES, TM ;
KAPPEN, B .
PHYSICAL REVIEW A, 1991, 44 (04) :2718-2726
[8]   MULTILAYER FEEDFORWARD NETWORKS ARE UNIVERSAL APPROXIMATORS [J].
HORNIK, K ;
STINCHCOMBE, M ;
WHITE, H .
NEURAL NETWORKS, 1989, 2 (05) :359-366
[9]   PERFECT LOSS OF GENERALIZATION DUE TO NOISE IN K=2 PARITY MACHINES [J].
KABASHIMA, Y .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1994, 27 (06) :1917-1927
[10]   OPTIMAL GENERALIZATION IN PERCEPTRONS [J].
KINOUCHI, O ;
CATICHA, N .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1992, 25 (23) :6243-6250