ONLINE LEARNING IN SOFT COMMITTEE MACHINES

被引:160
作者
SAAD, D [1 ]
SOLLA, SA [1 ]
机构
[1] NIELS BOHR INST,CONNECT,DK-2100 COPENHAGEN,DENMARK
来源
PHYSICAL REVIEW E | 1995年 / 52卷 / 04期
关键词
D O I
10.1103/PhysRevE.52.4225
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
The problem of on-line learning in two-layer neural networks is studied within the framework of statistical mechanics. A fully connected committee machine with K hidden units is trained by gradient descent to perform a task defined by a teacher committee machine with M hidden units acting on randomly drawn inputs: The approach, based on a direct averaging over the activation of the hidden units, results in a set of first-order differential equations that describes the dynamical evolution of the overlaps among the various hidden units and allows for a computation of the generalization error. The equations of motion are obtained analytically for general K and M and provide a powerful tool used here to study a variety of realizable, overrealizable, and unrealizable learning scenarios and to analyze the role of the learning rate in controlling the evolution and convergence of the learning process.
引用
收藏
页码:4225 / 4243
页数:19
相关论文
共 17 条
  • [1] BARKAI N, 1995, ADV NEURAL INFORMATI, V7, P303
  • [2] LEARNING BY ONLINE GRADIENT DESCENT
    BIEHL, M
    SCHWARZE, H
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (03): : 643 - 656
  • [3] BIEHL M, 1994, EUROPHYS LETT, V25, P525
  • [4] ONLINE LEARNING IN THE COMMITTEE MACHINE
    COPELLI, M
    CATICHA, N
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (06): : 1615 - 1625
  • [5] CYBENKO G, 1988, MATH CONTROL SIGNAL, V2, P303
  • [6] Denker J., 1987, Complex Systems, V1, P877
  • [7] LEARNING-PROCESSES IN NEURAL NETWORKS
    HESKES, TM
    KAPPEN, B
    [J]. PHYSICAL REVIEW A, 1991, 44 (04): : 2718 - 2726
  • [8] MULTILAYER FEEDFORWARD NETWORKS ARE UNIVERSAL APPROXIMATORS
    HORNIK, K
    STINCHCOMBE, M
    WHITE, H
    [J]. NEURAL NETWORKS, 1989, 2 (05) : 359 - 366
  • [9] PERFECT LOSS OF GENERALIZATION DUE TO NOISE IN K=2 PARITY MACHINES
    KABASHIMA, Y
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1994, 27 (06): : 1917 - 1927
  • [10] OPTIMAL GENERALIZATION IN PERCEPTRONS
    KINOUCHI, O
    CATICHA, N
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1992, 25 (23): : 6243 - 6250