Adaptive method of realizing natural gradient learning for multilayer perceptrons

被引:123
作者
Amari, S [1 ]
Park, H [1 ]
Fukumizu, K [1 ]
机构
[1] RIKEN, Brain Sci Inst, Wako, Saitama 3510198, Japan
关键词
D O I
10.1162/089976600300015420
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The natural gradient learning method is known to have ideal performances for on-line training of multilayer perceptrons. It avoids plateaus, which give rise to slow convergence of the backpropagation method. It is Fisher efficient, whereas the conventional method is not. However, for implementing the method, it is necessary to calculate the Fisher information matrix and its inverse, which is practically very difficult. This article proposes an adaptive method of directly obtaining the inverse of the Fisher information matrix. It generalizes the adaptive Gauss-Newton algorithms and provides a solid theoretical justification of them. Simulations show that the proposed adaptive method works very well for realizing natural gradient learning.
引用
收藏
页码:1399 / 1409
页数:11
相关论文
共 10 条
  • [1] Natural gradient works efficiently in learning
    Amari, S
    [J]. NEURAL COMPUTATION, 1998, 10 (02) : 251 - 276
  • [2] AMARI S, 2000, INFORMATION GEOMETRY
  • [3] Amari S., 1985, DIFFERENTIAL GEOMETR
  • [4] Bottou L., 1999, Online learning in neural networks, P9, DOI DOI 10.1017/CBO9780511569920.003
  • [5] The geometry of algorithms with orthogonality constraints
    Edelman, A
    Arias, TA
    Smith, ST
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1998, 20 (02) : 303 - 353
  • [6] Efficient backprop
    LeCun, Y
    Bottou, L
    Orr, GB
    Müller, KR
    [J]. NEURAL NETWORKS: TRICKS OF THE TRADE, 1998, 1524 : 9 - 50
  • [7] PARK H, 2000, UNPUB ADAPTIVE NATUR
  • [8] Natural gradient descent for on-line learning
    Rattray, M
    Saad, D
    Amari, S
    [J]. PHYSICAL REVIEW LETTERS, 1998, 81 (24) : 5461 - 5464
  • [9] ONLINE LEARNING IN SOFT COMMITTEE MACHINES
    SAAD, D
    SOLLA, SA
    [J]. PHYSICAL REVIEW E, 1995, 52 (04): : 4225 - 4243
  • [10] Complexity issues in natural gradient descent method for training multilayer perceptrons
    Yang, HH
    Amari, S
    [J]. NEURAL COMPUTATION, 1998, 10 (08) : 2137 - 2157