Momentum Acceleration of Quasi-Newton Training for Neural Networks

Cited by: 5
Authors
Mahboubi, Shahrzad [1 ]
Indrapriyadarsini, S. [2 ]
Ninomiya, Hiroshi [1 ]
Asai, Hideki [2 ]
Affiliations
[1] Shonan Inst Technol, 1-1-25 Tsujido Nishikaigan, Fujisawa, Kanagawa 2518511, Japan
[2] Shizuoka Univ, Naka Ku, 3-5-1 Johoku, Hamamatsu, Shizuoka 4328011, Japan
Source
PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II | 2019 / Vol. 11671
Keywords
Neural networks; Training algorithm; Quasi-Newton method; Nesterov's accelerated quasi-Newton method; Momentum terms;
DOI
10.1007/978-3-030-29911-8_21
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper describes a novel technique for accelerating the quasi-Newton method (QN) with momentum terms in neural network training. Recently, Nesterov's accelerated quasi-Newton method (NAQ) showed that the momentum term is effective in reducing the number of iterations and accelerating convergence. However, NAQ training has to calculate the gradient twice per iteration, which increases the computation time of a training loop compared with conventional QN. This research improves NAQ by approximating the Nesterov's accelerated gradient used in NAQ as a linear combination of the current and previous gradients, so that the gradient is calculated only once per iteration, as in QN. The performance of the proposed algorithm is evaluated through computer simulations on a benchmark function modeling problem and on real-world microwave circuit modeling problems. The results show a significant reduction in computation time compared with conventional training algorithms.
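The linear-combination idea in the abstract can be illustrated with a small sketch. The abstract does not state the coefficients, so the form used here, grad E(w_k + mu * v_k) ≈ (1 + mu) * grad E(w_k) - mu * grad E(w_{k-1}) with v_k = w_k - w_{k-1}, is an assumption obtained by linearly extrapolating the gradient. The quadratic test function, the helper names, and all numeric values are hypothetical and chosen only for illustration. On a quadratic the gradient is affine, so the extrapolation should match the true look-ahead gradient up to rounding; on a general loss it is only an approximation.

```python
# Hypothetical quadratic test problem E(w) = 0.5 * w^T A w - b^T w (not from the paper).
A = [[3.0, 1.0], [1.0, 2.0]]
b = [1.0, -1.0]

def grad(w):
    """Gradient of the quadratic: A w - b."""
    return [sum(A[i][j] * w[j] for j in range(2)) - b[i] for i in range(2)]

def nesterov_grad(w, v, mu):
    """Gradient at the look-ahead point w + mu * v (costs one extra gradient evaluation)."""
    return grad([wi + mu * vi for wi, vi in zip(w, v)])

def approx_nesterov_grad(g_curr, g_prev, mu):
    """Assumed approximation: (1 + mu) * g_k - mu * g_{k-1}, reusing stored gradients."""
    return [(1.0 + mu) * gc - mu * gp for gc, gp in zip(g_curr, g_prev)]

mu = 0.9                                     # momentum coefficient (illustrative value)
w_prev = [0.5, -0.2]                         # previous iterate w_{k-1}
w_curr = [0.3, 0.1]                          # current iterate w_k
v = [c - p for c, p in zip(w_curr, w_prev)]  # assumed momentum term v_k = w_k - w_{k-1}

exact = nesterov_grad(w_curr, v, mu)
approx = approx_nesterov_grad(grad(w_curr), grad(w_prev), mu)
max_err = max(abs(e - a) for e, a in zip(exact, approx))
print(max_err)
```

In a training loop, the gradient at w_{k-1} would already be stored from the previous iteration, so only the gradient at w_k has to be computed, which is the once-per-iteration cost the abstract describes.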
Pages: 268-281
Number of pages: 14
Related papers
50 in total
  • [21] Optimal motion planning for a rigid spacecraft with two momentum wheels using quasi-Newton method
    Xinsheng Ge
    Qizhi Zhang
    Li-Qun Chen
    Acta Mechanica Solida Sinica, 2006, 19 : 334 - 340
  • [22] Optimal motion planning for a rigid spacecraft with two momentum wheels using quasi-Newton method
    Ge Xinsheng
    Zhang Qizhi
    Chen Li-Qun
    ACTA MECHANICA SOLIDA SINICA, 2006, 19 (04) : 334 - 340
  • [23] OPTIMAL MOTION PLANNING FOR A RIGID SPACECRAFT WITH TWO MOMENTUM WHEELS USING QUASI-NEWTON METHOD
    Ge Xinsheng (Basic Science Courses Department)
    Acta Mechanica Solida Sinica, 2006, (04) : 334 - 340
  • [24] Surrogate-based acceleration of quasi-Newton techniques for fluid-structure interaction simulations
    Delaisse, Nicolas
    Demeester, Toon
    Fauconnier, Dieter
    Degroote, Joris
    COMPUTERS & STRUCTURES, 2022, 260
  • [25] Smoothing Newton and quasi-Newton methods for mixed complementarity problems
    Li, DH
    Fukushima, M
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2000, 17 (2-3) : 203 - 230
  • [26] FPGA-based acceleration of Davidon-Fletcher-Powell quasi-Newton optimization method
    Liu Q.
    Sang R.
    Zhang Q.
    Transactions of Tianjin University, 2016, 22 (5) : 381 - 387
  • [27] FPGA-based Acceleration of Davidon-Fletcher-Powell Quasi-Newton Optimization Method
    刘强
    桑若愚
    张齐军
    Transactions of Tianjin University, 2016, (05) : 381 - 387
  • [28] Smoothing Newton and Quasi-Newton Methods for Mixed Complementarity Problems
    Donghui Li
    Masao Fukushima
    Computational Optimization and Applications, 2000, 17 : 203 - 230
  • [29] A Stochastic Quasi-Newton Method with Nesterov's Accelerated Gradient
    Indrapriyadarsini, S.
    Mahboubi, Shahrzad
    Ninomiya, Hiroshi
    Asai, Hideki
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT I, 2020, 11906 : 743 - 760
  • [30] ON THE BEHAVIOR OF BROYDENS CLASS OF QUASI-NEWTON METHODS
    Byrd, Richard H.
    Liu, Dong C.
    Nocedal, Jorge
    SIAM JOURNAL ON OPTIMIZATION, 1992, 2 (04) : 533 - 557