Momentum Acceleration of Quasi-Newton Training for Neural Networks

Cited by: 5

Authors:

Mahboubi, Shahrzad [1]

Indrapriyadarsini, S. [2]

Ninomiya, Hiroshi [1]

Asai, Hideki [2]

Affiliations:

[1] Shonan Inst Technol, 1-1-25 Tsujido Nishikaigan, Fujisawa, Kanagawa 2518511, Japan

[2] Shizuoka Univ, Naka Ku, 3-5-1 Johoku, Hamamatsu, Shizuoka 4328011, Japan

Source:

PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II | 2019 / Vol. 11671

Keywords:

Neural networks; Training algorithm; Quasi-Newton method; Nesterov's accelerated quasi-Newton method; Momentum terms;

DOI:

10.1007/978-3-030-29911-8_21

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper describes a novel technique for accelerating the quasi-Newton (QN) method with momentum terms for training neural networks. Recently, Nesterov's accelerated quasi-Newton method (NAQ) showed that the momentum term is effective in reducing the number of iterations and accelerating convergence. However, NAQ training has to calculate the gradient twice per iteration, which increases the computation time of a training loop compared with conventional QN. This research improves NAQ by approximating the Nesterov's accelerated gradient used in NAQ as a linear combination of the current and previous gradients, so that the gradient is calculated only once per iteration, the same as in QN. The performance of the proposed algorithm is evaluated through computer simulations on a benchmark function-modeling problem and on real-world microwave circuit modeling problems. The results show a significant reduction in computation time compared with conventional training algorithms.
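The abstract does not state the weights of the linear combination. One plausible reading, offered here only as a sketch, assumes the error function E is locally quadratic and the momentum vector is v_k = w_k - w_{k-1}. Under those assumptions the look-ahead point is w_k + mu*v_k = (1 + mu)*w_k - mu*w_{k-1}, and because the gradient of a quadratic is affine, grad E(w_k + mu*v_k) = (1 + mu)*grad E(w_k) - mu*grad E(w_{k-1}). The names `combine_gradients`, `grad`, `A`, `b`, and `mu`, and all numeric values below, are illustrative and not taken from the record.

```python
def combine_gradients(grad_curr, grad_prev, mu):
    """Assumed approximation of the Nesterov look-ahead gradient:
    (1 + mu) * grad E(w_k) - mu * grad E(w_{k-1})."""
    return [(1.0 + mu) * gc - mu * gp for gc, gp in zip(grad_curr, grad_prev)]


# Toy quadratic E(w) = 0.5 * w^T A w - b^T w, so grad E(w) = A w - b.
# A and b are made-up values used only to check the identity.
A = [[3.0, 1.0], [1.0, 2.0]]
b = [1.0, -1.0]


def grad(w):
    """Gradient of the toy quadratic at w."""
    return [sum(a * x for a, x in zip(row, w)) - b_i for row, b_i in zip(A, b)]


mu = 0.9                   # momentum coefficient (illustrative)
w_prev = [0.5, -0.2]       # w_{k-1}
w_curr = [0.3, 0.1]        # w_k
v = [c - p for c, p in zip(w_curr, w_prev)]  # assumed v_k = w_k - w_{k-1}

# Extra gradient evaluation at the look-ahead point, as NAQ requires.
lookahead = [c + mu * d for c, d in zip(w_curr, v)]
exact = grad(lookahead)

# Reuses the gradient stored from the previous iteration, so only
# grad(w_curr) is new work in this iteration.
approx = combine_gradients(grad(w_curr), grad(w_prev), mu)

max_abs_error = max(abs(e - a) for e, a in zip(exact, approx))
```

On this quadratic the two vectors agree up to floating-point rounding. For a general neural-network error function the combination is only an approximation, and its quality depends on how far E is from quadratic between w_{k-1} and the look-ahead point.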

Pages: 268-281

Page count: 14
