Momentum Acceleration of Quasi-Newton Training for Neural Networks

被引：5

作者：

Mahboubi, Shahrzad ^{[1
]}

Indrapriyadarsini, S. ^{[2
]}

Ninomiya, Hiroshi ^{[1
]}

Asai, Hideki ^{[2
]}

机构：

[1] Shonan Inst Technol, 1-1-25 Tsujido Nishikaigan, Fujisawa, Kanagawa 2518511, Japan

[2] Shizuoka Univ, Naka Ku, 3-5-1 Johoku, Hamamatsu, Shizuoka 4328011, Japan

来源：

PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II | 2019年 / 11671卷

关键词：

Neural networks; Training algorithm; Quasi-Newton method; Nesterov's accelerated quasi-Newton method; Momentum terms;

D O I：

10.1007/978-3-030-29911-8_21

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes a novel acceleration technique of quasi-Newton method (QN) using momentum terms for training in neural networks. Recently, Nesterov's accelerated quasi-Newton method (NAQ) has shown that the momentum term is effective in reducing the number of iterations and in accelerating its convergence speed. However, the gradients had to calculate two times during one iteration in the NAQ training. This increased the computation time of a training loop compared with the conventional QN. In this research, an improvement to NAQ is done by approximating the Nesterov's accelerated gradient used in NAQ as a linear combination of the current and previous gradients. Then the gradient is calculated only once per iteration same as QN. The performance of the proposed algorithm is evaluated through computer simulations on a benchmark problem of the function modeling and real-world problems of the microwave circuit modeling. The results show the significant acceleration in the computation time compared with conventional training algorithms.

引用

页码：268 / 281

页数：14

共 50 条

[21] Application of BP Neural Network Based on Quasi-Newton Method in Aerodynamic Modeling [J].

Yang Huiying ;

Huang Zhibin ;

Zhou Feng .

2017 16TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2017, :93-96

[22] Optimal motion planning for a rigid spacecraft with two momentum wheels using quasi-Newton method [J].

Ge Xinsheng ;

Zhang Qizhi ;

Chen Li-Qun .

ACTA MECHANICA SOLIDA SINICA, 2006, 19 (04) :334-340

[23] OPTIMAL MOTION PLANNING FOR A RIGID SPACECRAFT WITH TWO MOMENTUM WHEELS USING QUASI-NEWTON METHOD [J].

Ge Xinsheng Basic Science Courses Department Beijing Institute of Machinery Beijing China Zhang Qizhi Basic Science Courses Department Beijing Institute of Machinery Beijing ChinaChen LiQun Department of Mechanics Shanghai University Shanghai China .

Acta Mechanica Solida Sinica, 2006, (04) :334-340

[24] Surrogate-based acceleration of quasi-Newton techniques for fluid-structure interaction simulations [J].

Delaisse, Nicolas ;

Demeester, Toon ;

Fauconnier, Dieter ;

Degroote, Joris .

COMPUTERS & STRUCTURES, 2022, 260

[25] FPGA-based acceleration of Davidon-Fletcher-Powell quasi-Newton optimization method [J].

Liu Q. ;

Sang R. ;

Zhang Q. .

Transactions of Tianjin University, 2016, 22 (5) :381-387

[26] Smoothing Newton and quasi-Newton methods for mixed complementarity problems [J].

Li, DH ;

Fukushima, M .

COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2000, 17 (2-3) :203-230

[27] Smoothing Newton and Quasi-Newton Methods for Mixed Complementarity Problems [J].

Donghui Li ;

Masao Fukushima .

Computational Optimization and Applications, 2000, 17 :203-230

[28] FPGA-based Acceleration of Davidon-Fletcher-Powell Quasi-Newton Optimization Method [J].

刘强 ;

桑若愚 ;

张齐军 .

Transactions of Tianjin University, 2016, (05) :381-387

[29] A Stochastic Quasi-Newton Method with Nesterov's Accelerated Gradient [J].

Indrapriyadarsini, S. ;

Mahboubi, Shahrzad ;

Ninomiya, Hiroshi ;

Asai, Hideki .

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT I, 2020, 11906 :743-760

[30] ON THE BEHAVIOR OF BROYDENS CLASS OF QUASI-NEWTON METHODS [J].

Byrd, Richard H. ;

Liu, Dong C. ;

Nocedal, Jorge .

SIAM JOURNAL ON OPTIMIZATION, 1992, 2 (04) :533-557

← 1 2 3 4 5 →