An Inexact Sequential Quadratic Programming Method for Learning and Control of Recurrent Neural Networks

Cited by: 0

Authors:

Adeoye, Adeyemi D. [1]

Bemporad, Alberto [1]

Affiliations:

[1] IMT Sch Adv Studies Lucca, I-55100 Lucca, Italy

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2025, Vol. 36, No. 2

Keywords:

Training; Recurrent neural networks; Optimization; Neural networks; Quadratic programming; Process control; Prediction algorithms; Gauss-Newton methods; Markov decision processes; numerical optimization; recurrent neural networks (RNNs); reinforcement learning (RL); sequential quadratic programming (SQP); GRADIENT DESCENT; ALGORITHMS; TERM

DOI:

10.1109/TNNLS.2024.3354855

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

This article considers the two-stage approach to solving a partially observable Markov decision process (POMDP): the identification stage and the (optimal) control stage. We present an inexact sequential quadratic programming framework for recurrent neural network learning (iSQPRL) for solving the identification stage of the POMDP, in which the true system is approximated by a recurrent neural network (RNN) with dynamically consistent overshooting (DCRNN). We formulate the learning problem as a constrained optimization problem and study the quadratic programming (QP) subproblem with a convergence analysis under a restarted Krylov-subspace iterative scheme that implicitly exploits the structure of the associated Karush-Kuhn-Tucker (KKT) subsystem. In the control stage, where a feedforward neural network (FNN) controller is designed on top of the RNN model, we adapt a generalized Gauss-Newton (GGN) algorithm that exploits useful approximations to the curvature terms of the training data and selects its mini-batch step size using a known property of some regularization function. Simulation results are provided to demonstrate the effectiveness of our approach.
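For orientation, the QP subproblem and KKT system mentioned in the abstract can be written in the generic textbook form of equality-constrained SQP. This is only a sketch under assumed notation: the symbols f, c, w, d, H_k, J_k, r_k, and \eta_k are illustrative and are not taken from the article, whose exact formulation and inexactness criterion may differ.

```latex
% Generic problem (assumed notation): minimize f(w) subject to c(w) = 0.
% H_k approximates the Hessian of the Lagrangian, J_k is the Jacobian of c at w_k.
\[
\begin{aligned}
&\text{QP subproblem at iterate } (w_k, \lambda_k):\\
&\min_{d}\; \nabla f(w_k)^{\top} d + \tfrac{1}{2}\, d^{\top} H_k d
\quad \text{s.t.} \quad c(w_k) + J_k d = 0, \\[4pt]
&\text{KKT system:}\quad
\begin{bmatrix} H_k & J_k^{\top} \\ J_k & 0 \end{bmatrix}
\begin{bmatrix} d_k \\ \lambda_{k+1} \end{bmatrix}
= -\begin{bmatrix} \nabla f(w_k) \\ c(w_k) \end{bmatrix}, \\[4pt]
&\text{inexact acceptance (one common choice):}\quad
\|r_k\| \le \eta_k \left\| \begin{bmatrix} \nabla f(w_k) \\ c(w_k) \end{bmatrix} \right\|,
\qquad 0 \le \eta_k < 1.
\end{aligned}
\]
```

Here r_k denotes the residual of the KKT system at the approximate solution returned by an iterative solver. A restarted Krylov-subspace method such as GMRES(m) can be stopped as soon as a relative-residual test of this kind holds, which is the sense in which the SQP step is "inexact".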


Pages: 2762-2776

Page count: 15

