An Inexact Sequential Quadratic Programming Method for Learning and Control of Recurrent Neural Networks

被引:0
|
作者
Adeoye, Adeyemi D. [1 ]
Bemporad, Alberto [1 ]
机构
[1] IMT Sch Adv Studies Lucca, I-55100 Lucca, Italy
关键词
Training; Recurrent neural networks; Optimization; Neural networks; Quadratic programming; Process control; Prediction algorithms; Gauss-Newton methods; markov decision processes; numerical optimization; recurrent neural networks (RNNs); reinforcement learning (RL); sequential quadratic programming (SQP); GRADIENT DESCENT; ALGORITHMS; TERM;
D O I
10.1109/TNNLS.2024.3354855
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article considers the two-stage approach to solving a partially observable Markov decision process (POMDP): the identification stage and the (optimal) control stage. We present an inexact sequential quadratic programming framework for recurrent neural network learning (iSQPRL) for solving the identification stage of the POMDP, in which the true system is approximated by a recurrent neural network (RNN) with dynamically consistent overshooting (DCRNN). We formulate the learning problem as a constrained optimization problem and study the quadratic programming (QP) subproblem with a convergence analysis under a restarted Krylov-subspace iterative scheme that implicitly exploits the structure of the associated Karush-Kuhn-Tucker (KKT) subsystem. In the control stage, where a feedforward neural network (FNN) controller is designed on top of the RNN model, we adapt a generalized Gauss-Newton (GGN) algorithm that exploits useful approximations to the curvature terms of the training data and selects its mini-batch step size using a known property of some regularization function. Simulation results are provided to demonstrate the effectiveness of our approach.
引用
收藏
页码:2762 / 2776
页数:15
相关论文
共 50 条
  • [2] GLOBAL AND SUPERLINEAR CONVERGENCE OF INEXACT SEQUENTIAL QUADRATICALLY CONSTRAINED QUADRATIC PROGRAMMING METHOD FOR CONVEX PROGRAMMING
    Kato, Atsushi
    Narushima, Yasushi
    Yabe, Hiroshi
    PACIFIC JOURNAL OF OPTIMIZATION, 2012, 8 (03): : 609 - 629
  • [3] On the recurrent neural networks for solving general quadratic programming problems
    Mladenov, V
    NEUREL 2004: SEVENTH SEMINAR ON NEURAL NETWORK APPLICATIONS IN ELECTRICAL ENGINEERING, PROCEEDINGS, 2004, : 5 - 9
  • [4] Quantum recurrent neural networks for sequential learning
    Li, Yanan
    Wang, Zhimin
    Han, Rongbing
    Shi, Shangshang
    Li, Jiaxin
    Shang, Ruimin
    Zheng, Haiyong
    Zhong, Guoqiang
    Gu, Yongjian
    NEURAL NETWORKS, 2023, 166 : 148 - 161
  • [5] The Sequential Quadratic Programming Method
    Fletcher, Roger
    NONLINEAR OPTIMIZATION, 2010, 1989 : 165 - 214
  • [6] Recurrent Neural Networks with Fixed Time Convergence for Linear and Quadratic Programming
    Diego Sanchez-Torres, Juan
    Sanchez, Edgar N.
    Loukianov, Alexander G.
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [7] Robust reinforcement learning control using integral quadratic constraints for recurrent neural networks
    Anderson, Charles W.
    Young, Peter Michael
    Buehner, Michael R.
    Knight, James N.
    Bush, Keith A.
    Hittle, Douglas C.
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (04): : 993 - 1002
  • [8] A sequential quadratic programming method for nonlinear model predictive control
    Muske, KR
    Howse, JW
    LARGE-SCALE PDE-CONSTRAINED OPTIMIZATION, 2003, 30 : 253 - 267
  • [9] Residual Recurrent Neural Networks for Learning Sequential Representations
    Yue, Boxuan
    Fu, Junwei
    Liang, Jun
    INFORMATION, 2018, 9 (03)
  • [10] Learning in Polynomial Cellular Neural Networks using Quadratic Programming
    Gomez-Ramirez, E.
    Rubi-Velez, A.
    Pazienza, G. E.
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,