Reinforcement learning-based optimal control of unknown constrained-input nonlinear systems using simulated experience

被引:0
作者
Asl, Hamed Jabbari [1 ]
Uchibe, Eiji [1 ]
机构
[1] ATR Computat Neurosci Labs, Dept Brain Robot Interface, 2-2-2 Hikaridai, Seika, Kyoto 6190288, Japan
关键词
Optimal control; Reinforcement learning; Input constraints; Uncertainty; APPROXIMATE OPTIMAL-CONTROL; TRACKING CONTROL; CONTINUOUS-TIME;
D O I
10.1007/s11071-023-08688-0
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Reinforcement learning (RL) provides a way to approximately solve optimal control problems. Furthermore, online solutions to such problems require a method that guarantees convergence to the optimal policy while also ensuring stability during the learning process. In this study, we develop an online RL-based optimal control framework for input-constrained nonlinear systems. Its design includes two new model identifiers that learn a system's drift dynamics: a slow identifier used to simulate experience that supports the convergence of optimal problem solutions and a fast identifier that keeps the system stable during the learning phase. This approach is a critic-only design, in which a new fast estimation law is developed for a critic network. A Lyapunov-based analysis shows that the estimated control policy converges to the optimal one. Moreover, simulation studies demonstrate the effectiveness of our developed control scheme.
引用
收藏
页码:16093 / 16110
页数:18
相关论文
共 50 条
  • [21] Reinforcement Learning-Based Event-Triggered Predefined-Time Optimal Fuzzy Control for Nonlinear Constrained Systems
    Song, Xiaona
    Sun, Peng
    Ahn, Choon Ki
    Song, Shuai
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (08) : 4646 - 4659
  • [22] Optimal Leader-Follower Consensus for Constrained-Input Multiagent Systems With Completely Unknown Dynamics
    Shi, Jing
    Yue, Dong
    Xie, Xiangpeng
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (02): : 1182 - 1191
  • [23] A policy iteration approach to online optimal control of continuous-time constrained-input systems
    Modares, Hamidreza
    Sistani, Mohammad-Bagher Naghibi
    Lewis, Frank L.
    ISA TRANSACTIONS, 2013, 52 (05) : 611 - 621
  • [24] H∞ control with constrained input for completely unknown nonlinear systems using data-driven reinforcement learning method
    Jiang, He
    Zhang, Huaguang
    Luo, Yanhong
    Cui, Xiaohong
    NEUROCOMPUTING, 2017, 237 : 226 - 234
  • [25] Event-triggered-based online integral reinforcement learning for optimal control of unknown constrained nonlinear systems
    Han, Xiumei
    Zhao, Xudong
    Wang, Ding
    Wang, Bohui
    INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (02) : 213 - 225
  • [26] Reinforcement learning-based output feedback control of nonlinear systems with input constraints
    He, P
    Jagannathan, S
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (01): : 150 - 154
  • [27] Integral Reinforcement Learning-Based Optimal Control for Nonzero-Sum Games of Multi-Player Input-Constrained Nonlinear Systems
    Wu, Qiuye
    Zhao, Bo
    Liu, Derong
    2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021, : 59 - 63
  • [28] Optimal Synchronization Control of Heterogeneous Asymmetric Input-Constrained Unknown Nonlinear MASs via Reinforcement Learning
    Xia, Lina
    Li, Qing
    Song, Ruizhuo
    Modares, Hamidreza
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (03) : 520 - 532
  • [29] Online approximate solution of HJI equation for unknown constrained-input nonlinear continuous-time systems
    Yang, Xiong
    Liu, Derong
    Ma, Hongwen
    Xu, Yancai
    INFORMATION SCIENCES, 2016, 328 : 435 - 454
  • [30] Adaptive Dynamic Programming for H∞ Control of Constrained-Input Nonlinear Systems
    Yang Xiong
    Liu Derong
    Wei Qinglai
    Wang Ding
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3027 - 3032