Reinforcement learning-based optimal control of unknown constrained-input nonlinear systems using simulated experience

Cited by: 0

Authors:

Asl, Hamed Jabbari [1]

Uchibe, Eiji [1]

Affiliations:

[1] ATR Computat Neurosci Labs, Dept Brain Robot Interface, 2-2-2 Hikaridai, Seika, Kyoto 6190288, Japan

Source:

NONLINEAR DYNAMICS | 2023 / Vol. 111 / Issue 17

Keywords:

Optimal control; Reinforcement learning; Input constraints; Uncertainty; APPROXIMATE OPTIMAL-CONTROL; TRACKING CONTROL; CONTINUOUS-TIME;

DOI:

10.1007/s11071-023-08688-0

Chinese Library Classification (CLC):

TH [Machinery and Instrument Industry];

Subject classification code:

0802;

Abstract:

Reinforcement learning (RL) provides a way to approximately solve optimal control problems. Furthermore, online solutions to such problems require a method that guarantees convergence to the optimal policy while also ensuring stability during the learning process. In this study, we develop an online RL-based optimal control framework for input-constrained nonlinear systems. Its design includes two new model identifiers that learn a system's drift dynamics: a slow identifier used to simulate experience that supports the convergence of optimal problem solutions and a fast identifier that keeps the system stable during the learning phase. This approach is a critic-only design, in which a new fast estimation law is developed for a critic network. A Lyapunov-based analysis shows that the estimated control policy converges to the optimal one. Moreover, simulation studies demonstrate the effectiveness of our developed control scheme.

Pages: 16093-16110

Page count: 18

50 items in total

[21] Reinforcement Learning-Based Event-Triggered Predefined-Time Optimal Fuzzy Control for Nonlinear Constrained Systems
Song, Xiaona
Sun, Peng
Ahn, Choon Ki
Song, Shuai
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (08) : 4646 - 4659
[22] Optimal Leader-Follower Consensus for Constrained-Input Multiagent Systems With Completely Unknown Dynamics
Shi, Jing
Yue, Dong
Xie, Xiangpeng
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (02) : 1182 - 1191
[23] A policy iteration approach to online optimal control of continuous-time constrained-input systems
Modares, Hamidreza
Sistani, Mohammad-Bagher Naghibi
Lewis, Frank L.
ISA TRANSACTIONS, 2013, 52 (05) : 611 - 621
[24] H∞ control with constrained input for completely unknown nonlinear systems using data-driven reinforcement learning method
Jiang, He
Zhang, Huaguang
Luo, Yanhong
Cui, Xiaohong
NEUROCOMPUTING, 2017, 237 : 226 - 234
[25] Event-triggered-based online integral reinforcement learning for optimal control of unknown constrained nonlinear systems
Han, Xiumei
Zhao, Xudong
Wang, Ding
Wang, Bohui
INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (02) : 213 - 225
[26] Reinforcement learning-based output feedback control of nonlinear systems with input constraints
He, P
Jagannathan, S
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (01) : 150 - 154
[27] Integral Reinforcement Learning-Based Optimal Control for Nonzero-Sum Games of Multi-Player Input-Constrained Nonlinear Systems
Wu, Qiuye
Zhao, Bo
Liu, Derong
2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021 : 59 - 63
[28] Optimal Synchronization Control of Heterogeneous Asymmetric Input-Constrained Unknown Nonlinear MASs via Reinforcement Learning
Xia, Lina
Li, Qing
Song, Ruizhuo
Modares, Hamidreza
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (03) : 520 - 532
[29] Online approximate solution of HJI equation for unknown constrained-input nonlinear continuous-time systems
Yang, Xiong
Liu, Derong
Ma, Hongwen
Xu, Yancai
INFORMATION SCIENCES, 2016, 328 : 435 - 454
[30] Adaptive Dynamic Programming for H∞ Control of Constrained-Input Nonlinear Systems
Yang, Xiong
Liu, Derong
Wei, Qinglai
Wang, Ding
2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015 : 3027 - 3032