Optimized tracking control using reinforcement learning and backstepping technique for canonical nonlinear unknown dynamic system

被引：1

作者：

Song, Yanfen ^{[1
,2
]}

Li, Zijun ^{[1
,2
]}

Wen, Guoxing ^{[2
,3
]}

机构：

[1] Qilu Univ Technol, Shandong Acad Sci, Sch Math & Stat, Jinan, Peoples R China

[2] Shandong Univ Aeronaut, Coll Sci, Binzhou, Peoples R China

[3] Shandong Univ Aeronaut, Coll Sci, Binzhou 256600, Shandong, Peoples R China

来源：

OPTIMAL CONTROL APPLICATIONS & METHODS | 2024年 / 45卷 / 04期

基金：

中国国家自然科学基金;

关键词：

backstepping; identifier-critic-actor architecture; nonlinear canonical system; optimal control; reinforcement learning; CONTINUOUS-TIME; NEURAL-CONTROL; PERFORMANCE; ALGORITHM;

D O I：

10.1002/oca.3115

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The work addresses the optimized tracking control problem by combining both reinforcement learning (RL) and backstepping technique for the canonical nonlinear unknown dynamic system. Since such dynamic system contains multiple state variables with differential relation, the backstepping technique is considered by making a virtual control sequence in accordance with Lyapunov functions. In the last backstepping step, the optimized actual control is derived by performing the RL under identifier-critic-actor structure, where RL is to overcome the difficulty coming from solving Hamilton-Jacobi-Bellman (HJB) equation. Different from the traditional RL optimizing methods that find the RL updating laws from the square of the HJB equation's approximation, this optimized control is to find the RL training laws from the negative gradient of a simple positive definite function, which is equivalent to the HJB equation. The result shows that this optimized control can obviously alleviate the algorithm complexity. Meanwhile, it can remove the requirement of known dynamic as well. Finally, theory and simulation indicate the feasibility of this optimized control. Executive process of the optimized backstepping control. image

引用

页码：1655 / 1671

页数：17

共 50 条

[21] Reinforcement learning-based optimal control of unknown constrained-input nonlinear systems using simulated experience
Asl, Hamed Jabbari
Uchibe, Eiji
NONLINEAR DYNAMICS, 2023, 111 (17) : 16093 - 16110
[22] Optimized Backstepping Combined With Dynamic Surface Technique for Single-Input-Single-Output Nonlinear Strict-Feedback System
Wen, Guoxing
Zhou, Ranran
Zhao, Yanlong
Niu, Ben
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (07): : 4210 - 4221
[23] Deep reinforcement learning based finite-horizon optimal tracking control for nonlinear system
Kim, Jong Woo
Park, Byung Jun
Yoo, Haeun
Lee, Jay H.
Lee, Jong Min
IFAC PAPERSONLINE, 2018, 51 (25): : 257 - 262
[24] Dynamic compensator-based near-optimal control for unknown nonaffine systems via integral reinforcement learning
Lin, Jinquan
Zhao, Bo
Liu, Derong
Wang, Yonghua
NEUROCOMPUTING, 2024, 564
[25] Reinforcement learning based backstepping control of power system oscillations
Karimi, Ali
Eftekharnejad, Sara
Feliachi, Ali
ELECTRIC POWER SYSTEMS RESEARCH, 2009, 79 (11) : 1511 - 1520
[26] Optimal tracking control for completely unknown nonlinear discrete-time Markov jump systems using data-based reinforcement learning method
Jiang, He
Zhang, Huaguang
Luo, Yanhong
Wang, Junyi
NEUROCOMPUTING, 2016, 194 : 176 - 182
[27] Cooperative control for swarming systems based on reinforcement learning in unknown dynamic environment
Lan, Xuejing
Liu, Yiwen
Zhao, Zhijia
NEUROCOMPUTING, 2020, 410 (410) : 410 - 418
[28] Robust backstepping output tracking control for SISO uncertain nonlinear systems with unknown virtual control coefficients
Yu, Yao
Zhong, Yi-Sheng
INTERNATIONAL JOURNAL OF CONTROL, 2010, 83 (06) : 1182 - 1192
[29] Robust control for affine nonlinear system with unknown time-varying uncertainty under reinforcement learning framework
Guo, Wenxin
Qin, Weiwei
Hu, Chen
Liu, Jieyu
IET CONTROL THEORY AND APPLICATIONS, 2023, 17 (17) : 2369 - 2377
[30] Simplified optimized control using reinforcement learning algorithm for a class of stochastic nonlinear systems
Wen, Guoxing
Chen, C. L. Philip
Li, Wei Nian
INFORMATION SCIENCES, 2020, 517 : 230 - 243

← 1 2 3 4 5 →