Optimized tracking control using reinforcement learning and backstepping technique for canonical nonlinear unknown dynamic system

被引：1

作者：

Song, Yanfen ^{[1
,2
]}

Li, Zijun ^{[1
,2
]}

Wen, Guoxing ^{[2
,3
]}

机构：

[1] Qilu Univ Technol, Shandong Acad Sci, Sch Math & Stat, Jinan, Peoples R China

[2] Shandong Univ Aeronaut, Coll Sci, Binzhou, Peoples R China

[3] Shandong Univ Aeronaut, Coll Sci, Binzhou 256600, Shandong, Peoples R China

来源：

OPTIMAL CONTROL APPLICATIONS & METHODS | 2024年 / 45卷 / 04期

基金：

中国国家自然科学基金;

关键词：

backstepping; identifier-critic-actor architecture; nonlinear canonical system; optimal control; reinforcement learning; CONTINUOUS-TIME; NEURAL-CONTROL; PERFORMANCE; ALGORITHM;

D O I：

10.1002/oca.3115

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The work addresses the optimized tracking control problem by combining both reinforcement learning (RL) and backstepping technique for the canonical nonlinear unknown dynamic system. Since such dynamic system contains multiple state variables with differential relation, the backstepping technique is considered by making a virtual control sequence in accordance with Lyapunov functions. In the last backstepping step, the optimized actual control is derived by performing the RL under identifier-critic-actor structure, where RL is to overcome the difficulty coming from solving Hamilton-Jacobi-Bellman (HJB) equation. Different from the traditional RL optimizing methods that find the RL updating laws from the square of the HJB equation's approximation, this optimized control is to find the RL training laws from the negative gradient of a simple positive definite function, which is equivalent to the HJB equation. The result shows that this optimized control can obviously alleviate the algorithm complexity. Meanwhile, it can remove the requirement of known dynamic as well. Finally, theory and simulation indicate the feasibility of this optimized control. Executive process of the optimized backstepping control. image

引用

页码：1655 / 1671

页数：17

共 50 条

[31] Control System Design for Dynamic Positioning Ships Using Nonlinear Passive Observer Backstepping
Xie, Dengfeng
Jia, Baozhu
Ren, Yafei
2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 4221 - 4226
[32] GLOBALLY DECENTRALIZED ADAPTIVE BACKSTEPPING NEURAL NETWORK TRACKING CONTROL FOR UNKNOWN NONLINEAR INTERCONNECTED SYSTEMS
Chen, Weisheng
Li, Junmin
ASIAN JOURNAL OF CONTROL, 2010, 12 (01) : 96 - 102
[33] Adaptive Neural Network Optimal Backstepping Control of Strict Feedback Nonlinear Systems via Reinforcement Learning
Zhong, Mei
Cao, Jinde
Liu, Heng
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 832 - 847
[34] Adaptive Fault-Tolerant Tracking Control for Affine Nonlinear Systems With Unknown Dynamics via Reinforcement Learning
Roshanravan, Sajad
Shamaghdari, Saeed
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (01) : 569 - 580
[35] Backstepping Command Filter Control for Electromechanical Servo Systems with Unknown Dynamics Based on Reinforcement Learning
Xu, Chenchen
Hu, Jian
Wang, Jiong
Deng, Wenxiang
Yao, Jianyong
Zhao, Xiaoli
ACTUATORS, 2025, 14 (03)
[36] Reinforcement learning-based optimal control of unknown constrained-input nonlinear systems using simulated experience
Hamed Jabbari Asl
Eiji Uchibe
Nonlinear Dynamics, 2023, 111 : 16093 - 16110
[37] Neuro-optimal control for a class of unknown nonlinear dynamic systems using SN-DHP technique
Wang, Ding
Liu, Derong
NEUROCOMPUTING, 2013, 121 : 218 - 225
[38] Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning
Yang, Xiong
Liu, Derong
Wang, Ding
Wei, Qinglai
NEURAL NETWORKS, 2014, 55 : 30 - 41
[39] Optimized Inverse Dead-Zone Control Using Reinforcement Learning for a Class of Nonlinear Systems
Sun, Wenxia
Ma, Shuaihua
Li, Bin
Wen, Guoxing
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2024, 38 (12) : 3855 - 3864
[40] Adaptive Neural Network Optimized Control Using Reinforcement Learning of Critic-Actor Architecture for a Class of Non-Affine Nonlinear Systems
Yang, Xue
Li, Bin
Wen, Guoxing
IEEE ACCESS, 2021, 9 : 141758 - 141765

← 1 2 3 4 5 →