Optimized tracking control using reinforcement learning and backstepping technique for canonical nonlinear unknown dynamic system

Cited by: 1
Authors
Song, Yanfen [1, 2]
Li, Zijun [1, 2]
Wen, Guoxing [2, 3]
Affiliations
[1] Qilu Univ Technol, Shandong Acad Sci, Sch Math & Stat, Jinan, Peoples R China
[2] Shandong Univ Aeronaut, Coll Sci, Binzhou, Peoples R China
[3] Shandong Univ Aeronaut, Coll Sci, Binzhou 256600, Shandong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
backstepping; identifier-critic-actor architecture; nonlinear canonical system; optimal control; reinforcement learning; CONTINUOUS-TIME; NEURAL-CONTROL; PERFORMANCE; ALGORITHM;
DOI
10.1002/oca.3115
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This work addresses the optimized tracking control problem for canonical nonlinear systems with unknown dynamics by combining reinforcement learning (RL) with the backstepping technique. Because such a system contains multiple state variables linked by differential relations, backstepping is applied by constructing a sequence of virtual controls in accordance with Lyapunov functions. In the last backstepping step, the optimized actual control is derived by performing RL under an identifier-critic-actor structure, where RL is used to overcome the difficulty of solving the Hamilton-Jacobi-Bellman (HJB) equation. Unlike traditional RL optimization methods, which derive the RL updating laws from the square of the HJB equation's approximation, this optimized control derives the RL training laws from the negative gradient of a simple positive definite function that is equivalent to the HJB equation. The results show that this optimized control clearly reduces algorithmic complexity and also removes the requirement of known dynamics. Finally, theory and simulation indicate the feasibility of this optimized control.
Graphical abstract: Executive process of the optimized backstepping control.
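As a rough illustration of the backstepping part of the abstract only, the sketch below tracks a reference with a second-order canonical system (x1' = x2, x2' = f(x) + u) using one virtual control and a Lyapunov-based actual control. It is not the paper's method: the drift term `f`, the gains `k1` and `k2`, the reference sin(t), and the forward-Euler integration are all assumptions made for this example, and `f` is treated as known, whereas the paper uses an identifier-critic-actor RL scheme precisely to avoid that requirement.

```python
import math


def f(x1, x2):
    # Hypothetical drift term, ASSUMED KNOWN here (the paper treats it as unknown).
    return -math.sin(x1) - 0.5 * x2


def backstepping_control(t, x1, x2, k1=5.0, k2=5.0):
    # Reference signal and its first two derivatives (assumed: y_r = sin t).
    yr, dyr, ddyr = math.sin(t), math.cos(t), -math.sin(t)
    # Step 1: tracking error and virtual control for the x1-subsystem.
    z1 = x1 - yr
    alpha1 = -k1 * z1 + dyr
    # Step 2: error between x2 and the virtual control, then the actual control.
    z2 = x2 - alpha1
    dalpha1 = -k1 * (x2 - dyr) + ddyr  # time derivative of alpha1
    # With V = (z1^2 + z2^2) / 2 this choice gives V' = -k1*z1^2 - k2*z2^2.
    u = -f(x1, x2) - z1 - k2 * z2 + dalpha1
    return u


def simulate(T=10.0, dt=1e-3):
    # Forward-Euler simulation of the closed loop from x(0) = (1, 0).
    x1, x2 = 1.0, 0.0
    steps = int(round(T / dt))
    initial_error = abs(x1 - math.sin(0.0))
    for k in range(steps):
        t = k * dt
        u = backstepping_control(t, x1, x2)
        dx1 = x2
        dx2 = f(x1, x2) + u
        x1 += dt * dx1
        x2 += dt * dx2
    final_error = abs(x1 - math.sin(steps * dt))
    return initial_error, final_error


if __name__ == "__main__":
    e0, eT = simulate()
    print(f"initial |x1 - y_r| = {e0:.3f}, final |x1 - y_r| = {eT:.2e}")
```

In the paper's setting, the last-step control would instead come from actor and critic networks trained online with an identifier for the unknown dynamics. This sketch shows only the recursive virtual-control structure that the abstract refers to.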
Pages: 1655 - 1671
Page count: 17
Related papers
50 in total
  • [31] Control System Design for Dynamic Positioning Ships Using Nonlinear Passive Observer Backstepping
    Xie, Dengfeng
    Jia, Baozhu
    Ren, Yafei
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018 : 4221 - 4226
  • [32] GLOBALLY DECENTRALIZED ADAPTIVE BACKSTEPPING NEURAL NETWORK TRACKING CONTROL FOR UNKNOWN NONLINEAR INTERCONNECTED SYSTEMS
    Chen, Weisheng
    Li, Junmin
    ASIAN JOURNAL OF CONTROL, 2010, 12 (01) : 96 - 102
  • [33] Adaptive Neural Network Optimal Backstepping Control of Strict Feedback Nonlinear Systems via Reinforcement Learning
    Zhong, Mei
    Cao, Jinde
    Liu, Heng
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01) : 832 - 847
  • [34] Adaptive Fault-Tolerant Tracking Control for Affine Nonlinear Systems With Unknown Dynamics via Reinforcement Learning
    Roshanravan, Sajad
    Shamaghdari, Saeed
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (01) : 569 - 580
  • [35] Backstepping Command Filter Control for Electromechanical Servo Systems with Unknown Dynamics Based on Reinforcement Learning
    Xu, Chenchen
    Hu, Jian
    Wang, Jiong
    Deng, Wenxiang
    Yao, Jianyong
    Zhao, Xiaoli
    ACTUATORS, 2025, 14 (03)
  • [36] Reinforcement learning-based optimal control of unknown constrained-input nonlinear systems using simulated experience
    Hamed Jabbari Asl
    Eiji Uchibe
    Nonlinear Dynamics, 2023, 111 : 16093 - 16110
  • [37] Neuro-optimal control for a class of unknown nonlinear dynamic systems using SN-DHP technique
    Wang, Ding
    Liu, Derong
    NEUROCOMPUTING, 2013, 121 : 218 - 225
  • [38] Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning
    Yang, Xiong
    Liu, Derong
    Wang, Ding
    Wei, Qinglai
    NEURAL NETWORKS, 2014, 55 : 30 - 41
  • [39] Optimized Inverse Dead-Zone Control Using Reinforcement Learning for a Class of Nonlinear Systems
    Sun, Wenxia
    Ma, Shuaihua
    Li, Bin
    Wen, Guoxing
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2024, 38 (12) : 3855 - 3864
  • [40] Adaptive Neural Network Optimized Control Using Reinforcement Learning of Critic-Actor Architecture for a Class of Non-Affine Nonlinear Systems
    Yang, Xue
    Li, Bin
    Wen, Guoxing
    IEEE ACCESS, 2021, 9 : 141758 - 141765