Optimized tracking control using reinforcement learning and backstepping technique for canonical nonlinear unknown dynamic system

被引:1
|
作者
Song, Yanfen [1 ,2 ]
Li, Zijun [1 ,2 ]
Wen, Guoxing [2 ,3 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Sch Math & Stat, Jinan, Peoples R China
[2] Shandong Univ Aeronaut, Coll Sci, Binzhou, Peoples R China
[3] Shandong Univ Aeronaut, Coll Sci, Binzhou 256600, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
backstepping; identifier-critic-actor architecture; nonlinear canonical system; optimal control; reinforcement learning; CONTINUOUS-TIME; NEURAL-CONTROL; PERFORMANCE; ALGORITHM;
D O I
10.1002/oca.3115
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The work addresses the optimized tracking control problem by combining both reinforcement learning (RL) and backstepping technique for the canonical nonlinear unknown dynamic system. Since such dynamic system contains multiple state variables with differential relation, the backstepping technique is considered by making a virtual control sequence in accordance with Lyapunov functions. In the last backstepping step, the optimized actual control is derived by performing the RL under identifier-critic-actor structure, where RL is to overcome the difficulty coming from solving Hamilton-Jacobi-Bellman (HJB) equation. Different from the traditional RL optimizing methods that find the RL updating laws from the square of the HJB equation's approximation, this optimized control is to find the RL training laws from the negative gradient of a simple positive definite function, which is equivalent to the HJB equation. The result shows that this optimized control can obviously alleviate the algorithm complexity. Meanwhile, it can remove the requirement of known dynamic as well. Finally, theory and simulation indicate the feasibility of this optimized control. Executive process of the optimized backstepping control. image
引用
收藏
页码:1655 / 1671
页数:17
相关论文
共 50 条
  • [1] Optimized tracking control based on reinforcement learning for a class of high-order unknown nonlinear dynamic systems
    Wen, Guoxing
    Niu, Ben
    INFORMATION SCIENCES, 2022, 606 : 368 - 379
  • [2] Optimized Backstepping Tracking Control Using Reinforcement Learning for a Class of Stochastic Nonlinear Strict-Feedback Systems
    Wen, Guoxing
    Xu, Liguang
    Li, Bin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (03) : 1291 - 1303
  • [3] Optimized Backstepping Tracking Control Using Reinforcement Learning for Quadrotor Unmanned Aerial Vehicle System
    Wen, Guoxing
    Hao, Wei
    Feng, Weiwei
    Gao, Kaizhou
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (08): : 5004 - 5015
  • [4] Reinforcement learning-based optimized backstepping control of nonlinear strict feedback system with unknown control gain function
    Zhou, Ranran
    Wen, Guoxing
    Li, Bin
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2022, 43 (05) : 1358 - 1378
  • [5] Optimized tracking control using reinforcement learning strategy for a class of nonlinear systems
    Yang, Xue
    Li, Bin
    ASIAN JOURNAL OF CONTROL, 2023, 25 (03) : 2095 - 2104
  • [6] Simplified Optimized Backstepping Control for a Class of Nonlinear Strict-Feedback Systems With Unknown Dynamic Functions
    Wen, Guoxing
    Chen, C. L. Philip
    Ge, Shuzhi Sam
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (09) : 4567 - 4580
  • [7] Optimized Backstepping Consensus Control Using Reinforcement Learning for a Class of Nonlinear Strict-Feedback-Dynamic Multi-Agent Systems
    Wen, Guoxing
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (03) : 1524 - 1536
  • [8] Approximate Optimized Backstepping Control of Uncertain Fractional-Order Nonlinear Systems Based on Reinforcement Learning
    Li, Dongdong
    Dong, Jiuxiang
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (11): : 6723 - 6732
  • [9] Optimized Leader-Follower Consensus Control of Multi-QUAV Attitude System Using Reinforcement Learning and Backstepping
    Wen, Guoxing
    Song, Yanfen
    Li, Zijun
    Li, Bin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (02): : 1469 - 1479
  • [10] Optimized Formation Control Using Simplified Reinforcement Learning for a Class of Multiagent Systems With Unknown Dynamics
    Wen, Guoxing
    Chen, C. L. Philip
    Li, Bin
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (09) : 7879 - 7888