Optimized tracking control using reinforcement learning and backstepping technique for canonical nonlinear unknown dynamic system

被引:1
|
作者
Song, Yanfen [1 ,2 ]
Li, Zijun [1 ,2 ]
Wen, Guoxing [2 ,3 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Sch Math & Stat, Jinan, Peoples R China
[2] Shandong Univ Aeronaut, Coll Sci, Binzhou, Peoples R China
[3] Shandong Univ Aeronaut, Coll Sci, Binzhou 256600, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
backstepping; identifier-critic-actor architecture; nonlinear canonical system; optimal control; reinforcement learning; CONTINUOUS-TIME; NEURAL-CONTROL; PERFORMANCE; ALGORITHM;
D O I
10.1002/oca.3115
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The work addresses the optimized tracking control problem by combining both reinforcement learning (RL) and backstepping technique for the canonical nonlinear unknown dynamic system. Since such dynamic system contains multiple state variables with differential relation, the backstepping technique is considered by making a virtual control sequence in accordance with Lyapunov functions. In the last backstepping step, the optimized actual control is derived by performing the RL under identifier-critic-actor structure, where RL is to overcome the difficulty coming from solving Hamilton-Jacobi-Bellman (HJB) equation. Different from the traditional RL optimizing methods that find the RL updating laws from the square of the HJB equation's approximation, this optimized control is to find the RL training laws from the negative gradient of a simple positive definite function, which is equivalent to the HJB equation. The result shows that this optimized control can obviously alleviate the algorithm complexity. Meanwhile, it can remove the requirement of known dynamic as well. Finally, theory and simulation indicate the feasibility of this optimized control. Executive process of the optimized backstepping control. image
引用
收藏
页码:1655 / 1671
页数:17
相关论文
共 50 条
  • [21] Reinforcement learning-based optimal control of unknown constrained-input nonlinear systems using simulated experience
    Asl, Hamed Jabbari
    Uchibe, Eiji
    NONLINEAR DYNAMICS, 2023, 111 (17) : 16093 - 16110
  • [22] Optimized Backstepping Combined With Dynamic Surface Technique for Single-Input-Single-Output Nonlinear Strict-Feedback System
    Wen, Guoxing
    Zhou, Ranran
    Zhao, Yanlong
    Niu, Ben
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (07): : 4210 - 4221
  • [23] Deep reinforcement learning based finite-horizon optimal tracking control for nonlinear system
    Kim, Jong Woo
    Park, Byung Jun
    Yoo, Haeun
    Lee, Jay H.
    Lee, Jong Min
    IFAC PAPERSONLINE, 2018, 51 (25): : 257 - 262
  • [24] Dynamic compensator-based near-optimal control for unknown nonaffine systems via integral reinforcement learning
    Lin, Jinquan
    Zhao, Bo
    Liu, Derong
    Wang, Yonghua
    NEUROCOMPUTING, 2024, 564
  • [25] Reinforcement learning based backstepping control of power system oscillations
    Karimi, Ali
    Eftekharnejad, Sara
    Feliachi, Ali
    ELECTRIC POWER SYSTEMS RESEARCH, 2009, 79 (11) : 1511 - 1520
  • [26] Optimal tracking control for completely unknown nonlinear discrete-time Markov jump systems using data-based reinforcement learning method
    Jiang, He
    Zhang, Huaguang
    Luo, Yanhong
    Wang, Junyi
    NEUROCOMPUTING, 2016, 194 : 176 - 182
  • [27] Cooperative control for swarming systems based on reinforcement learning in unknown dynamic environment
    Lan, Xuejing
    Liu, Yiwen
    Zhao, Zhijia
    NEUROCOMPUTING, 2020, 410 (410) : 410 - 418
  • [28] Robust backstepping output tracking control for SISO uncertain nonlinear systems with unknown virtual control coefficients
    Yu, Yao
    Zhong, Yi-Sheng
    INTERNATIONAL JOURNAL OF CONTROL, 2010, 83 (06) : 1182 - 1192
  • [29] Robust control for affine nonlinear system with unknown time-varying uncertainty under reinforcement learning framework
    Guo, Wenxin
    Qin, Weiwei
    Hu, Chen
    Liu, Jieyu
    IET CONTROL THEORY AND APPLICATIONS, 2023, 17 (17) : 2369 - 2377
  • [30] Simplified optimized control using reinforcement learning algorithm for a class of stochastic nonlinear systems
    Wen, Guoxing
    Chen, C. L. Philip
    Li, Wei Nian
    INFORMATION SCIENCES, 2020, 517 : 230 - 243