Game-Based Backstepping Design for Strict-Feedback Nonlinear Multi-Agent Systems Based on Reinforcement Learning

被引:28
|
作者
Long, Jia [1 ]
Yu, Dengxiu [1 ]
Wen, Guoxing [2 ,3 ]
Li, Li [4 ]
Wang, Zhen [5 ]
Chen, C. L. Philip [6 ,7 ]
机构
[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
[2] Binzhou Univ, Coll Sci, Binzhou 256600, Peoples R China
[3] Qilu Univ Technol, Sch Math & Stat, Jinan 250353, Peoples R China
[4] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Peoples R China
[5] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect, Xian 710072, Peoples R China
[6] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China
[7] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Games; Backstepping; Artificial neural networks; Nonlinear dynamical systems; Multi-agent systems; Mathematical models; Optimal control; Game-based backstepping; high-order multi-agent system; neural network (NN); reinforcement learning (RL); tracking game; SMOOTH TRANSITION; SWARM CONTROL; TRACKING; DYNAMICS;
D O I
10.1109/TNNLS.2022.3177461
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, the game-based backstepping control method is proposed for the high-order nonlinear multi-agent system with unknown dynamic and input saturation. Reinforcement learning (RL) is employed to get the saddle point solution of the tracking game between each agent and the reference signal for achieving robust control. Specifically, the approximate optimal solution of the established Hamilton-Jacobi-Isaacs (HJI) equation is obtained by policy iteration for each subsystem, and the single network adaptive critic (SNAC) architecture is used to reduce the computational burden. In addition, based on the separation operation of the error term from the derivative of the value function, we achieve the different proportions of the two agents in the game to realize the regulation of the final equilibrium point. Different from the general use of the neural network for system identification, the unknown nonlinear dynamic term is approximated based on the state difference obtained by the command filter. Furthermore, a sufficient condition is established to guarantee that the whole system and each subsystem included are uniformly ultimately bounded. Finally, simulation results are given to show the effectiveness of the proposed method.
引用
收藏
页码:817 / 830
页数:14
相关论文
共 50 条
  • [11] Adaptive neural containment seeking of stochastic nonlinear strict-feedback multi-agent systems
    Yang, Yang
    Miao, Songtao
    Yue, Dong
    Xu, Chuang
    Ye, Duo
    NEUROCOMPUTING, 2020, 400 (393-400) : 393 - 400
  • [12] Differential graphical game-based multi-agent tracking control using integral reinforcement learning
    Guo, Yaning
    Sun, Qi
    Wang, Yintao
    Pan, Quan
    IET CONTROL THEORY AND APPLICATIONS, 2024, 18 (18) : 2766 - 2776
  • [13] Backstepping Control Based on the Dynamic Equilibrium State Theory for Strict-Feedback Nonlinear Systems
    Qiu, Dehui
    Wang, Qinglin
    Yang, Jie
    ELECTRICAL INFORMATION AND MECHATRONICS AND APPLICATIONS, PTS 1 AND 2, 2012, 143-144 : 360 - +
  • [14] Output Feedback Fuzzy Optimal Consensus Tracking Control for Strict-Feedback Nonlinear Multi-agent Systems
    Yuan, Liang-En
    Xiao, Yang
    Li, Tieshan
    Gao, Xiaoyang
    Zhou, Dalin
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2025,
  • [15] Distributed adaptive containment control of uncertain nonlinear multi-agent systems in strict-feedback form
    Yoo, Sung Jin
    AUTOMATICA, 2013, 49 (07) : 2145 - 2153
  • [16] Reinforcement learning-based optimized output feedback control of nonlinear strict-feedback systems with event sampled states
    Xin, Chun
    Li, Yuan-Xin
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2023, 37 (01) : 38 - 58
  • [17] Observer-Based Neuro-Adaptive Optimized Control of Strict-Feedback Nonlinear Systems With State Constraints
    Li, Yongming
    Liu, Yanjun
    Tong, Shaocheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) : 3131 - 3145
  • [18] Free-will arbitrary time stabilisation problem for a class of nonlinear strict-feedback systems based on the backstepping method
    Aslmostafa, E.
    Mirzaei, M. J.
    Asadollahi, M.
    Badamchizadeh, M. A.
    INTERNATIONAL JOURNAL OF CONTROL, 2023, 96 (11) : 2749 - 2761
  • [19] Optimized backstepping tracking control using reinforcement learning for strict-feedback nonlinear systems with monotone tube performance boundaries
    Zhang, Gengning
    Wang, Xin
    Wang, Ziming
    Pang, Ning
    INTERNATIONAL JOURNAL OF CONTROL, 2025,
  • [20] Cooperative Semi-Global Output Regulation of Nonlinear Strict-Feedback Multi-Agent Systems With Nonidentical Relative Degrees
    Su, Youfeng
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (03) : 709 - 719