Game-Based Backstepping Design for Strict-Feedback Nonlinear Multi-Agent Systems Based on Reinforcement Learning

Cited by: 28

Authors:

Long, Jia [1]

Yu, Dengxiu [1]

Wen, Guoxing [2,3]

Li, Li [4]

Wang, Zhen [5]

Chen, C. L. Philip [6,7]

Affiliations:

[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China

[2] Binzhou Univ, Coll Sci, Binzhou 256600, Peoples R China

[3] Qilu Univ Technol, Sch Math & Stat, Jinan 250353, Peoples R China

[4] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Peoples R China

[5] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect, Xian 710072, Peoples R China

[6] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510641, Guangdong, Peoples R China

[7] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024 / Vol. 35 / No. 01

Funding:

National Natural Science Foundation of China; China Postdoctoral Science Foundation;

Keywords:

Games; Backstepping; Artificial neural networks; Nonlinear dynamical systems; Multi-agent systems; Mathematical models; Optimal control; Game-based backstepping; high-order multi-agent system; neural network (NN); reinforcement learning (RL); tracking game; SMOOTH TRANSITION; SWARM CONTROL; TRACKING; DYNAMICS;

DOI:

10.1109/TNNLS.2022.3177461

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this article, a game-based backstepping control method is proposed for high-order nonlinear multi-agent systems with unknown dynamics and input saturation. Reinforcement learning (RL) is employed to obtain the saddle-point solution of the tracking game between each agent and the reference signal, thereby achieving robust control. Specifically, the approximate optimal solution of the established Hamilton-Jacobi-Isaacs (HJI) equation is obtained by policy iteration for each subsystem, and the single network adaptive critic (SNAC) architecture is used to reduce the computational burden. In addition, by separating the error term from the derivative of the value function, the two players in the game can be weighted differently, which allows the final equilibrium point to be regulated. Unlike the common use of a neural network for system identification, the unknown nonlinear dynamic term is approximated from the state difference obtained by the command filter. Furthermore, a sufficient condition is established to guarantee that the overall system and each of its subsystems are uniformly ultimately bounded. Finally, simulation results are given to show the effectiveness of the proposed method.

Pages: 817 - 830

Page count: 14

50 items in total

[1] Optimized Backstepping Consensus Control Using Reinforcement Learning for a Class of Nonlinear Strict-Feedback-Dynamic Multi-Agent Systems
Wen, Guoxing
Chen, C. L. Philip
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (03) : 1524 - 1536
[2] Game-based Optimized Backstepping Control for Strict-feedback Systems With Input Constraints
Zhang, Liuliu
Jing, Hailong
Qian, Cheng
Hua, Changchun
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 22 (08) : 2472 - 2482
[3] Optimized Backstepping Tracking Control Using Reinforcement Learning for a Class of Stochastic Nonlinear Strict-Feedback Systems
Wen, Guoxing
Xu, Liguang
Li, Bin
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (03) : 1291 - 1303
[4] Adaptive Tracking Control for Perturbed Strict-Feedback Nonlinear Systems Based on Optimized Backstepping Technique
Liu, Yongchao
Zhu, Qidan
Wen, Guoxing
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (02) : 853 - 865
[5] Optimized Backstepping Control Using Reinforcement Learning of Observer-Critic-Actor Architecture Based on Fuzzy System for a Class of Nonlinear Strict-Feedback Systems
Wen, Guoxing
Li, Bin
Niu, Ben
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (10) : 4322 - 4335
[6] Reinforcement learning-based optimized backstepping control for strict-feedback nonlinear systems subject to external disturbances
Qin, Yan
Cao, Liang
Lu, Qing
Pan, Yingnan
OPTIMAL CONTROL APPLICATIONS & METHODS, 2023, 44 (05) : 2724 - 2743
[7] Command Filter and Universal Approximator Based Backstepping Control Design for Strict-Feedback Nonlinear Systems With Uncertainty
Zheng, Xiaolong
Yang, Xuebo
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (03) : 1310 - 1317
[8] Simplified Optimized Backstepping Control for a Class of Nonlinear Strict-Feedback Systems With Unknown Dynamic Functions
Wen, Guoxing
Chen, C. L. Philip
Ge, Shuzhi Sam
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (09) : 4567 - 4580
[9] Game-based coordination control of multi-agent systems
Zhou, Liqi
Zheng, Yuanshi
Zhao, Qi
Xiao, Feng
Zhang, Yuling
SYSTEMS & CONTROL LETTERS, 2022, 169
[10] Output Feedback-Based Neural Adaptive Finite-Time Containment Control of Non-Strict Feedback Nonlinear Multi-Agent Systems
Zhao, Lin
Chen, Xiao
Yu, Jinpeng
Shi, Peng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (02) : 847 - 858