Approximate Optimal Strategy for Multiagent System Pursuit-Evasion Game

被引:5
作者
Xu, Zhiqiang [1 ]
Yu, Dengxiu [2 ]
Liu, Yan-Jun [3 ]
Wang, Zhen [2 ]
机构
[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect, Xian 710072, Peoples R China
[3] Liaoning Univ Technol, Coll Sci, Jinzhou 121001, Peoples R China
来源
IEEE SYSTEMS JOURNAL | 2024年 / 18卷 / 03期
基金
中国国家自然科学基金;
关键词
Approximate optimal control; multiagent systems; pursuit-evasion games; reinforcement learning; CONSENSUS TRACKING;
D O I
10.1109/JSYST.2024.3432796
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we propose an approximate optimal control strategy for a class of nonlinear multiagent system pursuit-evasion games. Herein, multiple pursuers aim to capture multiple evaders trying to evade capture. Under the competitive framework, agents not only pursue their individual goals but also consider coordination with their teammates to achieve collective objectives. However, maintaining cohesion with teammates in existing distributed control methods has always been a challenge. To enhance team coordination, we employ a graph-theoretic approach to represent the relationships between agents. Based on this, we design a dynamic target graph algorithm to enhance the coordination among pursuers. The approximate optimal strategies for each agent are solved by utilizing the Hamilton-Jacobi-Isaacs equations of the system. As solving these equations becomes computationally intensive in multiagent scenarios, we propose a value-based single network adaptive critic network architecture. In addition, we consider scenarios where the numbers of agents on both sides are inconsistent and address the phenomenon of input saturation. Moreover, we provide sufficient conditions to prove the system's stability. Finally, simulations conducted in two representative scenarios, multiple-pursuer-one-evader and multiple-pursuer-multiple-evader, demonstrate the effectiveness of our proposed algorithm.
引用
收藏
页码:1669 / 1680
页数:12
相关论文
共 50 条
[31]   Neuroadaptive Sliding Mode Formation Control of Autonomous Underwater Vehicles With Uncertain Dynamics [J].
Wang, Jinqiang ;
Wang, Cong ;
Wei, Yingjie ;
Zhang, Chengju .
IEEE SYSTEMS JOURNAL, 2020, 14 (03) :3325-3333
[32]   Discrete-Time Optimal Control via Local Policy Iteration Adaptive Dynamic Programming [J].
Wei, Qinglai ;
Liu, Derong ;
Lin, Qiao ;
Song, Ruizhuo .
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (10) :3367-3379
[33]   Neural Network Based Online Simultaneous Policy Update Algorithm for Solving the HJI Equation in Nonlinear H∞ Control [J].
Wu, Huai-Ning ;
Luo, Biao .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (12) :1884-1895
[34]   Reinforcement learning-based formation-surrounding control for multiple quadrotor UAVs pursuit-evasion games [J].
Xiong, Hang ;
Zhang, Ying .
ISA TRANSACTIONS, 2024, 145 :205-224
[35]   Adaptive discrete-time controller design with neural network for hypersonic flight vehicle via back-stepping [J].
Xu, Bin ;
Sun, Fuchun ;
Yang, Chenguang ;
Gao, Daoxiang ;
Ren, Jianxin .
INTERNATIONAL JOURNAL OF CONTROL, 2011, 84 (09) :1543-1552
[36]   Nonsingular Predefined Time Adaptive Dynamic Surface Control for Quantized Nonlinear Systems [J].
Xu, Hao ;
Yu, Dengxiu ;
Wang, Zhen ;
Cheong, Kang Hao ;
Chen, C. L. Philip .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (09) :5567-5579
[37]   Observer-Based Fuzzy Adaptive Predefined Time Control for Uncertain Nonlinear Systems With Full-State Error Constraints [J].
Xu, Hao ;
Yu, Dengxiu ;
Liu, Yan-Jun .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (03) :1370-1382
[38]   Multiplayer Pursuit-Evasion Differential Games With Malicious Pursuers [J].
Xu, Yuhang ;
Yang, Hao ;
Jiang, Bin ;
Polycarpou, Marios M. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (09) :4939-4946
[39]   Global consensus for discrete-time multi-agent systems with input saturation constraints [J].
Yang, Tao ;
Meng, Ziyang ;
Dimarogonas, Dimos V. ;
Johansson, Karl H. .
AUTOMATICA, 2014, 50 (02) :499-506
[40]   Adaptive Swarm Control Within Saturated Input Based on Nonlinear Coupling Degree [J].
Yu, Dengxiu ;
Long, Jia ;
Chen, C. L. Philip ;
Wang, Zhen .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (08) :4900-4911