Approximate Optimal Strategy for Multiagent System Pursuit-Evasion Game

被引：4

作者：

Xu, Zhiqiang ^{[1
]}

Yu, Dengxiu ^{[2
]}

Liu, Yan-Jun ^{[3
]}

Wang, Zhen ^{[2
]}

机构：

[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China

[2] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect, Xian 710072, Peoples R China

[3] Liaoning Univ Technol, Coll Sci, Jinzhou 121001, Peoples R China

来源：

IEEE SYSTEMS JOURNAL | 2024年 / 18卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Approximate optimal control; multiagent systems; pursuit-evasion games; reinforcement learning; CONSENSUS TRACKING;

D O I：

10.1109/JSYST.2024.3432796

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this article, we propose an approximate optimal control strategy for a class of nonlinear multiagent system pursuit-evasion games. Herein, multiple pursuers aim to capture multiple evaders trying to evade capture. Under the competitive framework, agents not only pursue their individual goals but also consider coordination with their teammates to achieve collective objectives. However, maintaining cohesion with teammates in existing distributed control methods has always been a challenge. To enhance team coordination, we employ a graph-theoretic approach to represent the relationships between agents. Based on this, we design a dynamic target graph algorithm to enhance the coordination among pursuers. The approximate optimal strategies for each agent are solved by utilizing the Hamilton-Jacobi-Isaacs equations of the system. As solving these equations becomes computationally intensive in multiagent scenarios, we propose a value-based single network adaptive critic network architecture. In addition, we consider scenarios where the numbers of agents on both sides are inconsistent and address the phenomenon of input saturation. Moreover, we provide sufficient conditions to prove the system's stability. Finally, simulations conducted in two representative scenarios, multiple-pursuer-one-evader and multiple-pursuer-multiple-evader, demonstrate the effectiveness of our proposed algorithm.

引用

页码：1669 / 1680

页数：12

共 50 条

[1]

Boron E., 2013, NONLINEAR FUNCTIONAL

[2] Distributed Differential Games for Control of Multi-Agent Systems [J].

Cappello, Domenico ;

Mylvaganam, Thulasi .

IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2022, 9 (02) :635-646

[3] Multi-player pursuit evasion games with one superior evader [J].

Chen, Jie ;

Zha, Wenzhong ;

Peng, Zhihong ;

Gu, Dongbing .

AUTOMATICA, 2016, 71 :24-32

[4] Adaptive Optimal Control via Q-Learning for Multi-Agent Pursuit-Evasion Games [J].

Dong, Xu ;

Zhang, Huaguang ;

Ming, Zhongyang .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (06) :3056-3060

[5]

Geering HansP., 2007, OPTIMAL CONTROL ENG, V113

[6] Pursuit-Evasion Game for Satellites Based on Continuous Thrust Reachable Domain [J].

Gong, Haoran ;

Gong, Shengping ;

Li, Junfeng .

IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2020, 56 (06) :4626-4637

[7]

Hoffman K., 2014, BANACH SPACES ANAL F

[8] AI World Cup: Robot-Soccer-Based Competitions [J].

Hong, Chansol ;

Jeong, Inbae ;

Vecchietti, Luiz Felipe ;

Har, Dongsoo ;

Kim, Jong-Hwan .

IEEE TRANSACTIONS ON GAMES, 2021, 13 (04) :330-341

[9] UNIVERSAL APPROXIMATION OF AN UNKNOWN MAPPING AND ITS DERIVATIVES USING MULTILAYER FEEDFORWARD NETWORKS [J].

HORNIK, K ;

STINCHCOMBE, M ;

WHITE, H .

NEURAL NETWORKS, 1990, 3 (05) :551-560

[10] Off-Policy Model-Free Learning for Multi-Player Non-Zero-Sum Games With Constrained Inputs [J].

Huo, Yu ;

Wang, Ding ;

Qiao, Junfei ;

Li, Menghua .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (02) :910-920

← 1 2 3 4 5 →