Hybrid-Pursuit Strategies in Multiple Pursuer-Evader Games Using Reinforcement Learning

被引：0

作者：

Guan, Yacun ^{[1
,2
]}

Xu, Wang ^{[2
]}

Liu, Guohua ^{[1
]}

机构：

[1] Nankai Univ, Coll Elect Informat & Opt Engn, Tianjin 300350, Peoples R China

[2] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

基金：

中国国家自然科学基金;

关键词：

Training; Heuristic algorithms; Games; Image reconstruction; Safety; Optimization; Decoding; Collision avoidance; Vectors; Real-time systems; Multiple pursuer-evader; cooperative strategy; obstacle avoidance; reinforcement learning; EVASION GAME;

D O I：

10.1109/ACCESS.2024.3514706

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a comprehensive learning strategy for the collaborative pursuit of evaders by multiple pursuers in environments with dynamic obstacles. Utilizing a variational autoencoder framework for effective obstacle detection, we integrate the multiagent twin delayed deep deterministic policy gradient algorithm for training pursuers and the proximal policy optimization algorithm for evaders, forming a complete pursuit-evasion strategy. In addition to collaborative pursuit strategies, our approach incorporates scheme for individual pursuers to directly capture nearby evaders, enhancing the flexibility and robustness of the overall system. The reward mechanism of these hybrid-pursuit strategies is designed to balance cooperative and individual rewards, informed by the states of both agents and obstacles, to optimize overall performance. Simulation results demonstrate the efficacy of the proposed algorithm, achieving successful collaborative and individual pursuits as well as dynamic obstacle avoidance.

引用

页码：187709 / 187721

页数：13

共 50 条

[1] Decentralized strategy selection with learning automata for multiple pursuer-evader games
Givigi, Sidney N., Jr.
Schwartz, Howard M.
ADAPTIVE BEHAVIOR, 2014, 22 (04) : 221 - 234
[2] Multiple Pursuer Multiple Evader Differential Games
Garcia, Eloy
Casbeer, David W.
Von Moll, Alexander
Pachter, Meir
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (05) : 2345 - 2350
[3] Fuzzy Reinforcement Learning Algorithm for the Pursuit-Evasion Differential Games with Superior Evader
Al-Talabi, Ahmad A.
2017 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2017,
[4] An Approach to Multi-Agent Pursuit Evasion Games Using Reinforcement Learning
Bilgin, Ahmet Tunc
Kadioglu-Urtis, Esra
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2015, : 164 - 169
[5] Pursuit-evasion games of multiple cooperative pursuers and an evader: A biological-inspired perspective
Wang, Jianan
Li, Guilu
Liang, Li
Wang, Chunyan
Deng, Fang
COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2022, 110
[6] Decentralized Multi-Agent Pursuit Using Deep Reinforcement Learning
de Souza, Cristino, Jr.
Newbury, Rhys
Cosgun, Akansel
Castillo, Pedro
Vidolov, Boris
Kulic, Dana
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03): : 4552 - 4559
[7] Reinforcement learning-based formation-surrounding control for multiple quadrotor UAVs pursuit-evasion games
Xiong, Hang
Zhang, Ying
ISA TRANSACTIONS, 2024, 145 : 205 - 224
[8] Pursuit-Evasion Games for Multi-agent Based on Reinforcement Learning with Obstacles
Hu, Penglin
Guo, Yaning
Hu, Jinwen
Pan, Quan
PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1015 - 1024
[9] Safety-Aware Pursuit-Evasion Games in Unknown Environments Using Gaussian Processes and Finite-Time Convergent Reinforcement Learning
Kokolakis, Nikolaos-Marios T.
Vamvoudakis, Kyriakos G.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3130 - 3143
[10] Optimal Group Consensus of Multiagent Systems in Graphical Games Using Reinforcement Learning
Wang, Yuhan
Wang, Zhuping
Zhang, Hao
Yan, Huaicheng
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (03): : 2343 - 2353

← 1 2 3 4 5 →