Hybrid-Pursuit Strategies in Multiple Pursuer-Evader Games Using Reinforcement Learning

被引：0

作者：

Guan, Yacun ^{[1
,2
]}

Xu, Wang ^{[2
]}

Liu, Guohua ^{[1
]}

机构：

[1] Nankai Univ, Coll Elect Informat & Opt Engn, Tianjin 300350, Peoples R China

[2] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

基金：

中国国家自然科学基金;

关键词：

Training; Heuristic algorithms; Games; Image reconstruction; Safety; Optimization; Decoding; Collision avoidance; Vectors; Real-time systems; Multiple pursuer-evader; cooperative strategy; obstacle avoidance; reinforcement learning; EVASION GAME;

D O I：

10.1109/ACCESS.2024.3514706

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a comprehensive learning strategy for the collaborative pursuit of evaders by multiple pursuers in environments with dynamic obstacles. Utilizing a variational autoencoder framework for effective obstacle detection, we integrate the multiagent twin delayed deep deterministic policy gradient algorithm for training pursuers and the proximal policy optimization algorithm for evaders, forming a complete pursuit-evasion strategy. In addition to collaborative pursuit strategies, our approach incorporates scheme for individual pursuers to directly capture nearby evaders, enhancing the flexibility and robustness of the overall system. The reward mechanism of these hybrid-pursuit strategies is designed to balance cooperative and individual rewards, informed by the states of both agents and obstacles, to optimize overall performance. Simulation results demonstrate the efficacy of the proposed algorithm, achieving successful collaborative and individual pursuits as well as dynamic obstacle avoidance.

引用

页码：187709 / 187721

页数：13

共 50 条

[31] Learning Strategies of Inductive Logic Programming Using Reinforcement Learning
Isobe, Takeru
Inoue, Katsumi
INDUCTIVE LOGIC PROGRAMMING, ILP 2023, 2023, 14363 : 46 - 61
[32] Decision Making in Monopoly Using a Hybrid Deep Reinforcement Learning Approach
Bonjour, Trevor
Haliem, Marina
Alsalem, Aala
Thomas, Shilpa
Li, Hongyu
Aggarwal, Vaneet
Kejriwal, Mayank
Bhargava, Bharat
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (06): : 1335 - 1344
[33] Optimizing Reinforcement Learning Agents in Games Using Curriculum Learning and Reward Shaping
Khan, Adil
Muhammad, Muhammad
Naeem, Muhammad
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2025, 36 (01)
[34] Integrating Multiple Policies for Person-Following Robot Training Using Deep Reinforcement Learning
Dewa, Chandra Kusuma
Miura, Jun
IEEE ACCESS, 2021, 9 : 75526 - 75541
[35] Convergent multiple-timescales reinforcement learning algorithms in normal form games
Leslie, DS
Collins, EJ
ANNALS OF APPLIED PROBABILITY, 2003, 13 (04) : 1231 - 1251
[36] Adapting attackers and defenders patrolling strategies: A reinforcement learning approach for Stackelberg security games
Trejo, Kristal K.
Clempner, Julio B.
Poznyak, Alexander S.
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2018, 95 : 35 - 54
[37] Differential evolution with hybrid parameters and mutation strategies based on reinforcement learning
Tan, Zhiping
Tang, Yu
Li, Kangshun
Huang, Huasheng
Luo, Shaoming
SWARM AND EVOLUTIONARY COMPUTATION, 2022, 75
[38] Pursuit-evasion game with online planning using deep reinforcement learning
Chen, Yong
Shi, Yu
Dai, Xunhua
Meng, Qing
Yu, Tao
APPLIED INTELLIGENCE, 2025, 55 (06)
[39] State of the Art Control of Atari Games Using Shallow Reinforcement Learning
Liang, Yitao
Machado, Marlos C.
Talvitie, Erik
Bowling, Michael
AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 485 - 493
[40] Orbital Interception Pursuit Strategy for Random Evasion Using Deep Reinforcement Learning
Jiang, Rui
Ye, Dong
Xiao, Yan
Sun, Zhaowei
Zhang, Zeming
SPACE: SCIENCE & TECHNOLOGY, 2023, 3

← 1 2 3 4 5 →