Hybrid-Pursuit Strategies in Multiple Pursuer-Evader Games Using Reinforcement Learning

Cited by: 0
Authors
Guan, Yacun [1, 2]
Xu, Wang [2 ]
Liu, Guohua [1 ]
Affiliations
[1] Nankai Univ, Coll Elect Informat & Opt Engn, Tianjin 300350, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Funding
National Natural Science Foundation of China;
Keywords
Training; Heuristic algorithms; Games; Image reconstruction; Safety; Optimization; Decoding; Collision avoidance; Vectors; Real-time systems; Multiple pursuer-evader; cooperative strategy; obstacle avoidance; reinforcement learning; EVASION GAME;
DOI
10.1109/ACCESS.2024.3514706
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
This paper presents a comprehensive learning strategy for the collaborative pursuit of evaders by multiple pursuers in environments with dynamic obstacles. Utilizing a variational autoencoder framework for effective obstacle detection, we integrate the multiagent twin delayed deep deterministic policy gradient algorithm for training pursuers and the proximal policy optimization algorithm for evaders, forming a complete pursuit-evasion strategy. In addition to collaborative pursuit strategies, our approach incorporates a scheme for individual pursuers to directly capture nearby evaders, enhancing the flexibility and robustness of the overall system. The reward mechanism of these hybrid-pursuit strategies is designed to balance cooperative and individual rewards, informed by the states of both agents and obstacles, to optimize overall performance. Simulation results demonstrate the efficacy of the proposed algorithm, achieving successful collaborative and individual pursuits as well as dynamic obstacle avoidance.
Pages: 187709 - 187721
Page count: 13
Related papers
50 in total
  • [1] Decentralized strategy selection with learning automata for multiple pursuer-evader games
    Givigi, Sidney N., Jr.
    Schwartz, Howard M.
    ADAPTIVE BEHAVIOR, 2014, 22 (04) : 221 - 234
  • [2] Multiple Pursuer Multiple Evader Differential Games
    Garcia, Eloy
    Casbeer, David W.
    Von Moll, Alexander
    Pachter, Meir
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (05) : 2345 - 2350
  • [3] Fuzzy Reinforcement Learning Algorithm for the Pursuit-Evasion Differential Games with Superior Evader
    Al-Talabi, Ahmad A.
    2017 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2017,
  • [4] An Approach to Multi-Agent Pursuit Evasion Games Using Reinforcement Learning
    Bilgin, Ahmet Tunc
    Kadioglu-Urtis, Esra
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2015, : 164 - 169
  • [5] Pursuit-evasion games of multiple cooperative pursuers and an evader: A biological-inspired perspective
    Wang, Jianan
    Li, Guilu
    Liang, Li
    Wang, Chunyan
    Deng, Fang
    COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2022, 110
  • [6] Decentralized Multi-Agent Pursuit Using Deep Reinforcement Learning
    de Souza, Cristino, Jr.
    Newbury, Rhys
    Cosgun, Akansel
    Castillo, Pedro
    Vidolov, Boris
    Kulic, Dana
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03) : 4552 - 4559
  • [7] Reinforcement learning-based formation-surrounding control for multiple quadrotor UAVs pursuit-evasion games
    Xiong, Hang
    Zhang, Ying
    ISA TRANSACTIONS, 2024, 145 : 205 - 224
  • [8] Pursuit-Evasion Games for Multi-agent Based on Reinforcement Learning with Obstacles
    Hu, Penglin
    Guo, Yaning
    Hu, Jinwen
    Pan, Quan
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1015 - 1024
  • [9] Safety-Aware Pursuit-Evasion Games in Unknown Environments Using Gaussian Processes and Finite-Time Convergent Reinforcement Learning
    Kokolakis, Nikolaos-Marios T.
    Vamvoudakis, Kyriakos G.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3130 - 3143
  • [10] Optimal Group Consensus of Multiagent Systems in Graphical Games Using Reinforcement Learning
    Wang, Yuhan
    Wang, Zhuping
    Zhang, Hao
    Yan, Huaicheng
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (03) : 2343 - 2353