Hybrid-Pursuit Strategies in Multiple Pursuer-Evader Games Using Reinforcement Learning

被引:0
作者
Guan, Yacun [1 ,2 ]
Xu, Wang [2 ]
Liu, Guohua [1 ]
机构
[1] Nankai Univ, Coll Elect Informat & Opt Engn, Tianjin 300350, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
中国国家自然科学基金;
关键词
Training; Heuristic algorithms; Games; Image reconstruction; Safety; Optimization; Decoding; Collision avoidance; Vectors; Real-time systems; Multiple pursuer-evader; cooperative strategy; obstacle avoidance; reinforcement learning; EVASION GAME;
D O I
10.1109/ACCESS.2024.3514706
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a comprehensive learning strategy for the collaborative pursuit of evaders by multiple pursuers in environments with dynamic obstacles. Utilizing a variational autoencoder framework for effective obstacle detection, we integrate the multiagent twin delayed deep deterministic policy gradient algorithm for training pursuers and the proximal policy optimization algorithm for evaders, forming a complete pursuit-evasion strategy. In addition to collaborative pursuit strategies, our approach incorporates scheme for individual pursuers to directly capture nearby evaders, enhancing the flexibility and robustness of the overall system. The reward mechanism of these hybrid-pursuit strategies is designed to balance cooperative and individual rewards, informed by the states of both agents and obstacles, to optimize overall performance. Simulation results demonstrate the efficacy of the proposed algorithm, achieving successful collaborative and individual pursuits as well as dynamic obstacle avoidance.
引用
收藏
页码:187709 / 187721
页数:13
相关论文
共 50 条
  • [31] Learning Strategies of Inductive Logic Programming Using Reinforcement Learning
    Isobe, Takeru
    Inoue, Katsumi
    INDUCTIVE LOGIC PROGRAMMING, ILP 2023, 2023, 14363 : 46 - 61
  • [32] Decision Making in Monopoly Using a Hybrid Deep Reinforcement Learning Approach
    Bonjour, Trevor
    Haliem, Marina
    Alsalem, Aala
    Thomas, Shilpa
    Li, Hongyu
    Aggarwal, Vaneet
    Kejriwal, Mayank
    Bhargava, Bharat
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (06): : 1335 - 1344
  • [33] Optimizing Reinforcement Learning Agents in Games Using Curriculum Learning and Reward Shaping
    Khan, Adil
    Muhammad, Muhammad
    Naeem, Muhammad
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2025, 36 (01)
  • [34] Integrating Multiple Policies for Person-Following Robot Training Using Deep Reinforcement Learning
    Dewa, Chandra Kusuma
    Miura, Jun
    IEEE ACCESS, 2021, 9 : 75526 - 75541
  • [35] Convergent multiple-timescales reinforcement learning algorithms in normal form games
    Leslie, DS
    Collins, EJ
    ANNALS OF APPLIED PROBABILITY, 2003, 13 (04) : 1231 - 1251
  • [36] Adapting attackers and defenders patrolling strategies: A reinforcement learning approach for Stackelberg security games
    Trejo, Kristal K.
    Clempner, Julio B.
    Poznyak, Alexander S.
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2018, 95 : 35 - 54
  • [37] Differential evolution with hybrid parameters and mutation strategies based on reinforcement learning
    Tan, Zhiping
    Tang, Yu
    Li, Kangshun
    Huang, Huasheng
    Luo, Shaoming
    SWARM AND EVOLUTIONARY COMPUTATION, 2022, 75
  • [38] Pursuit-evasion game with online planning using deep reinforcement learning
    Chen, Yong
    Shi, Yu
    Dai, Xunhua
    Meng, Qing
    Yu, Tao
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [39] State of the Art Control of Atari Games Using Shallow Reinforcement Learning
    Liang, Yitao
    Machado, Marlos C.
    Talvitie, Erik
    Bowling, Michael
    AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 485 - 493
  • [40] Orbital Interception Pursuit Strategy for Random Evasion Using Deep Reinforcement Learning
    Jiang, Rui
    Ye, Dong
    Xiao, Yan
    Sun, Zhaowei
    Zhang, Zeming
    SPACE: SCIENCE & TECHNOLOGY, 2023, 3