Hybrid-Pursuit Strategies in Multiple Pursuer-Evader Games Using Reinforcement Learning

被引:0
|
作者
Guan, Yacun [1 ,2 ]
Xu, Wang [2 ]
Liu, Guohua [1 ]
机构
[1] Nankai Univ, Coll Elect Informat & Opt Engn, Tianjin 300350, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
中国国家自然科学基金;
关键词
Training; Heuristic algorithms; Games; Image reconstruction; Safety; Optimization; Decoding; Collision avoidance; Vectors; Real-time systems; Multiple pursuer-evader; cooperative strategy; obstacle avoidance; reinforcement learning; EVASION GAME;
D O I
10.1109/ACCESS.2024.3514706
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a comprehensive learning strategy for the collaborative pursuit of evaders by multiple pursuers in environments with dynamic obstacles. Utilizing a variational autoencoder framework for effective obstacle detection, we integrate the multiagent twin delayed deep deterministic policy gradient algorithm for training pursuers and the proximal policy optimization algorithm for evaders, forming a complete pursuit-evasion strategy. In addition to collaborative pursuit strategies, our approach incorporates scheme for individual pursuers to directly capture nearby evaders, enhancing the flexibility and robustness of the overall system. The reward mechanism of these hybrid-pursuit strategies is designed to balance cooperative and individual rewards, informed by the states of both agents and obstacles, to optimize overall performance. Simulation results demonstrate the efficacy of the proposed algorithm, achieving successful collaborative and individual pursuits as well as dynamic obstacle avoidance.
引用
收藏
页码:187709 / 187721
页数:13
相关论文
共 50 条
  • [21] Hybrid Path Tracking Control for Autonomous Trucks: Integrating Pure Pursuit and Deep Reinforcement Learning With Adaptive Look-Ahead Mechanism
    Han, Zhixuan
    Chen, Peng
    Zhou, Bin
    Yu, Guizhen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
  • [22] Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning
    Jacob W. Crandall
    Michael A. Goodrich
    Machine Learning, 2011, 82 : 281 - 314
  • [23] Understanding Decisions in Collective Risk Social Dilemma Games Using Reinforcement Learning
    Kumar, Medha
    Dutt, Varun
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2020, 12 (04) : 824 - 840
  • [24] Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning
    Crandall, Jacob W.
    Goodrich, Michael A.
    MACHINE LEARNING, 2011, 82 (03) : 281 - 314
  • [25] Energy-Efficient Online Path Planning of Multiple Drones Using Reinforcement Learning
    Hong, Dooyoung
    Lee, Seonhoon
    Cho, Young Hoo
    Baek, Donkyu
    Kim, Jaemin
    Chang, Naehyuck
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (10) : 9725 - 9740
  • [26] Synthesis of Opacity-Enforcing Supervisory Strategies Using Reinforcement Learning
    Zhang, Huimin
    Huang, Li
    Huang, Wanling
    Feng, Lei
    Li, Xianxian
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 6896 - 6906
  • [27] Game Adaptation by Using Reinforcement Learning Over Meta Games
    Reis, Simao
    Reis, Luis Paulo
    Lau, Nuno
    GROUP DECISION AND NEGOTIATION, 2021, 30 (02) : 321 - 340
  • [28] Game Adaptation by Using Reinforcement Learning Over Meta Games
    Simão Reis
    Luís Paulo Reis
    Nuno Lau
    Group Decision and Negotiation, 2021, 30 : 321 - 340
  • [29] Machine Learning Approach for Multiple Coordinated Aerial Drones Pursuit-Evasion Games
    Al-Mahbashi, Ammar
    Schwartz, Howard
    Lambadaris, Ioannis
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 642 - 647
  • [30] A Hybrid Task Scheduling Technique in Fog Computing Using Fuzzy Logic and Deep Reinforcement Learning
    Choppara, Prashanth
    Mangalampalli, S. Sudheer
    IEEE ACCESS, 2024, 12 : 176363 - 176388