Game of Drones: Multi-UAV Pursuit-Evasion Game With Online Motion Planning by Deep Reinforcement Learning

被引:68
|
作者
Zhang, Ruilong [1 ]
Zong, Qun [1 ]
Zhang, Xiuyun [1 ]
Dou, Liqian [1 ]
Tian, Bailing [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Games; Reinforcement learning; Physics; Engines; Urban areas; Real-time systems; Trajectory; Multiagent reinforcement learning; multiquadcopter motion planning; pursuit-evasion game; trajectory prediction; PREDICTION; DESIGN; LEVEL;
D O I
10.1109/TNNLS.2022.3146976
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As one of the tiniest flying objects, unmanned aerial vehicles (UAVs) are often expanded as the ``swarm'' to execute missions. In this article, we investigate the multiquadcopter and target pursuit-evasion game in the obstacles environment. For high-quality simulation of the urban environment, we propose the pursuit-evasion scenario (PES) framework to create the environment with a physics engine, which enables quadcopter agents to take actions and interact with the environment. On this basis, we construct multiagent coronal bidirectionally coordinated with target prediction network (CBC-TP Net) with a vectorized extension of multiagent deep deterministic policy gradient (MADDPG) formulation to ensure the effectiveness of the damaged ``swarm'' system in pursuit-evasion mission. Unlike traditional reinforcement learning, we design a target prediction network (TP Net) innovatively in the common framework to imitate the way of human thinking: situation prediction is always before decision-making. The experiments of the pursuit-evasion game are conducted to verify the state-of-the-art performance of the proposed strategy, both in the normal and antidamaged situations.
引用
收藏
页码:7900 / 7909
页数:10
相关论文
共 50 条
  • [31] Terminal-guidance Based Reinforcement-learning for Orbital Pursuit-evasion Game of the Spacecraft
    Geng Y.-Z.
    Yuan L.
    Huang H.
    Tang L.
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (05): : 974 - 984
  • [32] Integral reinforcement learning based dynamic stackelberg pursuit-evasion game for unmanned surface vehicles
    Hu, Xiaoxiang
    Liu, Shuaizheng
    Xu, Jingwen
    Xiao, Bing
    Guo, Chenguang
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 108 : 428 - 435
  • [33] An open loop Stackelberg solution to optimal strategy for UAV pursuit-evasion game
    Zhang, Yiqun
    Zhang, Pengfei
    Wang, Xiaodong
    Song, Feng
    Li, Chaoyong
    Hao, Junhong
    AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 129
  • [34] The Game of Drones: rapid agent-based machine-learning models for multi-UAV path planning
    Zohdi, T. I.
    COMPUTATIONAL MECHANICS, 2020, 65 (01) : 217 - 228
  • [35] The Game of Drones: rapid agent-based machine-learning models for multi-UAV path planning
    T. I. Zohdi
    Computational Mechanics, 2020, 65 : 217 - 228
  • [36] Multi-Agent Pursuit-Evasion Game Based on Organizational Architecture
    Souidi M.E.H.
    Siam A.
    Pei Z.
    Piao S.
    Journal of Computing and Information Technology, 2019, 27 (01) : 1 - 12
  • [37] Multi-Player Pursuit-Evasion Differential Game with Equal Speed
    Al-Talabi, Ahmad A.
    2017 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2017,
  • [38] A Two Stage Learning Technique for Dual Learning in the Pursuit-Evasion Differential Game
    Al-Talabi, Ahmad A.
    Schwartz, Howard M.
    2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 243 - 250
  • [39] Probability Map Partitioning for Multi-player Pursuit-Evasion Game
    Kwak, Dong Jun
    Kim, H. Jin
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 294 - 298
  • [40] Multi-UAV Assisted Offloading Optimization: A Game Combined Reinforcement Learning Approach
    Gao, Ang
    Wang, Qi
    Chen, Kaiyue
    Liang, Wei
    IEEE COMMUNICATIONS LETTERS, 2021, 25 (08) : 2629 - 2633