Game of Drones: Multi-UAV Pursuit-Evasion Game With Online Motion Planning by Deep Reinforcement Learning

被引：68

作者：

Zhang, Ruilong ^{[1
]}

Zong, Qun ^{[1
]}

Zhang, Xiuyun ^{[1
]}

Dou, Liqian ^{[1
]}

Tian, Bailing ^{[1
]}

机构：

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023年 / 34卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Games; Reinforcement learning; Physics; Engines; Urban areas; Real-time systems; Trajectory; Multiagent reinforcement learning; multiquadcopter motion planning; pursuit-evasion game; trajectory prediction; PREDICTION; DESIGN; LEVEL;

D O I：

10.1109/TNNLS.2022.3146976

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As one of the tiniest flying objects, unmanned aerial vehicles (UAVs) are often expanded as the ``swarm'' to execute missions. In this article, we investigate the multiquadcopter and target pursuit-evasion game in the obstacles environment. For high-quality simulation of the urban environment, we propose the pursuit-evasion scenario (PES) framework to create the environment with a physics engine, which enables quadcopter agents to take actions and interact with the environment. On this basis, we construct multiagent coronal bidirectionally coordinated with target prediction network (CBC-TP Net) with a vectorized extension of multiagent deep deterministic policy gradient (MADDPG) formulation to ensure the effectiveness of the damaged ``swarm'' system in pursuit-evasion mission. Unlike traditional reinforcement learning, we design a target prediction network (TP Net) innovatively in the common framework to imitate the way of human thinking: situation prediction is always before decision-making. The experiments of the pursuit-evasion game are conducted to verify the state-of-the-art performance of the proposed strategy, both in the normal and antidamaged situations.

引用

页码：7900 / 7909

页数：10

共 50 条

[31] Terminal-guidance Based Reinforcement-learning for Orbital Pursuit-evasion Game of the Spacecraft
Geng Y.-Z.
Yuan L.
Huang H.
Tang L.
Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (05): : 974 - 984
[32] Integral reinforcement learning based dynamic stackelberg pursuit-evasion game for unmanned surface vehicles
Hu, Xiaoxiang
Liu, Shuaizheng
Xu, Jingwen
Xiao, Bing
Guo, Chenguang
ALEXANDRIA ENGINEERING JOURNAL, 2024, 108 : 428 - 435
[33] An open loop Stackelberg solution to optimal strategy for UAV pursuit-evasion game
Zhang, Yiqun
Zhang, Pengfei
Wang, Xiaodong
Song, Feng
Li, Chaoyong
Hao, Junhong
AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 129
[34] The Game of Drones: rapid agent-based machine-learning models for multi-UAV path planning
Zohdi, T. I.
COMPUTATIONAL MECHANICS, 2020, 65 (01) : 217 - 228
[35] The Game of Drones: rapid agent-based machine-learning models for multi-UAV path planning
T. I. Zohdi
Computational Mechanics, 2020, 65 : 217 - 228
[36] Multi-Agent Pursuit-Evasion Game Based on Organizational Architecture
Souidi M.E.H.
Siam A.
Pei Z.
Piao S.
Journal of Computing and Information Technology, 2019, 27 (01) : 1 - 12
[37] Multi-Player Pursuit-Evasion Differential Game with Equal Speed
Al-Talabi, Ahmad A.
2017 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2017,
[38] A Two Stage Learning Technique for Dual Learning in the Pursuit-Evasion Differential Game
Al-Talabi, Ahmad A.
Schwartz, Howard M.
2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 243 - 250
[39] Probability Map Partitioning for Multi-player Pursuit-Evasion Game
Kwak, Dong Jun
Kim, H. Jin
INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 294 - 298
[40] Multi-UAV Assisted Offloading Optimization: A Game Combined Reinforcement Learning Approach
Gao, Ang
Wang, Qi
Chen, Kaiyue
Liang, Wei
IEEE COMMUNICATIONS LETTERS, 2021, 25 (08) : 2629 - 2633

← 1 2 3 4 5 →