Cooperative pursuit of unauthorized UAVs in urban airspace via Multi-agent reinforcement learning

被引:45
作者
Du, Wenbo [1 ,2 ]
Guo, Tong [1 ,2 ]
Chen, Jun [3 ]
Li, Biyue [1 ,2 ]
Zhu, Guangxiang [4 ]
Cao, Xianbin [1 ,2 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100083, Peoples R China
[2] Natl Engn Lab Big Data Applicat Technol Comprehen, Beijing 100083, Peoples R China
[3] Queen Mary Univ London, Sch Engn & Mat Sci, Mile End Rd, London E1 4NS, England
[4] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
基金
中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Urban Air Mobility (UAM); Unmanned Aerial Vehicle (UAV); Multi-agent Reinforcement Learning (MARL); SURVEILLANCE-EVASION; DIFFERENTIAL GAME;
D O I
10.1016/j.trc.2021.103122
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Urban Air Mobility (UAM) is an emergent concept for future air transportation. With UAM, cargo and passengers will be transported on-demand in urban airspace. UAM has shown a promising prospect in mitigating ground congestion and providing people with an alternative mobility option. However, unauthorized unmanned aerial vehicles (UAVs) in urban airspace present a significant threat to safety of UAM, drawing significant attention from research communities recently. Among all solutions, cooperative pursuit using a team of UAVs is an effective countermeasure for unauthorized UAVs in urban airspace. In this paper, we model cooperative pursuit as a pursuit-evasion game problem (PEG) and propose a multi-agent reinforcement learning (MARL) based approach to solve the problem efficiently. The proposed approach incorporates novel cellular-enabled parameter sharing and curriculum learning schemes to enhance the capability of pursuer UAVs in capturing faster unauthorized UAVs in urban airspace. Extensive experiments have been conducted using simulated urban airspace in order to evaluate the performance of the proposed method. Experimental results demonstrate that by incorporating the parameter sharing scheme, the proposed methods provide much higher capturing rates in a shorter time. Such superiority is more evident when communication constraints are more stringent and/or unauthorized UAVs are faster.
引用
收藏
页数:17
相关论文
共 48 条
[1]  
Abichandani P., 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2011), P2306, DOI 10.1109/IROS.2011.6048822
[2]  
Al-Talabi A A, 2017, 2017 INT AUT CONTR C, P1
[3]  
Awheda Mostafa D., 2016, 2016 Annual IEEE Systems Conference (SysCon). Proceedings, DOI 10.1109/SYSCON.2016.7490516
[4]   Optimal Pursuit of Moving Targets using Dynamic Voronoi Diagrams [J].
Bakolas, Efstathios ;
Tsiotras, Panagiotis .
49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, :7431-7436
[5]  
Bertram J., 2020, P AIAA SCIT 2020 FOR
[6]  
Bilgin AT, 2015, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), P164, DOI 10.1109/ICAR.2015.7251450
[7]   On Discrete-Time Pursuit-Evasion Games With Sensing Limitations [J].
Bopardikar, Shatmak D. ;
Bullo, Francesco ;
Hespanha, Joao P. .
IEEE TRANSACTIONS ON ROBOTICS, 2008, 24 (06) :1429-1439
[8]   A cooperative Homicidal Chauffeur game [J].
Bopardikar, Shaunak D. ;
Bullo, Francesco ;
Hespanha, Joao P. .
AUTOMATICA, 2009, 45 (07) :1771-1777
[9]   A comprehensive survey of multiagent reinforcement learning [J].
Busoniu, Lucian ;
Babuska, Robert ;
De Schutter, Bart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) :156-172
[10]   Multi-player pursuit evasion games with one superior evader [J].
Chen, Jie ;
Zha, Wenzhong ;
Peng, Zhihong ;
Gu, Dongbing .
AUTOMATICA, 2016, 71 :24-32