Application of Deep Reinforcement Learning to UAV Fleet Control

被引:2
作者
Tozicka, Jan [1 ]
Szulyovszky, Benedek [1 ]
de Chambrier, Guillaume [1 ]
Sarwal, Varun [1 ]
Wani, Umar [1 ]
Gribulis, Mantas [1 ]
机构
[1] Accelerated Dynam X, London, England
来源
INTELLIGENT SYSTEMS AND APPLICATIONS, INTELLISYS, VOL 2 | 2019年 / 869卷
关键词
Swarm robotics; UAV; Deep reinforcement learning;
D O I
10.1007/978-3-030-01057-7_85
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing presence of robots and unmanned systems as a result of technological breakthroughs and falling costs has increased the demand for robust and scalable multi-agent control systems. We developed a multi-UAV fleet control system based on recent findings in the deep reinforcement learning literature. A deep convolutional neural network with a linear output layer is chosen as control policy, due to its wide spread applicability, and is trained, in simulation, for two tasks: aerial surveillance and base defense, with five UAVs. The generalization power of the architecture with respect to different fleet sizes was evaluated. For both tasks, at test time, we varied the number of UAVs from one to ten and we found that for all settings the policy was able to accomplish both tasks robustly. We deployed the control policy on a fleet of five DJI Mavic Pro drones and found that it performed well.
引用
收藏
页码:1169 / 1177
页数:9
相关论文
共 8 条
[1]  
[Anonymous], 2017, AUTONOMOUS ROBOTS PL
[2]  
[Anonymous], 2017, CORR
[3]  
Bonkenburg T., 2016, TECHNICAL REPORT
[4]  
Brockman G., 2016, OPENAI GYM
[5]  
Foerster Jakob, 2017, Counterfactual multi-agent policy gradients
[6]  
Insitu, 2017, ESR JOIN FORC FIGHT
[7]  
Olson P., OCADO IS BUILDING RO
[8]  
Wang ZY, 2016, PR MACH LEARN RES, V48