Application of Deep Reinforcement Learning to UAV Fleet Control

被引：2

作者：

Tozicka, Jan ^{[1
]}

Szulyovszky, Benedek ^{[1
]}

de Chambrier, Guillaume ^{[1
]}

Sarwal, Varun ^{[1
]}

Wani, Umar ^{[1
]}

Gribulis, Mantas ^{[1
]}

机构：

[1] Accelerated Dynam X, London, England

来源：

INTELLIGENT SYSTEMS AND APPLICATIONS, INTELLISYS, VOL 2 | 2019年 / 869卷

关键词：

Swarm robotics; UAV; Deep reinforcement learning;

D O I：

10.1007/978-3-030-01057-7_85

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The increasing presence of robots and unmanned systems as a result of technological breakthroughs and falling costs has increased the demand for robust and scalable multi-agent control systems. We developed a multi-UAV fleet control system based on recent findings in the deep reinforcement learning literature. A deep convolutional neural network with a linear output layer is chosen as control policy, due to its wide spread applicability, and is trained, in simulation, for two tasks: aerial surveillance and base defense, with five UAVs. The generalization power of the architecture with respect to different fleet sizes was evaluated. For both tasks, at test time, we varied the number of UAVs from one to ten and we found that for all settings the policy was able to accomplish both tasks robustly. We deployed the control policy on a fleet of five DJI Mavic Pro drones and found that it performed well.

引用

页码：1169 / 1177

页数：9