A Fully-Autonomous Aerial Robot for Search and Rescue Applications in Indoor Environments using Learning-Based Techniques

被引:139
作者
Sampedro, Carlos [1 ]
Rodriguez-Ramos, Alejandro [1 ]
Bavle, Hriday [1 ]
Carrio, Adrian [1 ]
de la Puente, Paloma [1 ]
Campoy, Pascual [1 ]
机构
[1] UPM, CSIC, Ctr Automat & Robot, Madrid, Spain
关键词
Autonomous robots; Search and rescue; Supervised learning; Reinforcement learning; Deep learning; Image-based visual servoing; UAV SYSTEM;
D O I
10.1007/s10846-018-0898-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Search and Rescue (SAR) missions represent an important challenge in the robotics research field as they usually involve exceedingly variable-nature scenarios which require a high-level of autonomy and versatile decision-making capabilities. This challenge becomes even more relevant in the case of aerial robotic platforms owing to their limited payload and computational capabilities. In this paper, we present a fully-autonomous aerial robotic solution, for executing complex SAR missions in unstructured indoor environments. The proposed system is based on the combination of a complete hardware configuration and a flexible system architecture which allows the execution of high-level missions in a fully unsupervised manner (i.e. without human intervention). In order to obtain flexible and versatile behaviors from the proposed aerial robot, several learning-based capabilities have been integrated for target recognition and interaction. The target recognition capability includes a supervised learning classifier based on a computationally-efficient Convolutional Neural Network (CNN) model trained for target/background classification, while the capability to interact with the target for rescue operations introduces a novel Image-Based Visual Servoing (IBVS) algorithm which integrates a recent deep reinforcement learning method named Deep Deterministic Policy Gradients (DDPG). In order to train the aerial robot for performing IBVS tasks, a reinforcement learning framework has been developed, which integrates a deep reinforcement learning agent (e.g. DDPG) with a Gazebo-based simulator for aerial robotics. The proposed system has been validated in a wide range of simulation flights, using Gazebo and PX4 Software-In-The-Loop, and real flights in cluttered indoor environments, demonstrating the versatility of the proposed system in complex SAR missions.
引用
收藏
页码:601 / 627
页数:27
相关论文
共 53 条
[1]   A remotely piloted aircraft system in major incident management: Concept and pilot, feasibility study [J].
Abrahamsen H.B. .
BMC Emergency Medicine, 15 (1)
[2]  
[Anonymous], 2004, TECH REP
[3]  
[Anonymous], 2011, RAPID COMMUN MASS SP
[4]  
[Anonymous], IEEE INT CONF ROBOT
[5]  
[Anonymous], 1 S IND FLIGHT 2009
[6]  
[Anonymous], PROC CVPR IEEE
[7]  
[Anonymous], ADV NEURAL INFORM PR, DOI DOI 10.1109/TPAMI.2016.2577031
[8]  
[Anonymous], ADV NEURAL INFORM PR
[9]  
[Anonymous], 2017, ARXIV170311000
[10]  
[Anonymous], P 13 INT C INT C INT