Deep Reinforcement Learning Robot for Search and Rescue Applications: Exploration in Unknown Cluttered Environments

被引:243
作者
Niroui, Farzad [1 ]
Zhang, Kaicheng [1 ]
Kashino, Zendai [1 ]
Nejat, Goldie [1 ]
机构
[1] Univ Toronto, Dept Mech & Ind Engn, Autonomous Syst & Biomechatron Lab, Toronto, ON M5S 3G8, Canada
关键词
Autonomous agents; deep learning in robotics and automation; search and rescue robots;
D O I
10.1109/LRA.2019.2891991
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Rescue robots can be used in urban search and rescue (USAR) applications to perform the important task of exploring unknown cluttered environments. Due to the unpredictable nature of these environments, deep learning techniques can be used to perform these tasks. In this letter, we present the first use of deep learning to address the robot exploration task in USAR applications. In particular, we uniquely combine the traditional approach of frontier-based exploration with deep reinforcement learning to allow a robot to autonomously explore unknown cluttered environments. Experiments conducted with a mobile robot in unknown cluttered environments of varying sizes and layouts showed that the proposed exploration approach can effectively determine appropriate frontier locations to navigate to, while being robust to different environment layouts and sizes. Furthermore, a comparison study with other frontier exploration approaches showed that our learning-based frontier exploration technique was able to explore more of an environment earlier on, allowing for potential identification of a larger number of victims at the beginning of the time-critical exploration task.
引用
收藏
页码:610 / 617
页数:8
相关论文
共 29 条
[1]   A Hybrid Geometric Spatial Image Representation for scene classification [J].
Ali, Nouman ;
Zafar, Bushra ;
Riaz, Faisal ;
Dar, Saadat Hanif ;
Ratyal, Naeem Iqbal ;
Bajwa, Khalid Bashir ;
Iqba, Muhammad Kashif ;
Sajid, Muhammad .
PLOS ONE, 2018, 13 (09)
[2]   Image retrieval by addition of spatial information based on histograms of triangular regions [J].
Ali, Nouman ;
Bajwa, Khalid Bashir ;
Sablatnig, Robert ;
Mehmood, Zahid .
COMPUTERS & ELECTRICAL ENGINEERING, 2016, 54 :539-550
[3]   A Novel Image Retrieval Based on Visual Words Integration of SIFT and SURF [J].
Ali, Nouman ;
Bajwa, Khalid Bashir ;
Sablatnig, Robert ;
Chatzichristofis, Savvas A. ;
Iqbal, Zeshan ;
Rashid, Muhammad ;
Habib, Hafiz Adnan .
PLOS ONE, 2016, 11 (06)
[4]  
[Anonymous], 2016, PROC INT C MACH LEAR
[5]  
[Anonymous], ARXIV170609520CS
[6]  
[Anonymous], P IEEE INT WORKSH SA
[7]  
[Anonymous], ARXIV170404651CS
[8]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[9]  
Barfoot T., 2017, STATE ESTIMATION ROB
[10]   Exploration strategies based on multi-criteria decision making for searching environments in rescue operations [J].
Basilico, Nicola ;
Amigoni, Francesco .
AUTONOMOUS ROBOTS, 2011, 31 (04) :401-417