Deep Reinforcement Learning Robot for Search and Rescue Applications: Exploration in Unknown Cluttered Environments

Cited by: 243
Authors
Niroui, Farzad [1]
Zhang, Kaicheng [1]
Kashino, Zendai [1]
Nejat, Goldie [1]
Affiliation
[1] Univ Toronto, Dept Mech & Ind Engn, Autonomous Syst & Biomechatron Lab, Toronto, ON M5S 3G8, Canada
Keywords
Autonomous agents; deep learning in robotics and automation; search and rescue robots
DOI
10.1109/LRA.2019.2891991
Chinese Library Classification number
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
Rescue robots can be used in urban search and rescue (USAR) applications to perform the important task of exploring unknown cluttered environments. Due to the unpredictable nature of these environments, deep learning techniques can be used to perform these tasks. In this letter, we present the first use of deep learning to address the robot exploration task in USAR applications. In particular, we uniquely combine the traditional approach of frontier-based exploration with deep reinforcement learning to allow a robot to autonomously explore unknown cluttered environments. Experiments conducted with a mobile robot in unknown cluttered environments of varying sizes and layouts showed that the proposed exploration approach can effectively determine appropriate frontier locations to navigate to, while being robust to different environment layouts and sizes. Furthermore, a comparison study with other frontier exploration approaches showed that our learning-based frontier exploration technique was able to explore more of an environment earlier on, allowing for potential identification of a larger number of victims at the beginning of the time-critical exploration task.
Pages: 610-617
Number of pages: 8