A Learning-Based Semi-Autonomous Controller for Robotic Exploration of Unknown Disaster Scenes While Searching for Victims

被引:72
作者
Doroodgar, Barzin [1 ]
Liu, Yugang [1 ]
Nejat, Goldie [1 ]
机构
[1] Univ Toronto, Dept Mech & Ind Engn, Autonomous Syst & Biomechatron Lab, Toronto, ON M5S 3G8, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Hierarchical reinforcement learning; rescue robots; semi-autonomous control; urban search and rescue; URBAN SEARCH; RESCUE;
D O I
10.1109/TCYB.2014.2314294
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semi-autonomous control schemes can address the limitations of both teleoperation and fully autonomous robotic control of rescue robots in disaster environments by allowing a human operator to cooperate and share such tasks with a rescue robot as navigation, exploration, and victim identification. In this paper, we present a unique hierarchical reinforcement learning-based semi-autonomous control architecture for rescue robots operating in cluttered and unknown urban search and rescue (USAR) environments. The aim of the controller is to enable a rescue robot to continuously learn from its own experiences in an environment in order to improve its overall performance in exploration of unknown disaster scenes. A direction-based exploration technique is integrated in the controller to expand the search area of the robot via the classification of regions and the rubble piles within these regions. Both simulations and physical experiments in USAR-like environments verify the robustness of the proposed HRL-based semi-autonomous controller to unknown cluttered scenes with different sizes and varying types of configurations.
引用
收藏
页码:2719 / 2732
页数:14
相关论文
共 22 条
[1]  
Baker M, 2004, IEEE SYS MAN CYBERN, P2948
[2]  
Bruemmer DJ, 2004, IEEE SYS MAN CYBERN, P2847
[3]   Coordinated multi-robot exploration [J].
Burgard, W ;
Moors, M ;
Stachniss, C ;
Schneider, FE .
IEEE TRANSACTIONS ON ROBOTICS, 2005, 21 (03) :376-386
[4]   Human-robot interaction in USAR technical search: Two heads are better than one [J].
Burke, JL ;
Murphy, RR .
RO-MAN 2004: 13TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, PROCEEDINGS, 2004, :307-312
[5]   Moonlight in Miami: A field study of human-robot interaction in the context of an urban search and rescue disaster response training exercise [J].
Burke, JL ;
Murphy, RR ;
Coovert, MD ;
Riddle, DL .
HUMAN-COMPUTER INTERACTION, 2004, 19 (1-2) :85-116
[6]   Human-robot interactions during the robot-assisted urban search and rescue response at the World Trade Center [J].
Casper, J ;
Murphy, RR .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2003, 33 (03) :367-385
[7]  
Crandall JW, 2002, 2002 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-3, PROCEEDINGS, P1290, DOI 10.1109/IRDS.2002.1043932
[8]   Hierarchical reinforcement learning with the MAXQ value function decomposition [J].
Dietterich, TG .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 13 :227-303
[9]  
Doroodgar Barzin, 2010, 2010 IEEE International Conference on Automation Science and Engineering (CASE 2010), P948, DOI 10.1109/COASE.2010.5584599
[10]   The Search for Survivors: Cooperative Human-Robot Interaction in Search and Rescue Environments using Semi-Autonomous Robots [J].
Doroodgar, Barzin ;
Ficocelli, Maurizio ;
Mobedi, Babak ;
Nejat, Goldie .
2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, :2858-2863