Confidence-Based Multi-Robot Learning from Demonstration

被引:22
作者
Chernova, Sonia [1 ]
Veloso, Manuela [1 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15217 USA
关键词
Learning from demonstration; Multi-robot learning; Human-robot interaction; Multi-robot systems; SYSTEMS; COORDINATION;
D O I
10.1007/s12369-010-0060-0
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Learning from demonstration algorithms enable a robot to learn a new policy based on demonstrations provided by a teacher. In this article, we explore a novel research direction, multi-robot learning from demonstration, which extends demonstration based learning methods to collaborative multi-robot domains. Specifically, we study the problem of enabling a single person to teach individual policies to multiple robots at the same time. We present flexMLfD, a task and platform independent multirobot demonstration learning framework that supports both independent and collaborative multi-robot behaviors. Building upon this framework, we contribute three approaches to teaching collaborative multi-robot behaviors based on different information sharing strategies, and evaluate these approaches by teaching two Sony QRIO humanoid robots to perform three collaborative ball sorting tasks. We then present scalability analysis of flexMLfD using up to seven Sony AIBO robots. We conclude the article by proposing a formalization for a broader multi-robot learning from demonstration research area.
引用
收藏
页码:195 / 215
页数:21
相关论文
共 50 条
[11]  
Breazeal C., 2004, AAMAS, V4, P1030, DOI DOI 10.1109/AAMAS.2004.258
[12]  
BROWNING B, 2004, P 19 NAT C ART INT A
[13]  
Calinon S, 2007, 2 ANN C HUM ROB INT
[14]  
Chaimowicz L, 2002, 2002 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, P293, DOI 10.1109/ROBOT.2002.1013376
[15]  
CHERNOVA S, 2008, P INT C AUT AG MULT
[16]  
Chernova S, 2009, THESIS CARNEGIE MELL
[17]   Interactive Policy Learning through Confidence-Based Autonomy [J].
Chernova, Sonia ;
Veloso, Manuela .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 34 :1-25
[18]  
Clouse J., 1996, THESIS U MASSACHUSET
[19]   Validating human-robot interaction schemes in multitasking environments [J].
Crandall, JW ;
Goodrich, MA ;
Olsen, DR ;
Nielsen, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2005, 35 (04) :438-449
[20]   Market-based multirobot coordination: A survey and analysis [J].
Dias, M. Bernardine ;
Zlot, Robert ;
Kalra, Nidhi ;
Stentz, Anthony .
PROCEEDINGS OF THE IEEE, 2006, 94 (07) :1257-1270