Teacher feedback to scaffold and refine demonstrated motion primitives on a mobile robot

被引:16
作者
Argall, Brenna D. [1 ]
Browning, Brett [1 ]
Veloso, Manuela M. [2 ]
机构
[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15224 USA
[2] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15224 USA
关键词
Demonstration learning; Policy reuse; Teacher feedback; Robot motion control;
D O I
10.1016/j.robot.2010.11.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Task demonstration is an effective technique for developing robot motion control policies. As tasks become more complex, however, demonstration can become more difficult. In this work, we introduce an algorithm that uses corrective human feedback to build a policy able to perform a novel task, by combining simpler policies learned from demonstration. While some demonstration-based learning approaches do adapt policies with execution experience, few provide corrections within low-level motion control domains or to enable the linking of multiple of demonstrated policies. Here we introduce Feedback for Policy Scaffolding (FPS) as an algorithm that first evaluates and corrects the execution of motion primitive policies learned from demonstration. The algorithm next corrects and enables the execution of a more complex task constructed from these primitives. Key advantages of building a policy from demonstrated primitives is the potential for primitive policy reuse within multiple complex policies and the faster development of these policies, in addition to the development of complex policies for which full demonstration is difficult. Policy reuse under our algorithm is assisted by human teacher feedback, which also contributes to the improvement of policy performance. Within a simulated robot motion control domain we validate that, using FPS, a policy for a novel task is successfully built from motion primitives learned from demonstration. We show feedback to both aid and enable policy development, improving policy performance in success, speed and efficiency. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:243 / 255
页数:13
相关论文
共 25 条
[1]  
[Anonymous], P 3 ACM IEEE INT C H
[2]  
[Anonymous], P IEEE RSJ INT C INT
[3]  
[Anonymous], P IEEE INT S APPR DY
[4]  
[Anonymous], INT S EXP ROB SING
[5]  
[Anonymous], THESIS CARNEGIE MELL
[6]  
[Anonymous], IMITATION ANIMALS AR
[7]  
[Anonymous], P IEEE INT C ROB AUT
[8]  
[Anonymous], 1 ANN C HUM ROB INT
[9]  
[Anonymous], 2005, P 22 INT C MACH LEAR
[10]  
[Anonymous], SPIE MOBILE ROBOTS