Improving user specifications for robot behavior through active preference learning: Framework and evaluation

被引:27
作者
Wilde, Nils [1 ]
Blidaru, Alexandru [1 ]
Smith, Stephen L. [1 ]
Kulic, Dana [1 ,2 ]
机构
[1] Univ Waterloo, Dept Elect & Comp Engn, 200 Univ Ave West, Waterloo, ON N2L 3G1, Canada
[2] Monash Univ, Melbourne, Vic, Australia
基金
加拿大自然科学与工程研究理事会;
关键词
Cognitive HRI; motion planning; PERFORMANCE; COMPLEXITY;
D O I
10.1177/0278364920910802
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
An important challenge in human-robot interaction (HRI) is enabling non-expert users to specify complex tasks for autonomous robots. Recently, active preference learning has been applied in HRI to interactively shape a robot's behavior. We study a framework where users specify constraints on allowable robot movements on a graphical interface, yielding a robot task specification. However, users may not be able to accurately assess the impact of such constraints on the performance of a robot. Thus, we revise the specification by iteratively presenting users with alternative solutions where some constraints might be violated, and learn about the importance of the constraints from the users' choices between these alternatives. We demonstrate our framework in a user study with a material transport task in an industrial facility. We show that nearly all users accept alternative solutions and thus obtain a revised specification through the learning process, and that the revision leads to a substantial improvement in robot performance. Further, the learning process reduces the variances between the specifications from different users and, thus, makes the specifications more similar. As a result, the users whose initial specifications had the largest impact on performance benefit the most from the interactive learning.
引用
收藏
页码:651 / 667
页数:17
相关论文
共 57 条
[1]  
Abbeel P, 2004, P 21 INT C MACH LEAR, P1
[2]  
Akgun B, 2012, ACMIEEE INT CONF HUM, P391
[3]  
Akrour Riad, 2012, Machine Learning and Knowledge Discovery in Databases. Proceedings of the European Conference (ECML PKDD 2012), P116, DOI 10.1007/978-3-642-33486-3_8
[4]  
Anderson G. T., 2007, P IEEE INT C SYST MA, P2461
[5]  
[Anonymous], 2013, ROBOTICS SCI SYSTEMS
[6]  
[Anonymous], 2015, INT J ROB RES
[7]  
[Anonymous], Aaai
[8]  
[Anonymous], 2018, Combinatorial Optimization: Theory and Algorithms
[9]   A survey of robot learning from demonstration [J].
Argall, Brenna D. ;
Chernova, Sonia ;
Veloso, Manuela ;
Browning, Brett .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2009, 57 (05) :469-483
[10]   An empirical evaluation of the System Usability Scale [J].
Bangor, Aaron ;
Kortum, Philip T. ;
Miller, James T. .
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2008, 24 (06) :574-594