Supervised autonomy for online learning in human-robot interaction

被引:30
作者
Senft, Emmanuel [1 ]
Baxter, Paul [2 ]
Kennedy, James [1 ]
Lemaignan, Severin [1 ]
Belpaeme, Tony [1 ,3 ]
机构
[1] Plymouth Univ, Plymouth PL4 8AA, Devon, England
[2] Univ Lincoln, Lincoln Ctr Autonomous Syst, Lincoln LN6 7TS, England
[3] Univ Ghent, Dept Elect & Informat Syst, Imec IDLab, Ghent, Belgium
关键词
Human-Robot interaction; Reinforcement learning; Interactive machine learning; Robotics; Progressive Autonomy; Supervised autonomy;
D O I
10.1016/j.patrec.2017.03.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When a robot is learning it needs to explore its environment and how its environment responds on its actions. When the environment is large and there are a large number of possible actions the robot can take, this exploration phase can take prohibitively long. However, exploration can often be optimised by letting a human expert guide the robot during its learning. Interactive machine learning, in which a human user interactively guides the robot as it learns, has been shown to be an effective way to teach a robot. It requires an intuitive control mechanism to allow the human expert to provide feedback on the robot's progress. This paper presents a novel method which combines Reinforcement Learning and Supervised Progressively Autonomous Robot Competencies (SPARC). By allowing the user to fully control the robot and by treating rewards as implicit, SPARC aims to learn an action policy while maintaining human supervisory oversight of the robot's behaviour. This method is evaluated and compared to Interactive Reinforcement Learning in a robot teaching task. Qualitative and quantitative results indicate that SPARC allows for safer and faster learning by the robot, whilst not placing a high workload on the human teacher. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:77 / 86
页数:10
相关论文
共 50 条
  • [31] Learning Proxemics for Personalized Human-Robot Social Interaction
    Patompak, Pakpoom
    Jeong, Sungmoon
    Nilkhamhang, Itthisek
    Chong, Nak Young
    [J]. INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2020, 12 (01) : 267 - 280
  • [32] Continual Learning for Adaptive Affective Human-Robot Interaction
    Maharjan, Rahul Singh
    [J]. 2022 10TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2022,
  • [33] A dialogue manager for multimodal human-robot interaction and learning of a humanoid robot
    Holzapfel, Hartwig
    [J]. INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2008, 35 (06): : 528 - 535
  • [34] Asymmetric Identification Model for Human-Robot Contacts via Supervised Learning
    Abu Al-Haija, Qasem
    Al-Saraireh, Ja'afer
    [J]. SYMMETRY-BASEL, 2022, 14 (03):
  • [35] Redefining User Expectations: The Impact of Adjustable Social Autonomy in Human-Robot Interaction
    Cantucci, Filippo
    Falcone, Rino
    Marini, Marco
    [J]. ELECTRONICS, 2024, 13 (01)
  • [36] Errors in Human-Robot Interactions and Their Effects on Robot Learning
    Kim, Su Kyoung
    Kirchner, Elsa Andrea
    Schlossmueller, Lukas
    Kirchner, Frank
    [J]. FRONTIERS IN ROBOTICS AND AI, 2020, 7
  • [37] On Interaction Quality in Human-Robot Interaction
    Bensch, Suna
    Jevtic, Aleksandar
    Hellstrom, Thomas
    [J]. ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2017, : 182 - 189
  • [38] Editorial: Variable autonomy for human-robot teaming
    Theodorou, Andreas
    Chiou, Manolis
    Lacerda, Bruno
    Rothfuss, Simon
    [J]. FRONTIERS IN ROBOTICS AND AI, 2024, 11
  • [39] Learning Physical Human-Robot Interaction With Coupled Cooperative Primitives for a Lower Exoskeleton
    Huang, Rui
    Cheng, Hong
    Qiu, Jing
    Zhang, Jianwei
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2019, 16 (04) : 1566 - 1574
  • [40] The Effect of Multiple Robot Interaction on Human-Robot Interaction
    Yang, Jeong-Yean
    Kwon, Dong-Soo
    [J]. 2012 9TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAL), 2012, : 30 - 33