Supervised autonomy for online learning in human-robot interaction

被引：30

作者：

Senft, Emmanuel ^{[1
]}

Baxter, Paul ^{[2
]}

Kennedy, James ^{[1
]}

Lemaignan, Severin ^{[1
]}

Belpaeme, Tony ^{[1
,3
]}

机构：

[1] Plymouth Univ, Plymouth PL4 8AA, Devon, England

[2] Univ Lincoln, Lincoln Ctr Autonomous Syst, Lincoln LN6 7TS, England

[3] Univ Ghent, Dept Elect & Informat Syst, Imec IDLab, Ghent, Belgium

来源：

PATTERN RECOGNITION LETTERS | 2017年 / 99卷

关键词：

Human-Robot interaction; Reinforcement learning; Interactive machine learning; Robotics; Progressive Autonomy; Supervised autonomy;

D O I：

10.1016/j.patrec.2017.03.015

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

When a robot is learning it needs to explore its environment and how its environment responds on its actions. When the environment is large and there are a large number of possible actions the robot can take, this exploration phase can take prohibitively long. However, exploration can often be optimised by letting a human expert guide the robot during its learning. Interactive machine learning, in which a human user interactively guides the robot as it learns, has been shown to be an effective way to teach a robot. It requires an intuitive control mechanism to allow the human expert to provide feedback on the robot's progress. This paper presents a novel method which combines Reinforcement Learning and Supervised Progressively Autonomous Robot Competencies (SPARC). By allowing the user to fully control the robot and by treating rewards as implicit, SPARC aims to learn an action policy while maintaining human supervisory oversight of the robot's behaviour. This method is evaluated and compared to Interactive Reinforcement Learning in a robot teaching task. Qualitative and quantitative results indicate that SPARC allows for safer and faster learning by the robot, whilst not placing a high workload on the human teacher. (C) 2017 Elsevier B.V. All rights reserved.

引用

页码：77 / 86

页数：10

共 50 条

[41] Human-Robot Proxemics: Physical and Psychological Distancing in Human-Robot Interaction [J].

Mumm, Jonathan ;

Mutlu, Bilge .

PROCEEDINGS OF THE 6TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTIONS (HRI 2011), 2011, :331-338

[42] Editorial: Shared Autonomy-Learning of Joint Action and Human-Robot Collaboration [J].

Schilling, Malte ;

Burgard, Wolfram ;

Muelling, Katharina ;

Wrede, Britta ;

Ritter, Helge .

FRONTIERS IN NEUROROBOTICS, 2019, 13

[43] Expressiveness in human-robot interaction [J].

Marti, Patrizia ;

Giusti, Leonardo ;

Pollini, Alessandro ;

Rullo, Alessia .

INTERACTION DESIGN AND ARCHITECTURES, 2008, (5-6) :93-98

[44] Communication in Human-Robot Interaction [J].

Andrea Bonarini .

Current Robotics Reports, 2020, 1 (4) :279-285

[45] Human-robot interaction and psychoanalysis [J].

Scalzone, Franco ;

Tamburrini, Guglielmo .

AI & SOCIETY, 2013, 28 (03) :297-307

[46] The Science of Human-Robot Interaction [J].

Kiesler, Sara ;

Goodrich, Michael A. .

ACM TRANSACTIONS ON HUMAN-ROBOT INTERACTION, 2018, 7 (01)

[47] Sound in Human-Robot Interaction [J].

Pelikan, Hannah ;

Robinson, Frederic Anthony ;

Keevallik, Leelo ;

Velonaki, Mari ;

Broth, Mathias ;

Bown, Oliver .

HRI '21: COMPANION OF THE 2021 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2021, :706-708

[48] Semiotics and human-robot interaction [J].

Sequeira, Joao Silva ;

Ribeiro, Maria Isabel .

ICINCO 2006: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS: ROBOTICS AND AUTOMATION, 2006, :58-65

[49] Cognitive Objects for Human-Computer Interaction and Human-Robot Interaction [J].

Moeller, Andreas ;

Roalter, Luis ;

Kranz, Matthias .

PROCEEDINGS OF THE 6TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTIONS (HRI 2011), 2011, :207-208

[50] Immersive Human-Robot Interaction [J].

Sandygulova, Anara ;

Campbell, Abraham G. ;

Dragone, Mauro ;

O'Hare, G. M. P. .

HRI'12: PROCEEDINGS OF THE SEVENTH ANNUAL ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2012, :227-228

← 1 2 3 4 5 →