Multi-Channel Interactive Reinforcement Learning for Sequential Tasks

被引：9

作者：

Koert, Dorothea ^{[1
,2
]}

Kircher, Maximilian ^{[1
]}

Salikutluk, Vildan ^{[2
,3
]}

D'Eramo, Carlo ^{[1
]}

Peters, Jan ^{[1
,4
]}

机构：

[1] Tech Univ Darmstadt, Dept Comp Sci, Intelligent Autonomous Syst Grp, Darmstadt, Germany

[2] Tech Univ Darmstadt, Ctr Cognit Sci, Darmstadt, Germany

[3] Tech Univ Darmstadt, Dept Psychol, Models Higher Cognit Grp, Darmstadt, Germany

[4] Max Planck Inst Intelligent Syst, Robot Learning Grp, Tubingen, Germany

来源：

FRONTIERS IN ROBOTICS AND AI | 2020年 / 7卷

关键词：

human-robot interaction; interactive reinforcement learning; human-centered AI; robotic tasks; user studies; BEHAVIOR;

D O I：

10.3389/frobt.2020.00097

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

The ability to learn new tasks by sequencing already known skills is an important requirement for future robots. Reinforcement learning is a powerful tool for this as it allows for a robot to learn and improve on how to combine skills for sequential tasks. However, in real robotic applications, the cost of sample collection and exploration prevent the application of reinforcement learning for a variety of tasks. To overcome these limitations, human input during reinforcement can be beneficial to speed up learning, guide the exploration and prevent the choice of disastrous actions. Nevertheless, there is a lack of experimental evaluations of multi-channel interactive reinforcement learning systems solving robotic tasks with input from inexperienced human users, in particular for cases where human input might be partially wrong. Therefore, in this paper, we present an approach that incorporates multiple human input channels for interactive reinforcement learning in a unified framework and evaluate it on two robotic tasks with 20 inexperienced human subjects. To enable the robot to also handle potentially incorrect human input we incorporate a novel concept for self-confidence, which allows the robot to question human input after an initial learning phase. The second robotic task is specifically designed to investigate if this self-confidence can enable the robot to achieve learning progress even if the human input is partially incorrect. Further, we evaluate how humans react to suggestions of the robot, once the robot notices human input might be wrong. Our experimental evaluations show that our approach can successfully incorporate human input to accelerate the learning process in both robotic tasks even if it is partially wrong. However, not all humans were willing to accept the robot's suggestions or its questioning of their input, particularly if they do not understand the learning process and the reasons behind the robot's suggestions. We believe that the findings from this experimental evaluation can be beneficial for the future design of algorithms and interfaces of interactive reinforcement learning systems used by inexperienced users.

引用

页数：19

共 50 条

[1] A Multi-Channel Reinforcement Learning Framework for Robotic Mirror Therapy
Xu, Jiajun
Xu, Linsen
Li, Youfu
Cheng, Gaoxin
Shi, Jia
Liu, Jinfu
Chen, Shouqi
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04): : 5385 - 5392
[2] Sequential Learning for Optimal Monitoring of Multi-channel Wireless Networks
Arora, Pallavi
Szepesvari, Csaba
Zheng, Rong
2011 PROCEEDINGS IEEE INFOCOM, 2011, : 1152 - 1160
[3] Sequential Learning for Multi-Channel Wireless Network Monitoring With Channel Switching Costs
Thanh Le
Szepesvari, Csaba
Zheng, Rong
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (22) : 5919 - 5929
[4] Dynamic Multi-channel Access in Wireless System with Deep Reinforcement Learning
Li, Fan
Zhu, Yun
Xu, Youyun
2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2020, : 283 - 287
[5] A MULTI-CHANNEL SEQUENTIAL DETECTION PROCEDURE
NADELYAYEV, YV
RADIO ENGINEERING AND ELECTRONIC PHYSICS-USSR, 1969, 14 (12): : 1842 - +
[6] Multi-Channel Opportunistic Access for Heterogeneous Networks Based on Deep Reinforcement Learning
Ye, Xiaowen
Yu, Yiding
Fu, Liqun
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (02) : 794 - 807
[7] Hierarchical Reinforcement Learning on Multi-Channel Hypergraph Neural Network for Course Recommendation
Jiang, Lu
Xiao, Yanan
Zhao, Xinxin
Xu, Yuanbo
Hu, Shuli
Wang, Pengyang
Yin, Minghao
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 2099 - 2107
[8] MAC Protocol for Multi-channel Heterogeneous Networks Based on Deep Reinforcement Learning
Ye, Xiaowen
Yu, Yiding
Fu, Liqun
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[9] Multi-channel publishing of interactive multimedia presentations
Van Assche, S
Hendrickx, F
Oorts, N
Nachtergaele, L
COMPUTERS & GRAPHICS-UK, 2004, 28 (02): : 193 - 206
[10] Sequential Good Channel Search for Multi-channel Cognitive Radio
Caromi, Raied
Mohan, Seshadri
Lai, Lifeng
2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 313 - 317

← 1 2 3 4 5 →