Multi-Channel Interactive Reinforcement Learning for Sequential Tasks

被引:9
|
作者
Koert, Dorothea [1 ,2 ]
Kircher, Maximilian [1 ]
Salikutluk, Vildan [2 ,3 ]
D'Eramo, Carlo [1 ]
Peters, Jan [1 ,4 ]
机构
[1] Tech Univ Darmstadt, Dept Comp Sci, Intelligent Autonomous Syst Grp, Darmstadt, Germany
[2] Tech Univ Darmstadt, Ctr Cognit Sci, Darmstadt, Germany
[3] Tech Univ Darmstadt, Dept Psychol, Models Higher Cognit Grp, Darmstadt, Germany
[4] Max Planck Inst Intelligent Syst, Robot Learning Grp, Tubingen, Germany
来源
FRONTIERS IN ROBOTICS AND AI | 2020年 / 7卷
关键词
human-robot interaction; interactive reinforcement learning; human-centered AI; robotic tasks; user studies; BEHAVIOR;
D O I
10.3389/frobt.2020.00097
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The ability to learn new tasks by sequencing already known skills is an important requirement for future robots. Reinforcement learning is a powerful tool for this as it allows for a robot to learn and improve on how to combine skills for sequential tasks. However, in real robotic applications, the cost of sample collection and exploration prevent the application of reinforcement learning for a variety of tasks. To overcome these limitations, human input during reinforcement can be beneficial to speed up learning, guide the exploration and prevent the choice of disastrous actions. Nevertheless, there is a lack of experimental evaluations of multi-channel interactive reinforcement learning systems solving robotic tasks with input from inexperienced human users, in particular for cases where human input might be partially wrong. Therefore, in this paper, we present an approach that incorporates multiple human input channels for interactive reinforcement learning in a unified framework and evaluate it on two robotic tasks with 20 inexperienced human subjects. To enable the robot to also handle potentially incorrect human input we incorporate a novel concept for self-confidence, which allows the robot to question human input after an initial learning phase. The second robotic task is specifically designed to investigate if this self-confidence can enable the robot to achieve learning progress even if the human input is partially incorrect. Further, we evaluate how humans react to suggestions of the robot, once the robot notices human input might be wrong. Our experimental evaluations show that our approach can successfully incorporate human input to accelerate the learning process in both robotic tasks even if it is partially wrong. However, not all humans were willing to accept the robot's suggestions or its questioning of their input, particularly if they do not understand the learning process and the reasons behind the robot's suggestions. We believe that the findings from this experimental evaluation can be beneficial for the future design of algorithms and interfaces of interactive reinforcement learning systems used by inexperienced users.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] A Multi-Channel Reinforcement Learning Framework for Robotic Mirror Therapy
    Xu, Jiajun
    Xu, Linsen
    Li, Youfu
    Cheng, Gaoxin
    Shi, Jia
    Liu, Jinfu
    Chen, Shouqi
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04): : 5385 - 5392
  • [2] Sequential Learning for Optimal Monitoring of Multi-channel Wireless Networks
    Arora, Pallavi
    Szepesvari, Csaba
    Zheng, Rong
    2011 PROCEEDINGS IEEE INFOCOM, 2011, : 1152 - 1160
  • [3] Sequential Learning for Multi-Channel Wireless Network Monitoring With Channel Switching Costs
    Thanh Le
    Szepesvari, Csaba
    Zheng, Rong
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (22) : 5919 - 5929
  • [4] Dynamic Multi-channel Access in Wireless System with Deep Reinforcement Learning
    Li, Fan
    Zhu, Yun
    Xu, Youyun
    2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2020, : 283 - 287
  • [5] A MULTI-CHANNEL SEQUENTIAL DETECTION PROCEDURE
    NADELYAYEV, YV
    RADIO ENGINEERING AND ELECTRONIC PHYSICS-USSR, 1969, 14 (12): : 1842 - +
  • [6] Multi-Channel Opportunistic Access for Heterogeneous Networks Based on Deep Reinforcement Learning
    Ye, Xiaowen
    Yu, Yiding
    Fu, Liqun
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (02) : 794 - 807
  • [7] Hierarchical Reinforcement Learning on Multi-Channel Hypergraph Neural Network for Course Recommendation
    Jiang, Lu
    Xiao, Yanan
    Zhao, Xinxin
    Xu, Yuanbo
    Hu, Shuli
    Wang, Pengyang
    Yin, Minghao
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 2099 - 2107
  • [8] MAC Protocol for Multi-channel Heterogeneous Networks Based on Deep Reinforcement Learning
    Ye, Xiaowen
    Yu, Yiding
    Fu, Liqun
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [9] Multi-channel publishing of interactive multimedia presentations
    Van Assche, S
    Hendrickx, F
    Oorts, N
    Nachtergaele, L
    COMPUTERS & GRAPHICS-UK, 2004, 28 (02): : 193 - 206
  • [10] Sequential Good Channel Search for Multi-channel Cognitive Radio
    Caromi, Raied
    Mohan, Seshadri
    Lai, Lifeng
    2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 313 - 317