Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning

被引:0
|
作者
Gao, Jensen [1 ,2 ]
Reddy, Siddharth [2 ]
Berseth, Glen [2 ,3 ,4 ]
Dragan, Anca D. [2 ]
Levine, Sergey [2 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
[3] Univ Montreal, Montreal, PQ, Canada
[4] MILA, Montreal, PQ, Canada
关键词
D O I
10.1109/IROS55552.2023.10341779
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adaptive interfaces can help users perform sequential decision-making tasks like robotic teleoperation given noisy, high-dimensional command signals (e.g., from a brain-computer interface). Recent advances in human-in-the-loop machine learning enable such systems to improve by interacting with users, but tend to be limited by the amount of data that they can collect from individual users in practice. In this paper, we propose a reinforcement learning algorithm to address this by training an interface to map raw command signals to actions using a combination of offline pre-training and online fine-tuning. To address the challenges posed by noisy command signals and sparse rewards, we develop a novel method for representing and inferring the user's long-term intent for a given trajectory. We primarily evaluate our method's ability to assist users who can only communicate through noisy, high-dimensional input channels through a user study in which 12 participants performed a simulated navigation task by using their eye gaze to modulate a 128-dimensional command signal from their webcam. The results show that our method enables successful goal navigation more often than a baseline directional interface, by learning to denoise user commands signals and provide shared autonomy assistance. We further evaluate on a simulated Sawyer pushing task with eye gaze control, and the Lunar Lander game with simulated user commands, and find that our method improves over baseline interfaces in these domains as well. Extensive ablation experiments with simulated user commands empirically motivate each component of our method.
引用
收藏
页码:7523 / 7530
页数:8
相关论文
共 50 条
  • [1] Learning Algorithms for Human-Machine Interfaces
    Danziger, Zachary
    Fishbach, Alon
    Mussa-Ivaldi, Ferdinando A.
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2009, 56 (05) : 1502 - 1511
  • [2] Adaptive interfaces as an approach to human-machine cooperation
    Eggleston, RG
    DESIGN OF COMPUTING SYSTEMS: SOCIAL AND ERGONOMIC CONSIDERATIONS, 1997, 21 : 495 - 500
  • [3] Anatomically Designed Triboelectric Wristbands with Adaptive Accelerated Learning for Human-Machine Interfaces
    Fang, Han
    Wang, Lei
    Fu, Zhongzheng
    Xu, Liang
    Guo, Wei
    Huang, Jian
    Wang, Zhong Lin
    Wu, Hao
    ADVANCED SCIENCE, 2023, 10 (06)
  • [4] Adaptive human-machine interfaces in cognitive production environments
    Wallhoff, F.
    Ablassmeier, M.
    Bannat, A.
    Buchta, S.
    Rauschert, A.
    Rigoll, G.
    Wiesbeck, M.
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 2246 - +
  • [5] Machine Learning-Supported Designing of Human-Machine Interfaces
    Bantay, Laszlo
    Abonyi, Janos
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [6] Automatic Learning for Supporting Advanced Human-Machine Interfaces
    Cuzzocrea, Alfredo
    Mumolo, Enzo
    Grasso, Giorgio Mario
    2015 9TH INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS CISIS 2015, 2015, : 12 - 18
  • [7] The remapping of space in motor learning and human-machine interfaces
    Mussa-Ivaldi, F. A.
    Danziger, Z.
    JOURNAL OF PHYSIOLOGY-PARIS, 2009, 103 (3-5) : 263 - 275
  • [8] Exploring the transformation of user interactions to Adaptive Human-Machine Interfaces
    Carrera-Rivera, Angela
    Reguera-Bakhache, Daniel
    Larrinaga, Felix
    Lasa, Ganix
    PROCEEDINGS OF THE XXIII INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION, INTERACCION 2023, 2023,
  • [9] Fusing Stretchable Sensing Technology with Machine Learning for Human-Machine Interfaces
    Wang, Ming
    Wang, Ting
    Luo, Yifei
    He, Ke
    Pan, Liang
    Li, Zheng
    Cui, Zequn
    Liu, Zhihua
    Tu, Jiaqi
    Chen, Xiaodong
    ADVANCED FUNCTIONAL MATERIALS, 2021, 31 (39)
  • [10] A Human-Machine Reinforcement Learning Method for Cooperative Energy Management
    Tao, Yuechuan
    Qiu, Jing
    Lai, Shuying
    Zhang, Xian
    Wang, Yunqi
    Wang, Guibin
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (05) : 2974 - 2985