Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning

被引:0
|
作者
Gao, Jensen [1 ,2 ]
Reddy, Siddharth [2 ]
Berseth, Glen [2 ,3 ,4 ]
Dragan, Anca D. [2 ]
Levine, Sergey [2 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
[3] Univ Montreal, Montreal, PQ, Canada
[4] MILA, Montreal, PQ, Canada
关键词
D O I
10.1109/IROS55552.2023.10341779
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adaptive interfaces can help users perform sequential decision-making tasks like robotic teleoperation given noisy, high-dimensional command signals (e.g., from a brain-computer interface). Recent advances in human-in-the-loop machine learning enable such systems to improve by interacting with users, but tend to be limited by the amount of data that they can collect from individual users in practice. In this paper, we propose a reinforcement learning algorithm to address this by training an interface to map raw command signals to actions using a combination of offline pre-training and online fine-tuning. To address the challenges posed by noisy command signals and sparse rewards, we develop a novel method for representing and inferring the user's long-term intent for a given trajectory. We primarily evaluate our method's ability to assist users who can only communicate through noisy, high-dimensional input channels through a user study in which 12 participants performed a simulated navigation task by using their eye gaze to modulate a 128-dimensional command signal from their webcam. The results show that our method enables successful goal navigation more often than a baseline directional interface, by learning to denoise user commands signals and provide shared autonomy assistance. We further evaluate on a simulated Sawyer pushing task with eye gaze control, and the Lunar Lander game with simulated user commands, and find that our method improves over baseline interfaces in these domains as well. Extensive ablation experiments with simulated user commands empirically motivate each component of our method.
引用
收藏
页码:7523 / 7530
页数:8
相关论文
共 50 条
  • [11] Human-Machine Coadaptation Based on Reinforcement Learning with Policy Gradients
    Tahboub, Karim A.
    2019 8TH INTERNATIONAL CONFERENCE ON SYSTEMS AND CONTROL (ICSC'19), 2019, : 247 - 251
  • [12] An Optimization Framework for Information Management in Adaptive Automotive Human-Machine Interfaces
    Tufano, Francesco
    Bahadure, Sushant Waman
    Tufo, Manuela
    Novella, Luigi
    Fiengo, Giovanni
    Santini, Stefania
    APPLIED SCIENCES-BASEL, 2023, 13 (19):
  • [13] Structured dataset of human-machine interactions enabling adaptive user interfaces
    Angela Carrera-Rivera
    Daniel Reguera-Bakhache
    Felix Larrinaga
    Ganix Lasa
    Iñaki Garitano
    Scientific Data, 10
  • [14] Structured dataset of human-machine interactions enabling adaptive user interfaces
    Carrera-Rivera, Angela
    Reguera-Bakhache, Daniel
    Larrinaga, Felix
    Lasa, Ganix
    Garitano, Inaki
    SCIENTIFIC DATA, 2023, 10 (01)
  • [15] Human-Machine Interfaces Based on Biosignals
    Schultz, Tanja
    Amma, Christoph
    Heger, Dominic
    Putze, Felix
    Wand, Michael
    AT-AUTOMATISIERUNGSTECHNIK, 2013, 61 (11) : 760 - 769
  • [16] Architectures for adaptable human-machine interfaces
    Hefley, W.E.
    Proceedings of the International Conference on Human Aspects of Advanced Manufacturing and Hybrid Automation, 1990,
  • [17] Auditory displays in human-machine interfaces
    Johannsen, G
    PROCEEDINGS OF THE IEEE, 2004, 92 (04) : 742 - 758
  • [18] Principles for External Human-Machine Interfaces
    Wilbrink, Marc
    Cieler, Stephan
    Weiss, Sebastian L.
    Beggiato, Matthias
    Joisten, Philip
    Feierle, Alexander
    Oehl, Michael
    INFORMATION, 2023, 14 (08)
  • [19] Electronic Devices for Human-Machine Interfaces
    Wang, Hong
    Ma, Xiaohua
    Hao, Yue
    ADVANCED MATERIALS INTERFACES, 2017, 4 (04):
  • [20] Human-Machine Interfaces: Methods of Control
    Edwards, John
    IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (04) : 8 - 11