Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning

被引:0
|
作者
Gao, Jensen [1 ,2 ]
Reddy, Siddharth [2 ]
Berseth, Glen [2 ,3 ,4 ]
Dragan, Anca D. [2 ]
Levine, Sergey [2 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
[3] Univ Montreal, Montreal, PQ, Canada
[4] MILA, Montreal, PQ, Canada
关键词
D O I
10.1109/IROS55552.2023.10341779
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adaptive interfaces can help users perform sequential decision-making tasks like robotic teleoperation given noisy, high-dimensional command signals (e.g., from a brain-computer interface). Recent advances in human-in-the-loop machine learning enable such systems to improve by interacting with users, but tend to be limited by the amount of data that they can collect from individual users in practice. In this paper, we propose a reinforcement learning algorithm to address this by training an interface to map raw command signals to actions using a combination of offline pre-training and online fine-tuning. To address the challenges posed by noisy command signals and sparse rewards, we develop a novel method for representing and inferring the user's long-term intent for a given trajectory. We primarily evaluate our method's ability to assist users who can only communicate through noisy, high-dimensional input channels through a user study in which 12 participants performed a simulated navigation task by using their eye gaze to modulate a 128-dimensional command signal from their webcam. The results show that our method enables successful goal navigation more often than a baseline directional interface, by learning to denoise user commands signals and provide shared autonomy assistance. We further evaluate on a simulated Sawyer pushing task with eye gaze control, and the Lunar Lander game with simulated user commands, and find that our method improves over baseline interfaces in these domains as well. Extensive ablation experiments with simulated user commands empirically motivate each component of our method.
引用
收藏
页码:7523 / 7530
页数:8
相关论文
共 50 条
  • [41] Task Allocation in Human-Machine Manufacturing Systems Using Deep Reinforcement Learning
    Joo, Taejong
    Jun, Hyunyoung
    Shin, Dongmin
    SUSTAINABILITY, 2022, 14 (04)
  • [42] Towards Modern Inclusive Factories: A Methodology for the Development of Smart Adaptive Human-Machine Interfaces
    Villani, Valeria
    Sabattini, Lorenzo
    Czerniak, Julia N.
    Mertens, Alexander
    Vogel-Heuser, Birgit
    Fantuzzi, Cesare
    2017 22ND IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2017,
  • [43] Reinforcement Learning for Human-Machine Collaborative Optimization: Application in Ground Water Monitoring
    Babbar-Sebens, Meghna
    Mukhopadhyay, Snehasis
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3563 - +
  • [44] ADAPTIVE LEARNING SYSTEMS FOR A COMPETENCE-ENHANCING HUMAN-MACHINE INTERACTION
    Lemm, Jacqueline
    Loehrer, Mario
    Gloy, Yves-Simon
    Gries, Thomas
    EDULEARN14: 6TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2014, : 848 - 850
  • [45] Diversification of Adaptive Policy for Effective Offline Reinforcement Learning
    Choi, Yunseon
    Zhao, Li
    Zhang, Chuheng
    Song, Lei
    Bian, Jiang
    Kim, Kee-Eung
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 3863 - 3871
  • [46] Learning to Influence Human Behavior with Offline Reinforcement Learning
    Hong, Joey
    Levine, Sergey
    Dragan, Anca
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [47] Printed, Wireless, Soft Bioelectronics and Deep Learning Algorithm for Smart Human-Machine Interfaces
    Kwon, Young-Tae
    Kim, Hojoong
    Mahmood, Musa
    Kim, Yun-Soung
    Demolder, Carl
    Yeo, Woon-Hong
    ACS APPLIED MATERIALS & INTERFACES, 2020, 12 (44) : 49398 - 49406
  • [48] A Review on Magnetic Smart Skin as Human-Machine Interfaces
    Zhang, Junjie
    Chen, Guangyuan
    Jin, Zhenhu
    Chen, Jiamin
    ADVANCED ELECTRONIC MATERIALS, 2024, 10 (05)
  • [49] Human-Machine Interfaces: A Review for Autonomous Electric Vehicles
    Mandujano-Granillo, Jesus A.
    Candela-Leal, Milton O.
    Ortiz-Vazquez, Juan J.
    Ramirez-Moreno, Mauricio A.
    Tudon-Martinez, Juan C.
    Felix-Herran, Luis C.
    Galvan-Galvan, Alfredo
    Lozoya-Santos, Jorge De J.
    IEEE ACCESS, 2024, 12 : 121635 - 121658
  • [50] Anticipation in speech-based human-machine interfaces
    Ondas, Stanislav
    Juhar, Jozef
    Kiktova, Eva
    Zimmermann, Julius
    2018 9TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2018, : 117 - 121