Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning

被引：0

作者：

Gao, Jensen ^{[1
,2
]}

Reddy, Siddharth ^{[2
]}

Berseth, Glen ^{[2
,3
,4
]}

Dragan, Anca D. ^{[2
]}

Levine, Sergey ^{[2
]}

机构：

[1] Stanford Univ, Stanford, CA 94305 USA

[2] Univ Calif Berkeley, Berkeley, CA 94720 USA

[3] Univ Montreal, Montreal, PQ, Canada

[4] MILA, Montreal, PQ, Canada

来源：

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年

关键词：

D O I：

10.1109/IROS55552.2023.10341779

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Adaptive interfaces can help users perform sequential decision-making tasks like robotic teleoperation given noisy, high-dimensional command signals (e.g., from a brain-computer interface). Recent advances in human-in-the-loop machine learning enable such systems to improve by interacting with users, but tend to be limited by the amount of data that they can collect from individual users in practice. In this paper, we propose a reinforcement learning algorithm to address this by training an interface to map raw command signals to actions using a combination of offline pre-training and online fine-tuning. To address the challenges posed by noisy command signals and sparse rewards, we develop a novel method for representing and inferring the user's long-term intent for a given trajectory. We primarily evaluate our method's ability to assist users who can only communicate through noisy, high-dimensional input channels through a user study in which 12 participants performed a simulated navigation task by using their eye gaze to modulate a 128-dimensional command signal from their webcam. The results show that our method enables successful goal navigation more often than a baseline directional interface, by learning to denoise user commands signals and provide shared autonomy assistance. We further evaluate on a simulated Sawyer pushing task with eye gaze control, and the Lunar Lander game with simulated user commands, and find that our method improves over baseline interfaces in these domains as well. Extensive ablation experiments with simulated user commands empirically motivate each component of our method.

引用

页码：7523 / 7530

页数：8

共 50 条

[21] Autonomous Boundary of Human-Machine Collaboration System Based on Reinforcement Learning
Zhang, Qianqian
Zhao, Yun-Bo
Kang, Yu
2020 AUSTRALIAN AND NEW ZEALAND CONTROL CONFERENCE (ANZCC 2020), 2020, : 160 - 165
[22] Human-Machine Collaborative Reinforcement Learning for Power Line Flow Regulation
Wang, Chenxi
Du, Youtian
Chang, Yuanlin
Guo, Zihao
Huang, Yanhao
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (04) : 5087 - 5099
[23] Shapley-Optimized Reinforcement Learning for Human-Machine Collaboration Policy
Zhang, Jie
Niu, Yiqun
He, Wei
Jin, Cheng
Wang, Chongjun
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT 2, 2025, 14851 : 291 - 300
[24] Eye-Tracking Sensors for Adaptive Aerospace Human-Machine Interfaces and Interactions
Lim, Yixiang
Gardi, Alessandro
Ezer, Neta
Kistan, Trevor
Sabatini, Roberto
2018 5TH IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR AEROSPACE (METROAEROSPACE), 2018, : 311 - 316
[25] Changes of human-machine interfaces that connect human and machines
1600, Institute of Electrical Engineers of Japan (140): : 100 - 103
[26] Human-machine interfaces for minimally invasive surgery
Tendick, F
Cavusoglu, MC
PROCEEDINGS OF THE 19TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 19, PTS 1-6: MAGNIFICENT MILESTONES AND EMERGING OPPORTUNITIES IN MEDICAL ENGINEERING, 1997, 19 : 2771 - 2776
[27] Asymmetric "Janus" Biogel for Human-Machine Interfaces
Wei, Yuan
He, Yingying
Wang, Chunyu
Chen, Gang
Zhao, Boxin
ADVANCED FUNCTIONAL MATERIALS, 2023, 33 (34)
[28] Automatic Generation of Smart Human-Machine Interfaces
De Biase, Maria Stella
Marrone, Stefano
Marulli, Fiammetta
PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2020, : 22 - 25
[29] Information fusion in the context of human-machine interfaces
2005, Elsevier, Amsterdam, Netherlands (06)
[30] Overview of Auditory Representations in Human-Machine Interfaces
Csapo, Adam
Wersenyi, Gyoergy
ACM COMPUTING SURVEYS, 2013, 46 (02)

← 1 2 3 4 5 →