Large Language Models as Zero-Shot Human Models for Human-Robot Interaction

被引:54
作者
Zhang, Bowen [1 ]
Soh, Harold [2 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
[2] NUS, Smart Syst Inst SSI, Singapore, Singapore
来源
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/IROS55552.2023.10341488
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human models play a crucial role in human-robot interaction (HRI), enabling robots to consider the impact of their actions on people and plan their behavior accordingly. However, crafting good human models is challenging; capturing context-dependent human behavior requires significant prior knowledge and/or large amounts of interaction data, both of which are difficult to obtain. In this work, we explore the potential of large language models (LLMs) - which have consumed vast amounts of human-generated text data - to act as zero-shot human models for HRI. Our experiments on three social datasets yield promising results; the LLMs are able to achieve performance comparable to purpose-built models. That said, we also discuss current limitations, such as sensitivity to prompts and spatial/numerical reasoning mishaps. Based on our findings, we demonstrate how LLM-based human models can be integrated into a social robot's planning process and applied in HRI scenarios focused on the important element of trust. Specifically, we present one case study on a simulated trust-based table-clearing task and replicate past results that relied on custom models. Next, we conduct a new robot utensil-passing experiment ( n = 65) where preliminary results show that planning with an LLM-based human model can achieve gains over a basic myopic plan. In summary, our results show that LLMs offer a promising (but incomplete) approach to human modeling for HRI.
引用
收藏
页码:7961 / 7968
页数:8
相关论文
共 50 条
[11]   Zero-Shot Prediction of Conversational Derailment With Large Language Models [J].
Nonaka, Kenya ;
Yoshida, Mitsuo .
IEEE ACCESS, 2025, 13 :55081-55093
[12]   Large Language Models Are Zero-Shot Time Series Forecasters [J].
Gruver, Nate ;
Finzi, Marc ;
Qiu, Shikai ;
Wilson, Andrew Gordon .
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[13]   Examining Zero-Shot Vulnerability Repair with Large Language Models [J].
Pearce, Hammond ;
Tan, Benjamin ;
Ahmad, Baleegh ;
Karri, Ramesh ;
Dolan-Gavitt, Brendan .
2023 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP, 2023, :2339-2356
[14]   Large Language Models are Zero-Shot Next Location Predictors [J].
Beneduce, Ciro ;
Lepri, Bruno ;
Luca, Massimiliano .
IEEE ACCESS, 2025, 13 :77456-77467
[15]   Examining Zero-Shot Vulnerability Repair with Large Language Models [J].
Pearce, Hammond ;
Tan, Benjamin ;
Ahmad, Baleegh ;
Karri, Ramesh ;
Dolan-Gavitt, Brendan .
2023 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP, 2023, :2339-2356
[16]   Revisiting Large Language Models as Zero-shot Relation Extractors [J].
Li, Guozheng ;
Wang, Peng ;
Ke, Wenjun .
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, :6877-6892
[17]   Multi-modal Language Models for Human-Robot Interaction [J].
Janssens, Ruben .
COMPANION OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024 COMPANION, 2024, :109-111
[18]   Multi-turn Instruction Invocation on Human-Robot Interaction by Large Language Models [J].
Cheng, Baoping ;
Huang, Yong ;
Sun, Xiaoran ;
Hu, Jingxi ;
Li, Bo ;
Pu, Qiran ;
Wu, Zijian ;
Tao, Xiaoming .
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT VII, 2025, 15207 :207-219
[19]   Talk With Machines: Enhancing Human-Robot Interaction Through Large/Vision Language Models [J].
Abbas, Ammar N. ;
Beleznai, Csaba .
2024 EIGHTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC 2024, 2024, :253-258
[20]   Comparison of various models of robot and human in human-robot interaction [J].
Luh, JYS ;
Hu, SY .
1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, :1139-1144