Large Language Models as Zero-Shot Human Models for Human-Robot Interaction

被引：54

作者：

Zhang, Bowen ^{[1
]}

Soh, Harold ^{[2
]}

机构：

[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore

[2] NUS, Smart Syst Inst SSI, Singapore, Singapore

来源：

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.1109/IROS55552.2023.10341488

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human models play a crucial role in human-robot interaction (HRI), enabling robots to consider the impact of their actions on people and plan their behavior accordingly. However, crafting good human models is challenging; capturing context-dependent human behavior requires significant prior knowledge and/or large amounts of interaction data, both of which are difficult to obtain. In this work, we explore the potential of large language models (LLMs) - which have consumed vast amounts of human-generated text data - to act as zero-shot human models for HRI. Our experiments on three social datasets yield promising results; the LLMs are able to achieve performance comparable to purpose-built models. That said, we also discuss current limitations, such as sensitivity to prompts and spatial/numerical reasoning mishaps. Based on our findings, we demonstrate how LLM-based human models can be integrated into a social robot's planning process and applied in HRI scenarios focused on the important element of trust. Specifically, we present one case study on a simulated trust-based table-clearing task and replicate past results that relied on custom models. Next, we conduct a new robot utensil-passing experiment ( n = 65) where preliminary results show that planning with an LLM-based human model can achieve gains over a basic myopic plan. In summary, our results show that LLMs offer a promising (but incomplete) approach to human modeling for HRI.

引用

页码：7961 / 7968

页数：8

共 50 条

[11] Zero-Shot Prediction of Conversational Derailment With Large Language Models [J].

Nonaka, Kenya ;

Yoshida, Mitsuo .

IEEE ACCESS, 2025, 13 :55081-55093

[12] Large Language Models Are Zero-Shot Time Series Forecasters [J].

Gruver, Nate ;

Finzi, Marc ;

Qiu, Shikai ;

Wilson, Andrew Gordon .

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,

[13] Examining Zero-Shot Vulnerability Repair with Large Language Models [J].

Pearce, Hammond ;

Tan, Benjamin ;

Ahmad, Baleegh ;

Karri, Ramesh ;

Dolan-Gavitt, Brendan .

2023 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP, 2023, :2339-2356

[14] Large Language Models are Zero-Shot Next Location Predictors [J].

Beneduce, Ciro ;

Lepri, Bruno ;

Luca, Massimiliano .

IEEE ACCESS, 2025, 13 :77456-77467

[15] Examining Zero-Shot Vulnerability Repair with Large Language Models [J].

Pearce, Hammond ;

Tan, Benjamin ;

Ahmad, Baleegh ;

Karri, Ramesh ;

Dolan-Gavitt, Brendan .

2023 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP, 2023, :2339-2356

[16] Revisiting Large Language Models as Zero-shot Relation Extractors [J].

Li, Guozheng ;

Wang, Peng ;

Ke, Wenjun .

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, :6877-6892

[17] Multi-modal Language Models for Human-Robot Interaction [J].

Janssens, Ruben .

COMPANION OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024 COMPANION, 2024, :109-111

[18] Multi-turn Instruction Invocation on Human-Robot Interaction by Large Language Models [J].

Cheng, Baoping ;

Huang, Yong ;

Sun, Xiaoran ;

Hu, Jingxi ;

Li, Bo ;

Pu, Qiran ;

Wu, Zijian ;

Tao, Xiaoming .

INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT VII, 2025, 15207 :207-219

[19] Talk With Machines: Enhancing Human-Robot Interaction Through Large/Vision Language Models [J].

Abbas, Ammar N. ;

Beleznai, Csaba .

2024 EIGHTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC 2024, 2024, :253-258

[20] Comparison of various models of robot and human in human-robot interaction [J].

Luh, JYS ;

Hu, SY .

1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, :1139-1144

← 1 2 3 4 5 →