Large Language Models as Zero-Shot Human Models for Human-Robot Interaction

被引：54

作者：

Zhang, Bowen ^{[1
]}

Soh, Harold ^{[2
]}

机构：

[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore

[2] NUS, Smart Syst Inst SSI, Singapore, Singapore

来源：

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.1109/IROS55552.2023.10341488

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human models play a crucial role in human-robot interaction (HRI), enabling robots to consider the impact of their actions on people and plan their behavior accordingly. However, crafting good human models is challenging; capturing context-dependent human behavior requires significant prior knowledge and/or large amounts of interaction data, both of which are difficult to obtain. In this work, we explore the potential of large language models (LLMs) - which have consumed vast amounts of human-generated text data - to act as zero-shot human models for HRI. Our experiments on three social datasets yield promising results; the LLMs are able to achieve performance comparable to purpose-built models. That said, we also discuss current limitations, such as sensitivity to prompts and spatial/numerical reasoning mishaps. Based on our findings, we demonstrate how LLM-based human models can be integrated into a social robot's planning process and applied in HRI scenarios focused on the important element of trust. Specifically, we present one case study on a simulated trust-based table-clearing task and replicate past results that relied on custom models. Next, we conduct a new robot utensil-passing experiment ( n = 65) where preliminary results show that planning with an LLM-based human model can achieve gains over a basic myopic plan. In summary, our results show that LLMs offer a promising (but incomplete) approach to human modeling for HRI.

引用

页码：7961 / 7968

页数：8

共 50 条

[41] Learning Anomaly Detection Models for Human-Robot Interaction [J].

Mochizuki, Shota ;

Yamashita, Sanae ;

Yuasa, Reiko ;

Kubota, Tomonori ;

Ogawa, Kohei ;

Higashinaka, Ryuichiro .

2024 33RD IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, ROMAN 2024, 2024, :1720-1725

[42] Communication Models in Human-Robot Interaction: An Asymmetric MODel of ALterity in Human-Robot Interaction (AMODAL-HRI) [J].

Frijns, Helena Anna ;

Schuerer, Oliver ;

Koeszegi, Sabine Theresia .

INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2023, 15 (03) :473-500

[43] Extensible Prompts for Language Models on Zero-shot Language Style Customization [J].

Ge, Tao ;

Hu, Jing ;

Dong, Li ;

Mao, Shaoguang ;

Xia, Yan ;

Wang, Xun ;

Chen, Si-Qing ;

Wei, Furu .

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,

[44] Safe and Efficient Exploration of Human Models During Human-Robot Interaction [J].

Pandya, Ravi ;

Liu, Changliu .

2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, :6708-6715

[45] LM-Infinite: Zero-Shot Extreme Length Generalization for Large Language Models [J].

Han, Chi ;

Wang, Qifan ;

Peng, Hao ;

Xiong, Wenhan ;

Chen, Yu ;

Ji, Heng ;

Wang, Sinong .

PROCEEDINGS OF THE 2024 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, VOL 1: LONG PAPERS, 2024, :3991-4008

[46] Zero-Shot ECG Diagnosis with Large Language Models and Retrieval-Augmented Generation [J].

Yu, Han ;

Guo, Peikun ;

Sano, Akane .

MACHINE LEARNING FOR HEALTH, ML4H, VOL 225, 2023, 225 :650-663

[47] COFT: Making Large Language Models Better zero-shot Learners for Code Generation [J].

Li, Weijia ;

Qian, Yongjie ;

Gao, Ke ;

Chen, Haixin ;

Wang, Xinyu ;

Tong, Yuchen ;

Li, Ling ;

Wu, Yanjun ;

Zhao, Chen .

2025 IEEE/ACM 33RD INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC, 2025, :489-499

[48] Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors [J].

Zhang, Kai ;

Gutierrez, Bernal Jimenez ;

Su, Yu .

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, :794-812

[49] Large Language Models as Zero-shot Dialogue State Tracker through Function Calling [J].

Li, Zekun ;

Chen, Zhiyu Zoey ;

Ross, Mike ;

Huber, Patrick ;

Moon, Seungwhan ;

Lin, Zhaojiang ;

Dong, Luna ;

Sagar, Adithya ;

Yan, Xifeng ;

Crook, Paul A. .

PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, :8688-8704

[50] zrLLM: Zero-Shot Relational Learning on Temporal Knowledge Graphs with Large Language Models [J].

Ding, Zifeng ;

Cai, Heling ;

Wu, Jingpein ;

Ma, Yunpu ;

Liao, Ruotong ;

Xiong, Bo ;

Tresp, Volker .

PROCEEDINGS OF THE 2024 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, VOL 1: LONG PAPERS, 2024, :1877-1895

← 1 2 3 4 5 →