Large Language Models as Zero-Shot Human Models for Human-Robot Interaction

被引:54
作者
Zhang, Bowen [1 ]
Soh, Harold [2 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
[2] NUS, Smart Syst Inst SSI, Singapore, Singapore
来源
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/IROS55552.2023.10341488
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human models play a crucial role in human-robot interaction (HRI), enabling robots to consider the impact of their actions on people and plan their behavior accordingly. However, crafting good human models is challenging; capturing context-dependent human behavior requires significant prior knowledge and/or large amounts of interaction data, both of which are difficult to obtain. In this work, we explore the potential of large language models (LLMs) - which have consumed vast amounts of human-generated text data - to act as zero-shot human models for HRI. Our experiments on three social datasets yield promising results; the LLMs are able to achieve performance comparable to purpose-built models. That said, we also discuss current limitations, such as sensitivity to prompts and spatial/numerical reasoning mishaps. Based on our findings, we demonstrate how LLM-based human models can be integrated into a social robot's planning process and applied in HRI scenarios focused on the important element of trust. Specifically, we present one case study on a simulated trust-based table-clearing task and replicate past results that relied on custom models. Next, we conduct a new robot utensil-passing experiment ( n = 65) where preliminary results show that planning with an LLM-based human model can achieve gains over a basic myopic plan. In summary, our results show that LLMs offer a promising (but incomplete) approach to human modeling for HRI.
引用
收藏
页码:7961 / 7968
页数:8
相关论文
共 50 条
[41]   Learning Anomaly Detection Models for Human-Robot Interaction [J].
Mochizuki, Shota ;
Yamashita, Sanae ;
Yuasa, Reiko ;
Kubota, Tomonori ;
Ogawa, Kohei ;
Higashinaka, Ryuichiro .
2024 33RD IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, ROMAN 2024, 2024, :1720-1725
[42]   Communication Models in Human-Robot Interaction: An Asymmetric MODel of ALterity in Human-Robot Interaction (AMODAL-HRI) [J].
Frijns, Helena Anna ;
Schuerer, Oliver ;
Koeszegi, Sabine Theresia .
INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2023, 15 (03) :473-500
[43]   Extensible Prompts for Language Models on Zero-shot Language Style Customization [J].
Ge, Tao ;
Hu, Jing ;
Dong, Li ;
Mao, Shaoguang ;
Xia, Yan ;
Wang, Xun ;
Chen, Si-Qing ;
Wei, Furu .
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[44]   Safe and Efficient Exploration of Human Models During Human-Robot Interaction [J].
Pandya, Ravi ;
Liu, Changliu .
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, :6708-6715
[45]   LM-Infinite: Zero-Shot Extreme Length Generalization for Large Language Models [J].
Han, Chi ;
Wang, Qifan ;
Peng, Hao ;
Xiong, Wenhan ;
Chen, Yu ;
Ji, Heng ;
Wang, Sinong .
PROCEEDINGS OF THE 2024 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, VOL 1: LONG PAPERS, 2024, :3991-4008
[46]   Zero-Shot ECG Diagnosis with Large Language Models and Retrieval-Augmented Generation [J].
Yu, Han ;
Guo, Peikun ;
Sano, Akane .
MACHINE LEARNING FOR HEALTH, ML4H, VOL 225, 2023, 225 :650-663
[47]   COFT: Making Large Language Models Better zero-shot Learners for Code Generation [J].
Li, Weijia ;
Qian, Yongjie ;
Gao, Ke ;
Chen, Haixin ;
Wang, Xinyu ;
Tong, Yuchen ;
Li, Ling ;
Wu, Yanjun ;
Zhao, Chen .
2025 IEEE/ACM 33RD INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC, 2025, :489-499
[48]   Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors [J].
Zhang, Kai ;
Gutierrez, Bernal Jimenez ;
Su, Yu .
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, :794-812
[49]   Large Language Models as Zero-shot Dialogue State Tracker through Function Calling [J].
Li, Zekun ;
Chen, Zhiyu Zoey ;
Ross, Mike ;
Huber, Patrick ;
Moon, Seungwhan ;
Lin, Zhaojiang ;
Dong, Luna ;
Sagar, Adithya ;
Yan, Xifeng ;
Crook, Paul A. .
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, :8688-8704
[50]   zrLLM: Zero-Shot Relational Learning on Temporal Knowledge Graphs with Large Language Models [J].
Ding, Zifeng ;
Cai, Heling ;
Wu, Jingpein ;
Ma, Yunpu ;
Liao, Ruotong ;
Xiong, Bo ;
Tresp, Volker .
PROCEEDINGS OF THE 2024 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, VOL 1: LONG PAPERS, 2024, :1877-1895