Large Language Models as Zero-Shot Human Models for Human-Robot Interaction

被引:42
作者
Zhang, Bowen [1 ]
Soh, Harold [2 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
[2] NUS, Smart Syst Inst SSI, Singapore, Singapore
来源
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/IROS55552.2023.10341488
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human models play a crucial role in human-robot interaction (HRI), enabling robots to consider the impact of their actions on people and plan their behavior accordingly. However, crafting good human models is challenging; capturing context-dependent human behavior requires significant prior knowledge and/or large amounts of interaction data, both of which are difficult to obtain. In this work, we explore the potential of large language models (LLMs) - which have consumed vast amounts of human-generated text data - to act as zero-shot human models for HRI. Our experiments on three social datasets yield promising results; the LLMs are able to achieve performance comparable to purpose-built models. That said, we also discuss current limitations, such as sensitivity to prompts and spatial/numerical reasoning mishaps. Based on our findings, we demonstrate how LLM-based human models can be integrated into a social robot's planning process and applied in HRI scenarios focused on the important element of trust. Specifically, we present one case study on a simulated trust-based table-clearing task and replicate past results that relied on custom models. Next, we conduct a new robot utensil-passing experiment ( n = 65) where preliminary results show that planning with an LLM-based human model can achieve gains over a basic myopic plan. In summary, our results show that LLMs offer a promising (but incomplete) approach to human modeling for HRI.
引用
收藏
页码:7961 / 7968
页数:8
相关论文
共 50 条
[21]   Language Models as Zero-Shot Trajectory Generators [J].
Kwon, Teyun ;
Di Palo, Norman ;
Johns, Edward .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07) :6728-6735
[22]   MEDAGENTS: Large Language Models as Collaborators for Zero-shot Medical Reasoning [J].
Tang, Xiangru ;
Zou, Anni ;
Zhang, Zhuosheng ;
Li, Ziming ;
Zhao, Yilun ;
Zhang, Xingyao ;
Cohen, Arman ;
Gerstein, Mark .
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, :599-621
[23]   Zero-shot Bilingual App Reviews Mining with Large Language Models [J].
Wei, Jialiang ;
Courbis, Anne-Lise ;
Lambolais, Thomas ;
Xu, Binbin ;
Bernard, Pierre Louis ;
Dray, Gerard .
2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, :898-904
[24]   ZERO-SHOT AUDIO TOPIC RERANKING USING LARGE LANGUAGE MODELS [J].
Qian, Mengjie ;
Ma, Rao ;
Liusie, Adian ;
Loweimi, Erfan ;
Knill, Kate M. ;
Gales, Mark J. E. .
2024 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2024, :1099-1106
[25]   Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL [J].
Fan, Ju ;
Gu, Zihui ;
Zhang, Songyue ;
Zhang, Yuxin ;
Chen, Zui ;
Cao, Lei ;
Li, Guoliang ;
Madden, Samuel ;
Du, Xiaoyong ;
Tang, Nan .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (11) :2750-2763
[26]   Large Language Models Are Zero-Shot Fuzzers: Fuzzing Deep-Learning Libraries via Large Language Models [J].
Deng, Yinlin ;
Xia, Chunqiu Steven ;
Peng, Haoran ;
Yang, Chenyuan ;
Zhan, Lingming .
PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, :423-435
[27]   Zero-shot interpretable phenotyping of postpartum hemorrhage using large language models [J].
Alsentzer, Emily ;
Rasmussen, Matthew J. ;
Fontoura, Romy ;
Cull, Alexis L. ;
Beaulieu-Jones, Brett ;
Gray, Kathryn J. ;
Bates, David W. ;
Kovacheva, Vesela P. .
NPJ DIGITAL MEDICINE, 2023, 6 (01)
[28]   PqE: Zero-Shot Document Expansion for Dense Retrieval with Large Language Models [J].
Liu, Jiyuan ;
Zou, Dongsheng ;
Chai, Naiquan ;
Yang, Yuming ;
Wang, Hao ;
Song, Xinyi .
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT I, NLPCC 2024, 2025, 15359 :97-109
[29]   Zero-Shot Generative Large Language Models for Systematic Review Screening Automation [J].
Wang, Shuai ;
Scells, Harrisen ;
Zhuang, Shengyao ;
Potthast, Martin ;
Koopman, Bevan ;
Zuccon, Guido .
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 :403-420
[30]   Improving Zero-Shot Text Matching for Financial Auditing with Large Language Models [J].
Hillebrand, Lars ;
Berger, Armin ;
Deusser, Tobias ;
Dilmaghani, Tim ;
Khaled, Mohamed ;
Kliem, Bernd ;
Loitz, Ruediger ;
Pielka, Maren ;
Leonhard, David ;
Bauckhage, Christian ;
Sifa, Rafet .
PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2023, 2023,