Large Language Models as Zero-Shot Human Models for Human-Robot Interaction

被引：42

作者：

Zhang, Bowen ^{[1
]}

Soh, Harold ^{[2
]}

机构：

[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore

[2] NUS, Smart Syst Inst SSI, Singapore, Singapore

来源：

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.1109/IROS55552.2023.10341488

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human models play a crucial role in human-robot interaction (HRI), enabling robots to consider the impact of their actions on people and plan their behavior accordingly. However, crafting good human models is challenging; capturing context-dependent human behavior requires significant prior knowledge and/or large amounts of interaction data, both of which are difficult to obtain. In this work, we explore the potential of large language models (LLMs) - which have consumed vast amounts of human-generated text data - to act as zero-shot human models for HRI. Our experiments on three social datasets yield promising results; the LLMs are able to achieve performance comparable to purpose-built models. That said, we also discuss current limitations, such as sensitivity to prompts and spatial/numerical reasoning mishaps. Based on our findings, we demonstrate how LLM-based human models can be integrated into a social robot's planning process and applied in HRI scenarios focused on the important element of trust. Specifically, we present one case study on a simulated trust-based table-clearing task and replicate past results that relied on custom models. Next, we conduct a new robot utensil-passing experiment ( n = 65) where preliminary results show that planning with an LLM-based human model can achieve gains over a basic myopic plan. In summary, our results show that LLMs offer a promising (but incomplete) approach to human modeling for HRI.

引用

页码：7961 / 7968

页数：8

共 50 条

[21] Language Models as Zero-Shot Trajectory Generators [J].

Kwon, Teyun ;

Di Palo, Norman ;

Johns, Edward .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07) :6728-6735

[22] MEDAGENTS: Large Language Models as Collaborators for Zero-shot Medical Reasoning [J].

Tang, Xiangru ;

Zou, Anni ;

Zhang, Zhuosheng ;

Li, Ziming ;

Zhao, Yilun ;

Zhang, Xingyao ;

Cohen, Arman ;

Gerstein, Mark .

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, :599-621

[23] Zero-shot Bilingual App Reviews Mining with Large Language Models [J].

Wei, Jialiang ;

Courbis, Anne-Lise ;

Lambolais, Thomas ;

Xu, Binbin ;

Bernard, Pierre Louis ;

Dray, Gerard .

2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, :898-904

[24] ZERO-SHOT AUDIO TOPIC RERANKING USING LARGE LANGUAGE MODELS [J].

Qian, Mengjie ;

Ma, Rao ;

Liusie, Adian ;

Loweimi, Erfan ;

Knill, Kate M. ;

Gales, Mark J. E. .

2024 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2024, :1099-1106

[25] Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL [J].

Fan, Ju ;

Gu, Zihui ;

Zhang, Songyue ;

Zhang, Yuxin ;

Chen, Zui ;

Cao, Lei ;

Li, Guoliang ;

Madden, Samuel ;

Du, Xiaoyong ;

Tang, Nan .

PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (11) :2750-2763

[26] Large Language Models Are Zero-Shot Fuzzers: Fuzzing Deep-Learning Libraries via Large Language Models [J].

Deng, Yinlin ;

Xia, Chunqiu Steven ;

Peng, Haoran ;

Yang, Chenyuan ;

Zhan, Lingming .

PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, :423-435

[27] Zero-shot interpretable phenotyping of postpartum hemorrhage using large language models [J].

Alsentzer, Emily ;

Rasmussen, Matthew J. ;

Fontoura, Romy ;

Cull, Alexis L. ;

Beaulieu-Jones, Brett ;

Gray, Kathryn J. ;

Bates, David W. ;

Kovacheva, Vesela P. .

NPJ DIGITAL MEDICINE, 2023, 6 (01)

[28] PqE: Zero-Shot Document Expansion for Dense Retrieval with Large Language Models [J].

Liu, Jiyuan ;

Zou, Dongsheng ;

Chai, Naiquan ;

Yang, Yuming ;

Wang, Hao ;

Song, Xinyi .

NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT I, NLPCC 2024, 2025, 15359 :97-109

[29] Zero-Shot Generative Large Language Models for Systematic Review Screening Automation [J].

Wang, Shuai ;

Scells, Harrisen ;

Zhuang, Shengyao ;

Potthast, Martin ;

Koopman, Bevan ;

Zuccon, Guido .

ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 :403-420

[30] Improving Zero-Shot Text Matching for Financial Auditing with Large Language Models [J].

Hillebrand, Lars ;

Berger, Armin ;

Deusser, Tobias ;

Dilmaghani, Tim ;

Khaled, Mohamed ;

Kliem, Bernd ;

Loitz, Ruediger ;

Pielka, Maren ;

Leonhard, David ;

Bauckhage, Christian ;

Sifa, Rafet .

PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2023, 2023,

← 1 2 3 4 5 →