Large language models for human-robot interaction: A review

被引：115

作者：

Zhang, Ceng ^{[1
]}

Chen, Junxin ^{[2
]}

Li, Jiatong ^{[2
]}

Peng, Yanhong ^{[3
]}

Mao, Zebing ^{[4
]}

机构：

[1] Natl Univ Singapore, Dept Mech Engn, Singapore 119077, Singapore

[2] Beijing Jiaotong Univ, Sch Mech Elect & Control Engn, Beijing 100044, Peoples R China

[3] Nagoya Univ, Grad Sch Engn, Dept Informat & Commun Engn, Nagoya 4648603, Japan

[4] Tokyo Inst Technol, Dept Mech Engn, Tokyo 1528550, Japan

来源：

BIOMIMETIC INTELLIGENCE AND ROBOTICS | 2023年 / 3卷 / 04期

基金：

日本学术振兴会;

关键词：

Large language models; Human-robot interaction; Task completion; Considerations and challenges;

D O I：

10.1016/j.birob.2023.100131

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

The fusion of large language models and robotic systems has introduced a transformative paradigm in human-robot interaction, offering unparalleled capabilities in natural language understanding and task execution. This review paper offers a comprehensive analysis of this nascent but rapidly evolving domain, spotlighting the recent advances of Large Language Models (LLMs) in enhancing their structures and performances, particularly in terms of multimodal input handling, high-level reasoning, and plan generation. Moreover, it probes the current methodologies that integrate LLMs into robotic systems for complex task completion, from traditional probabilistic models to the utilization of value functions and metrics for optimal decision-making. Despite these advancements, the paper also reveals the formidable challenges that confront the field, such as contextual understanding, data privacy and ethical considerations. To our best knowledge, this is the first study to comprehensively analyze the advances and considerations of LLMs in Human-Robot Interaction (HRI) based on recent progress, which provides potential avenues for further research. (c) 2023 The Author(s). Published by Elsevier B.V. on behalf of Shandong University. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

引用

页数：15

共 122 条

[1]

Ahn M, 2022, Arxiv, DOI arXiv:2204.01691

[2]

Al-Rfou R, 2019, AAAI CONF ARTIF INTE, P3159

[3]

Anand A, 2023, Arxiv, DOI arXiv:2306.16004

[4] A New Control Architecture for Physical Human-Robot Interaction Based on Haptic Communication [J].

Aydin, Yusuf ;

Arghavani, Nasser ;

Basdogan, Cagatay .

HRI'14: PROCEEDINGS OF THE 2014 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2014, :122-123

[5]

Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, DOI 10.48550/ARXIV.1409.0473]

[6] Social robots for education: A review [J].

Belpaeme, Tony ;

Kennedy, James ;

Ramachandran, Aditi ;

Scassellati, Brian ;

Tanaka, Fumihide .

SCIENCE ROBOTICS, 2018, 3 (21)

[7] On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? [J].

Bender, Emily M. ;

Gebru, Timnit ;

McMillan-Major, Angelina ;

Shmitchell, Shmargaret .

PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, :610-623

[8]

Bianchi M, 2016, IEEE ROMAN, P125, DOI 10.1109/ROMAN.2016.7745100

[9]

Brants T., 2007, Large Language Models in Machine Translation

[10]

Brohan A, 2023, Arxiv, DOI arXiv:2307.15818

← 1 2 3 4 5 6 7 8 9 10 →