Large language models for human-robot interaction: A review

被引:115
作者
Zhang, Ceng [1 ]
Chen, Junxin [2 ]
Li, Jiatong [2 ]
Peng, Yanhong [3 ]
Mao, Zebing [4 ]
机构
[1] Natl Univ Singapore, Dept Mech Engn, Singapore 119077, Singapore
[2] Beijing Jiaotong Univ, Sch Mech Elect & Control Engn, Beijing 100044, Peoples R China
[3] Nagoya Univ, Grad Sch Engn, Dept Informat & Commun Engn, Nagoya 4648603, Japan
[4] Tokyo Inst Technol, Dept Mech Engn, Tokyo 1528550, Japan
来源
BIOMIMETIC INTELLIGENCE AND ROBOTICS | 2023年 / 3卷 / 04期
基金
日本学术振兴会;
关键词
Large language models; Human-robot interaction; Task completion; Considerations and challenges;
D O I
10.1016/j.birob.2023.100131
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The fusion of large language models and robotic systems has introduced a transformative paradigm in human-robot interaction, offering unparalleled capabilities in natural language understanding and task execution. This review paper offers a comprehensive analysis of this nascent but rapidly evolving domain, spotlighting the recent advances of Large Language Models (LLMs) in enhancing their structures and performances, particularly in terms of multimodal input handling, high-level reasoning, and plan generation. Moreover, it probes the current methodologies that integrate LLMs into robotic systems for complex task completion, from traditional probabilistic models to the utilization of value functions and metrics for optimal decision-making. Despite these advancements, the paper also reveals the formidable challenges that confront the field, such as contextual understanding, data privacy and ethical considerations. To our best knowledge, this is the first study to comprehensively analyze the advances and considerations of LLMs in Human-Robot Interaction (HRI) based on recent progress, which provides potential avenues for further research. (c) 2023 The Author(s). Published by Elsevier B.V. on behalf of Shandong University. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:15
相关论文
共 122 条
[1]  
Ahn M, 2022, Arxiv, DOI arXiv:2204.01691
[2]  
Al-Rfou R, 2019, AAAI CONF ARTIF INTE, P3159
[3]  
Anand A, 2023, Arxiv, DOI arXiv:2306.16004
[4]   A New Control Architecture for Physical Human-Robot Interaction Based on Haptic Communication [J].
Aydin, Yusuf ;
Arghavani, Nasser ;
Basdogan, Cagatay .
HRI'14: PROCEEDINGS OF THE 2014 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2014, :122-123
[5]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, DOI 10.48550/ARXIV.1409.0473]
[6]   Social robots for education: A review [J].
Belpaeme, Tony ;
Kennedy, James ;
Ramachandran, Aditi ;
Scassellati, Brian ;
Tanaka, Fumihide .
SCIENCE ROBOTICS, 2018, 3 (21)
[7]   On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? [J].
Bender, Emily M. ;
Gebru, Timnit ;
McMillan-Major, Angelina ;
Shmitchell, Shmargaret .
PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, :610-623
[8]  
Bianchi M, 2016, IEEE ROMAN, P125, DOI 10.1109/ROMAN.2016.7745100
[9]  
Brants T., 2007, Large Language Models in Machine Translation
[10]  
Brohan A, 2023, Arxiv, DOI arXiv:2307.15818