Enhancing Robot Task Planning and Execution through Multi-Layer Large Language Models

Authors
Luan, Zhirong [1]
Lai, Yujun [1]
Huang, Rundong [1]
Bai, Shuanghao [2]
Zhang, Yuedi [2]
Zhang, Haoran [2]
Wang, Qian [1]
Affiliations
[1] Xian Univ Technol, Sch Elect Engn, Xian 710000, Peoples R China
[2] Xi An Jiao Tong Univ, Coll Artificial Intelligence, Xian 710000, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
robots; large language models; natural language; semantic alignment method;
DOI
10.3390/s24051687
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Discipline classification code
070302 ; 081704 ;
Abstract
Large language models have found utility in the domain of robot task planning and task decomposition. Nevertheless, the direct application of these models for instructing robots in task execution is not without its challenges. Limitations arise in handling more intricate tasks, encountering difficulties in effective interaction with the environment, and facing constraints in the practical executability of machine control instructions directly generated by such models. In response to these challenges, this research advocates for the implementation of a multi-layer large language model to augment a robot's proficiency in handling complex tasks. The proposed model facilitates a meticulous layer-by-layer decomposition of tasks through the integration of multiple large language models, with the overarching goal of enhancing the accuracy of task planning. Within the task decomposition process, a visual language model is introduced as a sensor for environment perception. The outcomes of this perception process are subsequently assimilated into the large language model, thereby amalgamating the task objectives with environmental information. This integration, in turn, results in the generation of robot motion planning tailored to the specific characteristics of the current environment. Furthermore, to enhance the executability of task planning outputs from the large language model, a semantic alignment method is introduced. This method aligns task planning descriptions with the functional requirements of robot motion, thereby refining the overall compatibility and coherence of the generated instructions. To validate the efficacy of the proposed approach, an experimental platform is established utilizing an intelligent unmanned vehicle. This platform serves as a means to empirically verify the proficiency of the multi-layer large language model in addressing the intricate challenges associated with both robot task planning and execution.
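The semantic alignment step mentioned in the abstract, which maps free-text task planning descriptions onto the functions a robot can actually execute, can be illustrated with a minimal sketch. The abstract does not specify the alignment technique or the robot's function set, so everything here is an assumption: the primitive names in `ROBOT_FUNCTIONS`, their keyword descriptions, the `threshold` parameter, and the use of token-overlap (Jaccard) similarity as a simple stand-in for whatever the paper implements.

```python
import re

# Hypothetical motion primitives for a small unmanned vehicle. The paper's
# actual function set is not given in the abstract; these names and keyword
# descriptions are assumptions made for illustration only.
ROBOT_FUNCTIONS = {
    "move_forward": "move drive go forward ahead straight",
    "turn_left": "turn rotate left",
    "turn_right": "turn rotate right",
    "stop": "stop halt wait brake",
}


def tokenize(text):
    """Lowercase the text and return its set of alphabetic word tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 if either set is empty)."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def align_step(step, threshold=0.1):
    """Map one natural-language plan step to the best-matching primitive.

    Returns the primitive's name, or None when no primitive reaches the
    similarity threshold (the step is treated as not executable).
    """
    step_tokens = tokenize(step)
    best_name, best_score = None, 0.0
    for name, description in ROBOT_FUNCTIONS.items():
        score = jaccard(step_tokens, tokenize(description))
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score >= threshold else None


def align_plan(steps):
    """Align every step of a plan; unmatched steps come back as None."""
    return [align_step(step) for step in steps]


if __name__ == "__main__":
    plan = [
        "Drive straight ahead to the door",
        "Turn left at the corner",
        "Pick up the cup",
        "Stop and wait",
    ]
    for step, function in zip(plan, align_plan(plan)):
        print(f"{step!r} -> {function}")
```

In this sketch a step with no sufficiently similar primitive is returned as `None` rather than forced onto the nearest function, so an upstream planner could be asked to re-decompose it. A real system would more plausibly use learned text embeddings than keyword overlap; that choice is outside what the abstract states.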
Pages: 20