TARG: Tree of Action-reward Generation With Large Language Model for Cabinet Opening Using Manipulator

被引:0
作者
Park, Sung-Gil [1 ,2 ]
Kim, Han-Byeol [2 ]
Lee, Yong-Jun [1 ]
Ahn, Woo-Jin [1 ]
Lim, Myo Taeg [1 ]
机构
[1] Korea Univ, Sch Elect Engn, 145 Anamro, Seoul 02841, South Korea
[2] LG Elect, 19 Yangjae Daero 11 Gil, Seoul 06772, South Korea
基金
新加坡国家研究基金会;
关键词
Large language model; reinforcement learning; reward generation; robot manipulation; task planning;
D O I
10.1007/s12555-024-0528-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In robotics, reinforcement learning (RL) is often used to help robots learn complex tasks through interactions with their environment. A crucial aspect of RL is the design of reward functions; these functions guide the learning process by providing feedback on a robot's actions. However, crafting these reward functions manually is time-consuming and requires extensive human expertise. In this paper, we propose a tree of action-reward generation (TARG) model that automates reward generation for a given task without the need for human fine-tuning. By using a large language model (LLM), we create a systematic action plan sequence to generate a tree of action that guides RL training. Proposed method facilitates the automatic generation of a reward tree, which stabilizes the training process. To demonstrate the effectiveness of the proposed TARG framework, we conducted experiments involving a cabinet opening task within the IsaacSim simulation environment. The results demonstrated the potential of the proposed framework to significantly improve the adaptability and performance of robots in complex settings.
引用
收藏
页码:449 / 458
页数:10
相关论文
共 59 条
  • [1] Abbeel P., 2004, Proc. of Proceedings of the twenty-first International Conference on Machine Learning
  • [2] Achiam J., 2023, ARXIV, DOI DOI 10.48550/ARXIV.2303.08774
  • [3] Ahn M, 2022, Arxiv, DOI arXiv:2204.01691
  • [4] VQA: Visual Question Answering
    Antol, Stanislaw
    Agrawal, Aishwarya
    Lu, Jiasen
    Mitchell, Margaret
    Batra, Dhruv
    Zitnick, C. Lawrence
    Parikh, Devi
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
  • [5] Barakazi M., 2022, Journal of Tourism and Gastronomy Studies, V10, P895, DOI DOI 10.21325/JOTAGS.2022.1021
  • [6] Warehouse robot market boosted by Covid pandemic and technological innovations
    Bogue, Robert
    [J]. INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2022, 49 (02): : 181 - 186
  • [7] Brohan A, 2023, Arxiv, DOI arXiv:2307.15818
  • [8] Brown TB, 2020, ADV NEUR IN, V33
  • [9] Planning for Autonomous Door Opening with a Mobile Manipulator
    Chitta, Sachin
    Cohen, Benjamin
    Likhachev, Maxim
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 1799 - 1806
  • [10] Door-Opening Control of a Service Robot Using the Multifingered Robot Hand
    Chung, Woojin
    Rhee, Changju
    Shim, Youngbo
    Lee, Hyungjin
    Park, Shinsuk
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2009, 56 (10) : 3975 - 3984