TARG: Tree of Action-reward Generation With Large Language Model for Cabinet Opening Using Manipulator

Cited by: 0

Authors:

Park, Sung-Gil ^{[1,2]}

Kim, Han-Byeol ^{[2]}

Lee, Yong-Jun ^{[1]}

Ahn, Woo-Jin ^{[1]}

Lim, Myo Taeg ^{[1]}

Affiliations:

[1] Korea Univ, Sch Elect Engn, 145 Anamro, Seoul 02841, South Korea

[2] LG Elect, 19 Yangjae Daero 11 Gil, Seoul 06772, South Korea

Source:

INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS | 2025 / Vol. 23 / No. 02

Funding:

National Research Foundation, Singapore;

Keywords:

Large language model; reinforcement learning; reward generation; robot manipulation; task planning;

DOI:

10.1007/s12555-024-0528-6

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812;

Abstract:

In robotics, reinforcement learning (RL) is often used to help robots learn complex tasks through interactions with their environment. A crucial aspect of RL is the design of reward functions; these functions guide the learning process by providing feedback on a robot's actions. However, crafting these reward functions manually is time-consuming and requires extensive human expertise. In this paper, we propose a tree of action-reward generation (TARG) model that automates reward generation for a given task without the need for human fine-tuning. By using a large language model (LLM), we create a systematic action plan sequence to generate a tree of actions that guides RL training. The proposed method facilitates the automatic generation of a reward tree, which stabilizes the training process. To demonstrate the effectiveness of the proposed TARG framework, we conducted experiments involving a cabinet opening task within the IsaacSim simulation environment. The results demonstrate the potential of the proposed framework to significantly improve the adaptability and performance of robots in complex settings.


Pages: 449-458

Page count: 10
