Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks

被引：0

作者：

Triantafyllidis, Eleftherios ^{[1
]}

Christianos, Filippos ^{[1
]}

Li, Zhibin ^{[2
]}

机构：

[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland

[2] UCL, Dept Comp Sci, London, England

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024 | 2024年

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

10.1109/ICRA57147.2024.10611483

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Current reinforcement learning algorithms struggle in sparse and complex environments, most notably in long-horizon manipulation tasks entailing a plethora of different sequences. In this work, we propose the Intrinsically Guided Exploration from Large Language Models (IGE-LLMs) framework. By leveraging LLMs as an assistive intrinsic reward, IGE-LLMs guides the exploratory process in reinforcement learning to address intricate long-horizon with sparse rewards robotic manipulation tasks. We evaluate our framework and related intrinsic learning methods in an environment challenged with exploration, and a complex robotic manipulation task challenged by both exploration and long-horizons. Results show IGE-LLMs (i) exhibit notably higher performance over related intrinsic methods and the direct use of LLMs in decision-making, (ii) can be combined and complement existing learning methods highlighting its modularity, (iii) are fairly insensitive to different intrinsic scaling parameters, and (iv) maintain robustness against increased levels of uncertainty and horizons.

引用

页码：7493 / 7500

页数：8

共 46 条

[1] Ahn M., 2022, Do as I. can not as I. say: Grounding language in robotic affordances
[2] Amodei D., 2016, CoRR
[3] [Anonymous], 2018, CHI EA 18, DOI DOI 10.1145/3170427.3186500
[4] [Anonymous], 2017, Advances in Neural Information Processing Systems
[5] Bellemare M., 2016, ADV NEURAL INFORM PR
[6] Trends and challenges in robot manipulation
Billard, Aude
Kragic, Danica
[J]. SCIENCE, 2019, 364 (6446) : 1149 - +
[7] Brown TB, 2020, ADV NEUR IN, V33
[8] Burda Y., 2019, INT C LEARN REPR
[9] Carta T., 2023, Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning
[10] Chentanez N., 2004, Advances in neural information processing systems, V17

← 1 2 3 4 5 →