Active Exploration for Robotic Manipulation

被引:0
作者
Schneider, Tim [1 ]
Belousov, Boris [1 ]
Chalvatzaki, Georgia [1 ]
Romeres, Diego [2 ]
Jha, Devesh K. [2 ]
Peters, Jan [1 ]
机构
[1] Tech Univ Darmstadt, Intelligent Autonomous Syst Lab, D-64289 Darmstadt, Germany
[2] Mitsubishi Elect Res Labs MERL, Cambridge, MA USA
来源
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2022年
关键词
HUMAN HAND; EVOLUTION;
D O I
10.1109/IROS47612.2022.9982061
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Robotic manipulation stands as a largely unsolved problem despite significant advances in robotics and machine learning in recent years. One of the key challenges in manipulation is the exploration of the dynamics of the environment when there is continuous contact between the objects being manipulated. This paper proposes a model-based active exploration approach that enables efficient learning in sparse-reward robotic manipulation tasks. The proposed method estimates an information gain objective using an ensemble of probabilistic models and deploys model predictive control (MPC) to plan actions online that maximize the expected reward while also performing directed exploration. We evaluate our proposed algorithm in simulation and on a real robot, trained from scratch with our method, on a challenging ball pushing task on tilted tables, where the target ball position is not known to the agent a-priori. Our real-world robot experiment serves as a fundamental application of active exploration in model-based reinforcement learning of complex robotic manipulation tasks. Project page https://sites.google.com/view/aerm.
引用
收藏
页码:9355 / 9362
页数:8
相关论文
共 44 条
[1]  
Amin S., 2021, SURVEY EXPLORATION M, DOI [10.48550/arXiv.2109.00157, DOI 10.48550/ARXIV.2109.00157]
[2]  
[Anonymous], 2019, PR MACH LEARN RES
[3]  
Barber D, 2004, ADV NEUR IN, V16, P201
[4]  
Bechtle S., 2019, CURIOUS ILQR RESOLVI, DOI [10.48550/arXiv.1904.06786, DOI 10.48550/ARXIV.1904.06786]
[5]  
Belghazi MI, 2018, PR MACH LEARN RES, V80
[6]  
Brockman G., 2016, ARXIV PREPRINT ARXIV
[7]  
Calandra R, 2018, Arxiv, DOI arXiv:1805.12114
[8]   Learn to Adapt to Human Walking: A Model-Based Reinforcement Learning Approach for a Robotic Assistant Rollator [J].
Chalvatzaki, Georgia ;
Papageorgiou, Xanthi S. ;
Maragos, Petros ;
Tzafestas, Costas S. .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) :3774-3781
[9]  
Chebotar Y, 2017, PR MACH LEARN RES, V70
[10]   The stress of nurses working in adult care in the intensive care unit [J].
de Souza, Vinicius Rodrigues ;
Lima da Silva, Jorge Luiz ;
Lopes, Mariana Ribeiro ;
dos Santos, Juliana Maise ;
Silva, Beatriz Peres ;
Galvao dos Santos, Lia Cristina .
REVISTA DE PESQUISA-CUIDADO E FUNDAMENTAL ONLINE, 2012, :25-28