Self-Adapting Recurrent Models for Object Pushing from Learning in Simulation

被引:13
作者
Cong, Lin [1 ]
Grner, Michael [1 ]
Ruppel, Philipp [1 ]
Liang, Hongzhuo [1 ]
Hendrich, Norman [1 ]
Zhang, Jianwei [1 ]
机构
[1] Univ Hamburg, Dept Informat, Hamburg, Germany
来源
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2020年
基金
美国国家科学基金会;
关键词
D O I
10.1109/IROS45743.2020.9341076
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Planar pushing remains a challenging research topic, where building the dynamic model of the interaction is the core issue. Even an accurate analytical dynamic model is inherently unstable because physics parameters such as inertia and friction can only be approximated. Data-driven models usually rely on large amounts of training data, but data collection is time consuming when working with real robots. In this paper, we collect all training data in a physics simulator and build an LSTM-based model to fit the pushing dynamics. Domain Randomization is applied to capture the pushing trajectories of a generalized class of objects. When executed on the real robot, the trained recursive model adapts to the tracked object's real dynamics within a few steps. We propose the algorithm Recurrent Model Predictive Path Integral (RMPPI) as a variation of the original MPPI approach, employing state-dependent recurrent models. As a comparison, we also train a Deep Deterministic Policy Gradient (DDPG) network as a model-free baseline, which is also used as the action generator in the data collection phase. During policy training, Hindsight Experience Replay is used to improve exploration efficiency. Pushing experiments on our UR5 platform demonstrate the model's adaptability and the effectiveness of the proposed framework.
引用
收藏
页码:5304 / 5310
页数:7
相关论文
共 27 条
[1]  
Akkaya I., 2019, CoRR
[2]   Learning dexterous in-hand manipulation [J].
Andrychowicz, Marcin ;
Baker, Bowen ;
Chociej, Maciek ;
Jozefowicz, Rafal ;
McGrew, Bob ;
Pachocki, Jakub ;
Petron, Arthur ;
Plappert, Matthias ;
Powell, Glenn ;
Ray, Alex ;
Schneider, Jonas ;
Sidor, Szymon ;
Tobin, Josh ;
Welinder, Peter ;
Weng, Lilian ;
Zaremba, Wojciech .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (01) :3-20
[3]  
[Anonymous], 2013, Model Predictive Control, DOI DOI 10.23919/ECC51009.2020.9143671
[4]  
[Anonymous], 2015, ACS SYM SER
[5]  
Arruda E, 2017, 2017 IEEE-RAS 17TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTICS (HUMANOIDS), P497, DOI 10.1109/HUMANOIDS.2017.8246918
[6]  
Bauza Maria, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P3008, DOI 10.1109/ICRA.2017.7989345
[7]  
Bauza M., 2018, NEURIPS PHYS WORKSH
[8]  
Bauza M., 2018, P C ROB LEARN
[9]  
Brockman Greg, 2016, arXiv
[10]  
Kaiser L., 2019, ICLR