Hybrid Imitation Learning Framework for Robotic Manipulation Tasks

被引：6

作者：

Jung, Eunjin ^{[1
]}

Kim, Incheol ^{[1
]}

机构：

[1] Kyonggi Univ, Dept Comp Sci, Suwon 16227, South Korea

来源：

SENSORS | 2021年 / 21卷 / 10期

关键词：

robotic object manipulation task; hybrid imitation learning; behavior cloning; trajectory cloning; dynamics modeling;

D O I：

10.3390/s21103409

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

This study proposes a novel hybrid imitation learning (HIL) framework in which behavior cloning (BC) and state cloning (SC) methods are combined in a mutually complementary manner to enhance the efficiency of robotic manipulation task learning. The proposed HIL framework efficiently combines BC and SC losses using an adaptive loss mixing method. It uses pretrained dynamics networks to enhance SC efficiency and performs stochastic state recovery to ensure stable learning of policy networks by transforming the learner's task state into a demo state on the demo task trajectory during SC. The training efficiency and policy flexibility of the proposed HIL framework are demonstrated in a series of experiments conducted to perform major robotic manipulation tasks (pick-up, pick-and-place, and stack tasks). In the experiments, the HIL framework showed about a 2.6 times higher performance improvement than the pure BC and about a four times faster training time than the pure SC imitation learning method. In addition, the HIL framework also showed about a 1.6 times higher performance improvement and about a 2.2 times faster training time than the other hybrid learning method combining BC and reinforcement learning (BC + RL) in the experiments.

引用

页数：18

共 31 条

[1] A Fast, Robust, and Incremental Model for Learning High-Level Concepts From Human Motions by Imitation [J].

Alibeigi, Mina ;

Ahmadabadi, Majid Nili ;

Araabi, Babak Nadjar .

IEEE TRANSACTIONS ON ROBOTICS, 2017, 33 (01) :153-168

[2]

[Anonymous], 2000, Icml

[3] Deep Reinforcement Learning A brief survey [J].

Arulkumaran, Kai ;

Deisenroth, Marc Peter ;

Brundage, Miles ;

Bharath, Anil Anthony .

IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38

[4] Trends and challenges in robot manipulation [J].

Billard, Aude ;

Kragic, Danica .

SCIENCE, 2019, 364 (6446) :1149-+

[5]

Cohen Benjamin J., 2011, IEEE International Conference on Robotics and Automation, P5478

[6]

Di Palo N., 2020, ARXIV201109586

[7]

Ebert F., 2018, ARXIV181003043

[8] Probabilistic model-based imitation learning [J].

Englert, Peter ;

Paraschos, Alexandros ;

Deisenroth, Marc Peter ;

Peters, Jan .

ADAPTIVE BEHAVIOR, 2013, 21 (05) :388-403

[9] A Machine Learning Approach to Visual Perception of Forest Trails for Mobile Robots [J].

Giusti, Alessandro ;

Guzzi, Jerome ;

Ciresan, Dan C. ;

He, Fang-Lin ;

Rodriguez, Juan P. ;

Fontana, Flavio ;

Faessler, Matthias ;

Forster, Christian ;

Schmidhuber, Jurgen ;

Di Caro, Gianni ;

Scaramuzza, Davide ;

Gambardella, Luca M. .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2016, 1 (02) :661-667

[10]

Goecks V.G., 2020, P INT C AUT AG MULTI, P465

← 1 2 3 4 →