Integration of Evolutionary Computing and Reinforcement Learning for Robotic Imitation Learning

Cited by: 0

Authors:

Tan, Huan [1]

Balajee, Kannan [1]

Lynn, DeRose [1]

Affiliations:

[1] GE Global Res, Gen Elect, Niskayuna, NY USA

Source:

2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC) | 2014

Keywords:

Robotics; Imitation Learning; Reinforcement Learning; Evolutionary Algorithm; MOTION;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes an evolutionary reinforcement learning method that combines an Estimation of Distribution Algorithm (EDA) with reinforcement learning. The reinforcement learning component is based on Policy Improvement with Path Integrals (PI2). The EDA is incorporated into it to improve the generation of the noisy roll-outs. The combined method accelerates convergence of the learning results and improves overall system performance. It also offers a potential way to integrate exploratory evolutionary algorithms with greedy policy learning methods. The method is applied to a robotic imitation learning experiment, and the experimental results demonstrate its effectiveness and robustness.
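
The abstract does not give the update equations, so the following is only a minimal sketch of one way a PI2-style update and an EDA-style noise refit could be combined; it is not the authors' algorithm. The function names (`quadratic_cost`, `pi2_eda_step`, `run`), the diagonal Gaussian noise model, the parameters `elite_frac`, `h`, and `min_sigma`, and the toy quadratic cost standing in for a robot roll-out are all assumptions made for illustration.

```python
import math
import random


def quadratic_cost(theta, target):
    """Toy roll-out cost (assumption): squared distance to a target vector."""
    return sum((t - g) ** 2 for t, g in zip(theta, target))


def pi2_eda_step(theta, sigma, cost_fn, rng, n_rollouts=20,
                 elite_frac=0.5, h=10.0, min_sigma=1e-3):
    """One update: PI2-style weighted averaging, then an EDA-style noise refit."""
    dim = len(theta)
    # Sample exploration noise for each roll-out from a diagonal Gaussian.
    noises = [[rng.gauss(0.0, sigma[d]) for d in range(dim)]
              for _ in range(n_rollouts)]
    costs = [cost_fn([theta[d] + eps[d] for d in range(dim)]) for eps in noises]

    # PI2-style probabilities: exponentiated, range-normalized negative costs.
    c_min, c_max = min(costs), max(costs)
    spread = (c_max - c_min) or 1.0
    weights = [math.exp(-h * (c - c_min) / spread) for c in costs]
    total = sum(weights)
    probs = [w / total for w in weights]

    # Parameter update: probability-weighted average of the sampled noise.
    new_theta = [theta[d] + sum(p * eps[d] for p, eps in zip(probs, noises))
                 for d in range(dim)]

    # EDA-style step (assumption): refit the per-dimension noise scale to the
    # lowest-cost roll-outs, measured around the updated parameters.
    n_elite = max(2, int(elite_frac * n_rollouts))
    ranked = sorted(zip(costs, noises), key=lambda pair: pair[0])
    elite = [eps for _, eps in ranked[:n_elite]]
    new_sigma = []
    for d in range(dim):
        var = sum((theta[d] + eps[d] - new_theta[d]) ** 2 for eps in elite) / n_elite
        new_sigma.append(max(math.sqrt(var), min_sigma))
    return new_theta, new_sigma


def run(seed=0, iterations=50):
    """Run the sketch on the toy problem and return the final parameters and cost."""
    rng = random.Random(seed)
    target = [1.0, -2.0, 0.5]
    theta = [0.0, 0.0, 0.0]
    sigma = [1.0, 1.0, 1.0]
    for _ in range(iterations):
        theta, sigma = pi2_eda_step(
            theta, sigma, lambda th: quadratic_cost(th, target), rng)
    return theta, quadratic_cost(theta, target)
```

Calling `run()` returns the learned parameter vector and its remaining cost. In this sketch the noise scale is re-estimated from the better roll-outs at every iteration instead of being held fixed or decayed on a schedule, which is one possible reading of "improving the generation of roll-outs".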


Pages: 407-412

Page count: 6
