Integration of Evolutionary Computing and Reinforcement Learning for Robotic Imitation Learning

被引:0
作者
Tan, Huan [1 ]
Balajee, Kannan [1 ]
Lynn, DeRose [1 ]
机构
[1] GE Global Res, Gen Elect, Niskayuna, NY USA
来源
2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC) | 2014年
关键词
Robotics; Imitation Learning; Reinforcement Learning; Evolutionary Algorithm; MOTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an evolutionary reinforcement learning method by combining Estimation of Distribution Algorithm and Reinforcement Learning. The Reinforcement Learning method in our method is based on Policy Improvement with Path Integrals (PI2). Estimation of Distribution Algorithm is incorporated into this reinforcement learning method to improve the generation of roll outs with certain noises. This method can accelerate the converging of the learning results and improve the overall system performance. Additionally, this method provides a potential solution to integrate the exploratory evolutionary algorithms and the greedy policy learning method. The proposed method is applied in a robotic imitation learning experiment in this paper and the experimental results demonstrate the effectiveness and robustness of our proposed algorithm.
引用
收藏
页码:407 / 412
页数:6
相关论文
共 17 条
  • [1] A study on acquiring underlying behavioral criteria for manipulator motion by focusing on learning efficiency
    An, Min
    Taura, Toshiharu
    Shiose, Takayuki
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2007, 37 (04): : 445 - 455
  • [2] [Anonymous], 2006, NEW EVOLUTIONARY COM
  • [3] Arikan O, 2002, ACM T GRAPHIC, V21, P483, DOI 10.1145/566570.566606
  • [4] Atkeson CG, 1997, IEEE INT CONF ROBOT, P1706, DOI 10.1109/ROBOT.1997.614389
  • [5] Atkeson CG, 1997, ARTIF INTELL REV, V11, P11, DOI 10.1023/A:1006559212014
  • [6] On learning, representing, and generalizing a task in a humanoid robot
    Calinon, Sylvain
    Guenter, Florent
    Billard, Aude
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02): : 286 - 298
  • [7] Dillmann R., 1995, PROC IEEE INT S INTE, P185
  • [8] Hauschild M., 2011, SWARM EVOLUTIONARY C
  • [9] Ijspeert A.J., 2003, ADV NEURAL INFORM PR, V15, P1523
  • [10] Larranaga P., 2001, Estimation of Distribution Algorithms: ANew Tool for Evolutionary Computation