Hybrid reinforcement learning and its application to biped robot control

被引:0
|
作者
Yamada, S [1 ]
Watanabe, A [1 ]
Nakashima, M [1 ]
机构
[1] Mitsubishi Elect Corp, Adv Technol R&D Ctr, Amagasaki, Hyogo 6610001, Japan
关键词
D O I
暂无
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
A learning system composed of linear control modules, reinforcement learning modules and selection modules (a hybrid reinforcement learning system) is proposed for the fast learning of real-world control problems. The selection modules choose one appropriate control module dependent on the state. This hybrid learning system was applied to the control of a stilt-type biped robot. It learned the control on a sloped floor more quickly than the usual reinforcement learning because it did not need to learn the control on a fiat floor, where the linear control module can control the robot. When it was trained by a 2-step learning (during the first learning step, the selection module was trained by a training procedure controlled only by the linear controller), it learned the control more quickly. The average number of trials (about 50) is so small that the learning system is applicable to real robot control.
引用
收藏
页码:1071 / 1077
页数:7
相关论文
共 50 条
  • [41] Path Planning for a Statically Stable Biped Robot Using PRM and Reinforcement Learning
    Prasad Kulkarni
    Dip Goswami
    Prithwijit Guha
    Ashish Dutta
    Journal of Intelligent and Robotic Systems, 2006, 47 : 197 - 214
  • [42] Path planning for a statically stable biped robot using PRM and reinforcement learning
    Kulkarni, Prasad
    Goswami, Dip
    Guha, Prithwijit
    Dutta, Ashish
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2006, 47 (03) : 197 - 214
  • [43] Reinforcement learning for biped locomotion
    Sato, M
    Nakamura, Y
    Ishii, S
    ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 777 - 782
  • [44] LORM: a novel reinforcement learning framework for biped gait control
    Zhang, Weiyi
    Jiang, Yancao
    Farrukh, Fasih Ud Din
    Zhang, Chun
    Zhang, Debing
    Wang, Guangqi
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [45] Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot
    Hitomi, Kentarou
    Shibata, Tomohiro
    Nakamura, Yutaka
    Ishii, Shin
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (12) : 982 - 988
  • [46] Residual Reinforcement Learning for Robot Control
    Johannink, Tobias
    Bahl, Shikhar
    Nair, Ashvin
    Luo, Jianlan
    Kumar, Avinash
    Loskyll, Matthias
    Ojea, Juan Aparicio
    Solowjow, Eugen
    Levine, Sergey
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 6023 - 6029
  • [47] Neuro-fuzzy gait synthesis with reinforcement learning for a biped walking robot
    Zhou C.
    Soft Computing, 2000, 4 (04) : 238 - 250
  • [48] Hybrid Surrogate Assisted Evolutionary Multiobjective Reinforcement Learning for Continuous Robot Control
    Mazumdar, Atanu
    Kyrki, Ville
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2024, PT II, 2024, 14635 : 61 - 75
  • [49] Robot Position/Force Control in Unknown Environment Using Hybrid Reinforcement Learning
    Perrusquia, Adolfo
    Yu Wen
    CYBERNETICS AND SYSTEMS, 2020, 51 (04) : 542 - 560
  • [50] Learning control for nonlinear system and its application in robot
    Yan, Xinggang
    Chen, I.M.
    Kongzhi Lilun Yu Yinyong/Control Theory and Applications, 2000, 17 (04): : 573 - 575