Hybrid reinforcement learning and its application to biped robot control

被引:0
|
作者
Yamada, S [1 ]
Watanabe, A [1 ]
Nakashima, M [1 ]
机构
[1] Mitsubishi Elect Corp, Adv Technol R&D Ctr, Amagasaki, Hyogo 6610001, Japan
关键词
D O I
暂无
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
A learning system composed of linear control modules, reinforcement learning modules and selection modules (a hybrid reinforcement learning system) is proposed for the fast learning of real-world control problems. The selection modules choose one appropriate control module dependent on the state. This hybrid learning system was applied to the control of a stilt-type biped robot. It learned the control on a sloped floor more quickly than the usual reinforcement learning because it did not need to learn the control on a fiat floor, where the linear control module can control the robot. When it was trained by a 2-step learning (during the first learning step, the selection module was trained by a training procedure controlled only by the linear controller), it learned the control more quickly. The average number of trials (about 50) is so small that the learning system is applicable to real robot control.
引用
收藏
页码:1071 / 1077
页数:7
相关论文
共 50 条
  • [1] Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning
    Xi, Ao
    Chen, Chao
    SENSORS, 2020, 20 (16) : 1 - 21
  • [2] Walking Control of a Biped Robot on Static and Rotating Platforms Based on Hybrid Reinforcement Learning
    Xi, Ao
    Chen, Chao
    IEEE ACCESS, 2020, 8 : 148411 - 148424
  • [3] Reinforcement learning control for biped robot walking on uneven surfaces
    Wang, Shouyi
    Braaksma, Jelmer
    Babuska, Robert
    Hobbelen, Daan
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 4173 - +
  • [4] Deep reinforcement learning method for biped robot gait control
    Feng C.
    Zhang Y.
    Huang C.
    Jiang W.
    Wu Z.
    1600, CIMS (27): : 2341 - 2349
  • [5] Reinforcement learning and its application to force control of an industrial robot
    Song, KT
    Chu, TS
    CONTROL ENGINEERING PRACTICE, 1998, 6 (01) : 37 - 44
  • [6] Control of spring actuator and its application to biped robot
    Murai, Sota
    Fujimoto, Yasutaka
    9TH IEEE INTERNATIONAL WORKSHOP ON ADVANCED MOTION CONTROL, VOLS 1 AND 2, PROCEEDINGS, 2006, : 411 - +
  • [7] Parallel Deep Reinforcement Learning Method for Gait Control of Biped Robot
    Tao, Chongben
    Xue, Jie
    Zhang, Zufeng
    Gao, Zhen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (06) : 2802 - 2806
  • [8] Natural policy gradient reinforcement learning for a CPG control of a biped robot
    Nakamura, Y
    Mori, T
    Ishii, S
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN VIII, 2004, 3242 : 972 - 981
  • [9] A Disturbance Rejection Control Method Based on Deep Reinforcement Learning for a Biped Robot
    Liu, Chuzhao
    Gao, Junyao
    Tian, Dingkui
    Zhang, Xuefeng
    Liu, Huaxin
    Meng, Libo
    APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 17
  • [10] Balance Control of a Biped Robot on a Rotating Platform Based on Efficient Reinforcement Learning
    Xi, Ao
    Mudiyanselage, Thushal Wijekoon
    Tao, Dacheng
    Chen, Chao
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (04) : 938 - 951