Hybrid reinforcement learning and its application to biped robot control

Cited by: 0
Authors
Yamada, S [1]
Watanabe, A [1]
Nakashima, M [1]
Affiliations
[1] Mitsubishi Elect Corp, Adv Technol R&D Ctr, Amagasaki, Hyogo 6610001, Japan
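Keywords
DOI
Not available
Chinese Library Classification (CLC) number
B84 [Psychology]; C [Social Sciences, General]; Q98 [Anthropology];
Discipline classification code
03 ; 0303 ; 030303 ; 04 ; 0402 ;
Abstract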
A hybrid reinforcement learning system, composed of linear control modules, reinforcement learning modules, and selection modules, is proposed for fast learning of real-world control problems. The selection modules choose one appropriate control module depending on the state. The hybrid learning system was applied to the control of a stilt-type biped robot. It learned control on a sloped floor more quickly than conventional reinforcement learning because it did not need to learn control on a flat floor, where the linear control module can control the robot. With two-step learning, in which the selection module is trained during the first step by a training procedure controlled only by the linear controller, it learned the control even more quickly. The average number of trials (about 50) is small enough that the learning system is applicable to real robot control.
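The module structure described in the abstract can be illustrated with a minimal sketch. The abstract gives no implementation details, so everything below is an assumption made for illustration: the scalar state (imagine a normalized body tilt), the gain value, the tabular Q-learning module, and in particular the fixed threshold rule in `Selector`, which stands in for the trained selection module of the paper. All class and function names are hypothetical.

```python
import random
from dataclasses import dataclass, field


@dataclass
class LinearController:
    """Fixed linear state feedback u = -gain * x (the gain is a made-up value)."""
    gain: float = 2.0

    def act(self, state: float) -> float:
        return -self.gain * state


@dataclass
class QLearningController:
    """Tabular Q-learning over a coarse discretization of a scalar state."""
    actions: tuple = (-1.0, 0.0, 1.0)
    bins: int = 5
    alpha: float = 0.5    # learning rate
    gamma: float = 0.9    # discount factor
    epsilon: float = 0.1  # exploration probability
    q: dict = field(default_factory=dict)

    def _bin(self, state: float) -> int:
        # Clip the state to [-1, 1] and map it to one of `bins` cells.
        clipped = max(-1.0, min(1.0, state))
        return min(self.bins - 1, int((clipped + 1.0) / 2.0 * self.bins))

    def act(self, state: float, rng=random) -> float:
        s = self._bin(state)
        if rng.random() < self.epsilon:
            return rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.q.get((s, a), 0.0))

    def update(self, state, action, reward, next_state) -> None:
        # Standard one-step Q-learning update.
        s, s2 = self._bin(state), self._bin(next_state)
        best_next = max(self.q.get((s2, a), 0.0) for a in self.actions)
        old = self.q.get((s, action), 0.0)
        self.q[(s, action)] = old + self.alpha * (reward + self.gamma * best_next - old)


@dataclass
class Selector:
    """Picks one control module per state.

    Assumption: a fixed threshold replaces the trained selection module.
    """
    threshold: float = 0.3

    def choose(self, state: float) -> str:
        return "linear" if abs(state) < self.threshold else "rl"


def hybrid_action(state, selector, linear, rl):
    """Return the name of the chosen module and the action it produces."""
    name = selector.choose(state)
    module = linear if name == "linear" else rl
    return name, module.act(state)
```

In this sketch the reinforcement learning module only needs to be updated in states where the selector hands control to it, which mirrors the abstract's argument that the learner does not have to relearn what the linear controller already handles.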
Pages: 1071-1077
Page count: 7
Related papers
50 in total
  • [11] Balance Control of a Biped Robot on a Rotating Platform Based on Efficient Reinforcement Learning
    Ao Xi
    Thushal Wijekoon Mudiyanselage
    Dacheng Tao
    Chao Chen
    IEEE/CAA Journal of Automatica Sinica, 2019, 6 (04) : 938 - 951
  • [12] Application of reinforcement learning to dexterous robot control
    Bucak, IO
    Zohdy, MA
    PROCEEDINGS OF THE 1998 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 1998, : 1405 - 1409
  • [13] Biped Balance Control by Reinforcement Learning
    Hwang, Kao-Shing
    Lin, Jin-Ling
    Li, Jhe-Syun
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2016, 32 (04) : 1041 - 1060
  • [14] Reinforcement learning for a biped robot to climb sloping surfaces
    Salatian, AW
    Yi, KY
    Zheng, YF
    JOURNAL OF ROBOTIC SYSTEMS, 1997, 14 (04) : 283 - 296
  • [15] Gait Balance of Biped Robot based on Reinforcement Learning
    Hwang, Kao-Shing
    Li, Jhe-Syun
    Jiang, Wei-Cheng
    Wang, Wei-Han
    2013 PROCEEDINGS OF SICE ANNUAL CONFERENCE (SICE), 2013, : 435 - 439
  • [16] Reinforcement learning for a CPG-driven biped robot
    Mori, T
    Nakamura, Y
    Sato, M
    Ishii, S
    PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 623 - 630
  • [17] Motion Control for Biped Robot via DDPG-based Deep Reinforcement Learning
    Wu, Xiaoguang
    Liu, Shaowei
    Zhang, Tianci
    Yang, Lei
    Li, Yanhui
    Wang, Tingjin
    2018 WRC SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION (WRC SARA), 2018, : 40 - 45
  • [18] Adaptive Reinforcement Learning and Its Application to Robot Compliance Learning
    Department of Mechanical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, United States
    J. Rob. Mechatronics, 3 (250-262):
  • [19] Fuzzy reinforcement learning and its application in robot navigation
    Duan, Y
    Xu, XH
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 899 - 904
  • [20] Introduction of Fixed Mode States into Online Reinforcement Learning with Penalties and Rewards and its Application to Biped Robot Waist Trajectory Generation
    Kuroda, Seiya
    Miyazaki, Kazuteru
    Kobayashi, Hiroaki
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2012, 16 (06) : 758 - 768