Gait Balance and Acceleration of a Biped Robot Based on Q-Learning

被引:37
|
作者
Lin, Jin-Ling [1 ]
Hwang, Kao-Shing [2 ]
Jiang, Wei-Cheng [2 ]
Chen, Yu-Jen [3 ]
机构
[1] Shih Hsin Univ, Dept Informat Management, Taipei 116, Taiwan
[2] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung 80424, Taiwan
[3] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi 62102, Taiwan
来源
IEEE ACCESS | 2016年 / 4卷
关键词
Reinforcement learning; biped robot; continuous action space; zero moment point; ALGORITHM;
D O I
10.1109/ACCESS.2016.2570255
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a method for the biped dynamic walking and balance control using reinforcement learning, which learns dynamic walking without a priori knowledge about the dynamic model. The learning architecture developed is aimed to solve complex control problems in robotic actuation control by mapping the action space from a discretized domain to a continuous one. It employs the discrete actions to construct a policy for continuous action. The architecture allows for the scaling of the dimensionality of the state space and cardinality of the action set that represents new knowledge, or new requirements for a desired task. The balance learning method utilizing the motion of robot arm and leg to shift the zero moment point on the soles of a robot can maintain the biped robot in a static stable state. This balanced algorithm is applied to biped walking on a flat surface and a seesaw and is making the biped's walks more stable. The simulation shows that the proposed method can allow the robot to learn to improve its behavior in terms of walking speed. Finally, the methods are implemented on a physical biped robot to demonstrate the feasibility and effectiveness of the proposed learning scheme.
引用
收藏
页码:2439 / 2449
页数:11
相关论文
共 50 条
  • [21] Application of role value to robot soccer based on Q-learning
    School of Mechanical Engineering and Automation, Xihua University, Chengdu 610039, China
    Dianzi Keji Diaxue Xuebao, 2007, 4 (809-812):
  • [22] Mobile robot path planning based on Q-learning algorithm
    Li, Shaochuan
    Wang, Xuiqing
    Hu, Liwei
    Liu, Ying
    2019 WORLD ROBOT CONFERENCE SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION (WRC SARA 2019), 2019, : 160 - 165
  • [23] Control of the trajectory of a hexapod robot based on distributed Q-learning
    Youcef, Z
    Pierre, C
    PROCEEDINGS OF THE IEEE-ISIE 2004, VOLS 1 AND 2, 2004, : 277 - 282
  • [24] A robot demonstration method based on LWR and Q-learning algorithm
    Zhao, Guangzhe
    Tao, Yong
    Liu, Hui
    Deng, Xianling
    Chen, Youdong
    Xiong, Hegen
    Xie, Xianwu
    Fang, Zengliang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (01) : 35 - 46
  • [25] Research on intelligence robot formation based on fuzzy Q-Learning
    Zhang, RB
    Shi, Y
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1936 - 1941
  • [26] Behavior Control Algorithm for Mobile Robot Based on Q-Learning
    Yang, Shiqiang
    Li, Congxiao
    2017 INTERNATIONAL CONFERENCE ON COMPUTER NETWORK, ELECTRONIC AND AUTOMATION (ICCNEA), 2017, : 45 - 48
  • [27] NAO robot obstacle avoidance based on fuzzy Q-learning
    Wen, Shuhuan
    Hu, Xueheng
    Li, Zhen
    Lam, Hak Keung
    Sun, Fuchun
    Fang, Bin
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2020, 47 (06): : 801 - 811
  • [28] Region-based Q-Learning for intelligent robot systems
    Suh, IH
    Kim, JH
    Oh, SR
    1997 IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN ROBOTICS AND AUTOMATION - CIRA '97, PROCEEDINGS: TOWARDS NEW COMPUTATIONAL PRINCIPLES FOR ROBOTICS AND AUTOMATION, 1997, : 172 - 178
  • [29] Research on the gait planning and the balance control of biped robot running and jumping
    Zhang, Jianrui
    Yuan, Zhaohui
    Dong, Sheng
    Zhang, Fuli
    Sadiq, Muhammad Tariq
    Liang, Na
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 126 - 126
  • [30] Design of Biped Walking Gait on Biped Robot
    Anh Nguyen Van Tien
    Hoai Quoc Le
    Thien Phuc Tran
    Tan Tien Nguyen
    2017 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2017, : 303 - 306