Gait Balance and Acceleration of a Biped Robot Based on Q-Learning

被引：37

作者：

Lin, Jin-Ling ^{[1
]}

Hwang, Kao-Shing ^{[2
]}

Jiang, Wei-Cheng ^{[2
]}

Chen, Yu-Jen ^{[3
]}

机构：

[1] Shih Hsin Univ, Dept Informat Management, Taipei 116, Taiwan

[2] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung 80424, Taiwan

[3] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi 62102, Taiwan

来源：

IEEE ACCESS | 2016年 / 4卷

关键词：

Reinforcement learning; biped robot; continuous action space; zero moment point; ALGORITHM;

D O I：

10.1109/ACCESS.2016.2570255

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a method for the biped dynamic walking and balance control using reinforcement learning, which learns dynamic walking without a priori knowledge about the dynamic model. The learning architecture developed is aimed to solve complex control problems in robotic actuation control by mapping the action space from a discretized domain to a continuous one. It employs the discrete actions to construct a policy for continuous action. The architecture allows for the scaling of the dimensionality of the state space and cardinality of the action set that represents new knowledge, or new requirements for a desired task. The balance learning method utilizing the motion of robot arm and leg to shift the zero moment point on the soles of a robot can maintain the biped robot in a static stable state. This balanced algorithm is applied to biped walking on a flat surface and a seesaw and is making the biped's walks more stable. The simulation shows that the proposed method can allow the robot to learn to improve its behavior in terms of walking speed. Finally, the methods are implemented on a physical biped robot to demonstrate the feasibility and effectiveness of the proposed learning scheme.

引用

页码：2439 / 2449

页数：11

共 50 条

[21] Application of role value to robot soccer based on Q-learning
School of Mechanical Engineering and Automation, Xihua University, Chengdu 610039, China
Dianzi Keji Diaxue Xuebao, 2007, 4 (809-812):
[22] Mobile robot path planning based on Q-learning algorithm
Li, Shaochuan
Wang, Xuiqing
Hu, Liwei
Liu, Ying
2019 WORLD ROBOT CONFERENCE SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION (WRC SARA 2019), 2019, : 160 - 165
[23] Control of the trajectory of a hexapod robot based on distributed Q-learning
Youcef, Z
Pierre, C
PROCEEDINGS OF THE IEEE-ISIE 2004, VOLS 1 AND 2, 2004, : 277 - 282
[24] A robot demonstration method based on LWR and Q-learning algorithm
Zhao, Guangzhe
Tao, Yong
Liu, Hui
Deng, Xianling
Chen, Youdong
Xiong, Hegen
Xie, Xianwu
Fang, Zengliang
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (01) : 35 - 46
[25] Research on intelligence robot formation based on fuzzy Q-Learning
Zhang, RB
Shi, Y
PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1936 - 1941
[26] Behavior Control Algorithm for Mobile Robot Based on Q-Learning
Yang, Shiqiang
Li, Congxiao
2017 INTERNATIONAL CONFERENCE ON COMPUTER NETWORK, ELECTRONIC AND AUTOMATION (ICCNEA), 2017, : 45 - 48
[27] NAO robot obstacle avoidance based on fuzzy Q-learning
Wen, Shuhuan
Hu, Xueheng
Li, Zhen
Lam, Hak Keung
Sun, Fuchun
Fang, Bin
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2020, 47 (06): : 801 - 811
[28] Region-based Q-Learning for intelligent robot systems
Suh, IH
Kim, JH
Oh, SR
1997 IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN ROBOTICS AND AUTOMATION - CIRA '97, PROCEEDINGS: TOWARDS NEW COMPUTATIONAL PRINCIPLES FOR ROBOTICS AND AUTOMATION, 1997, : 172 - 178
[29] Research on the gait planning and the balance control of biped robot running and jumping
Zhang, Jianrui
Yuan, Zhaohui
Dong, Sheng
Zhang, Fuli
Sadiq, Muhammad Tariq
Liang, Na
BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 126 - 126
[30] Design of Biped Walking Gait on Biped Robot
Anh Nguyen Van Tien
Hoai Quoc Le
Thien Phuc Tran
Tan Tien Nguyen
2017 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2017, : 303 - 306

← 1 2 3 4 5 →