Learning an Efficient Gait Cycle of a Biped Robot Based on Reinforcement Learning and Artificial Neural Networks

被引:25
作者
Gil, Cristyan R. [1 ]
Calvo, Hiram [1 ]
Sossa, Humberto [1 ,2 ]
机构
[1] Inst Politecn Nacl, Ctr Invest Comp, Ciudad De Mexico 07738, Mexico
[2] Tecnol Monterrey, Campus Guadalajara, Zapopan 45138, Mexico
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 03期
关键词
q-learning; Q-networks; reinforcement learning; gait cycle; biped robots; BALANCE CONTROL; HUMANOID ROBOT; WALKING; GENERATION;
D O I
10.3390/app9030502
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Programming robots for performing different activities requires calculating sequences of values of their joints by taking into account many factors, such as stability and efficiency, at the same time. Particularly for walking, state of the art techniques to approximate these sequences are based on reinforcement learning (RL). In this work we propose a multi-level system, where the same RL method is used first to learn the configuration of robot joints (poses) that allow it to stand with stability, and then in the second level, we find the sequence of poses that let it reach the furthest distance in the shortest time, while avoiding falling down and keeping a straight path. In order to evaluate this, we focus on measuring the time it takes for the robot to travel a certain distance. To our knowledge, this is the first work focusing both on speed and precision of the trajectory at the same time. We implement our model in a simulated environment using q-learning. We compare with the built-in walking modes of an NAO robot by improving normal-speed and enhancing robustness in fast-speed. The proposed model can be extended to other tasks and is independent of a particular robot model.
引用
收藏
页数:24
相关论文
共 34 条
[1]  
[Anonymous], INT J ADV ROBOT SYST
[2]   Passive Walking Biped Model with Dissipative Contact and Friction Forces [J].
Corral, Eduardo ;
Marques, Filipe ;
Gomez Garcia, Maria Jesus ;
Flores, Paulo ;
Carlos Garcia-Prada, Juan .
PROCEEDINGS OF THE 7TH EUROPEAN CONFERENCE ON MECHANISM SCIENCE, EUCOMES 2018, 2019, 59 :35-42
[3]  
Dekker M., 2009, EINDHOVEN U TECHNOL
[4]   Walking with perturbations: a guide for biped humans and robots [J].
Duysens, Jacques ;
Forner-Cordero, Arturo .
BIOINSPIRATION & BIOMIMETICS, 2018, 13 (06)
[5]   Efficient Walking Gait Generation via Principal Component Representation of Optimal Trajectories: Application to a Planar Biped Robot With Elastic Joints [J].
Gasparri, Gian Maria ;
Manara, Silvia ;
Caporale, Danilo ;
Averta, Giuseppe ;
Bonilla, Manuel ;
Marino, Hamal ;
Catalano, Manuel ;
Grioli, Giorgio ;
Bianchi, Matteo ;
Bicchi, Antonio ;
Garabini, Manolo .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (03) :2299-2306
[6]  
Hong S, 2017, INT J FUZZY LOG INTE, V17, P51, DOI 10.5391/IJFIS.2017.17.2.51
[7]   Planning walking patterns for a biped robot [J].
Huang, Q ;
Yokoi, K ;
Kajita, S ;
Kaneko, K ;
Arai, H ;
Koyachi, N ;
Tanie, K .
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 2001, 17 (03) :280-289
[8]  
Huang Q., 1999, P 1999 IEEE RSJ INT
[9]  
Hubicki C, 2018, IEEE ROBOT AUTOM MAG, V25, P23, DOI 10.1109/MRA.2017.2783922
[10]  
Hwang KS, 2016, J INF SCI ENG, V32, P1041