Reinforcement learning control for a three-link biped robot with energy-efficient periodic gaits

Cited by: 2
Authors
Pan, Zebang [1 ]
Yin, Shan [1 ]
Wen, Guilin [2 ]
Tan, Zhao [1 ]
Affiliations
[1] Hunan Univ, State Key Lab Adv Design & Manufacture Vehicle Bod, Changsha 410082, Peoples R China
[2] Yanshan Univ, Sch Mech Engn, Qinhuangdao 066004, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation;
Keywords
Three-link biped robot; Deep Reinforcement learning; Periodic gaits; Energy optimization; STABLE WALKING; LOCOMOTION; COST; FEET;
DOI
10.1007/s10409-022-22304-x
Chinese Library Classification (CLC) number
TH [Machinery and Instrument Industry];
Discipline classification code
0802 ;
Abstract
Designing a high-performance controller for the walking gaits of biped robots remains an open research problem because of their strong nonlinearity and non-smooth responses. To address these challenges, a humanoid robot with a torso, i.e., a three-link biped robot subject to both impact and friction, is first developed. The twin delayed deep deterministic policy gradient (TD3) algorithm is then adopted to design a reinforcement learning controller for the proposed biped robot. For the specified control targets, i.e., energy-efficient periodic gaits in both the downhill and uphill cases, a reward function based on the Poincaré map and the power function is constructed to guide the controller. The proposed controller can thus learn to adaptively output accurate cosine torques that achieve the goal without relying on pre-designed reference trajectories or embedded unstable periodic gaits. A comparative study between the proposed reinforcement learning and neural network proportional-derivative controllers demonstrates that the proposed controller yields accurate and energy-efficient periodic gaits and provides strong adaptability and robustness over a wide range of walking slopes.
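The abstract does not give the form of the reward function, so the following is only a minimal sketch of how a reward might combine a Poincaré-map periodicity term with a power term. Every name here (`poincare_error`, `mean_abs_power`, `step_reward`), the weights `w_period` and `w_energy`, and the choice of a Euclidean norm and mean absolute power are assumptions made for illustration, not the authors' formulation.

```python
import math
from typing import Sequence


def poincare_error(state_prev: Sequence[float], state_next: Sequence[float]) -> float:
    """Euclidean distance between the states sampled at two consecutive
    crossings of a Poincare section (assumed here: once per step).
    It is zero when the gait repeats exactly from one step to the next."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(state_prev, state_next)))


def mean_abs_power(torques: Sequence[float], velocities: Sequence[float]) -> float:
    """Mean absolute mechanical power |torque * angular velocity| over the
    samples recorded during one step (an assumed energy measure)."""
    if len(torques) == 0 or len(torques) != len(velocities):
        raise ValueError("torques and velocities must be non-empty and equal in length")
    return sum(abs(t * w) for t, w in zip(torques, velocities)) / len(torques)


def step_reward(
    state_prev: Sequence[float],
    state_next: Sequence[float],
    torques: Sequence[float],
    velocities: Sequence[float],
    w_period: float = 1.0,   # hypothetical weight on gait periodicity
    w_energy: float = 0.1,   # hypothetical weight on energy use
) -> float:
    """Per-step reward: penalize deviation from periodicity and power use.
    The maximum value, 0, is reached by a perfectly periodic, zero-power step."""
    return -(
        w_period * poincare_error(state_prev, state_next)
        + w_energy * mean_abs_power(torques, velocities)
    )
```

In this sketch, a gait whose state returns to the same point on the section after each step incurs no periodicity penalty, so the energy term alone distinguishes between periodic gaits. How the two terms are actually weighted or shaped in the paper is not stated in the abstract.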
Pages: 23
Related papers
50 in total
  • [41] Energy-efficient bio-inspired gait planning and control for biped robot based on human locomotion analysis
    Hongbo Zhu
    Minzhou Luo
    Tao Mei
    Jianghai Zhao
    Tao Li
    Fayong Guo
    Journal of Bionic Engineering, 2016, 13 : 271 - 282
  • [42] Energy-Efficient Slithering Gait Exploration for a Snake-Like Robot Based on Reinforcement Learning
    Bing, Zhenshan
    Lemke, Christian
    Jiang, Zhuangyi
    Huang, Kai
    Knoll, Alois
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 5663 - 5669
  • [43] Dynamic model and motion control analysis of three-link gymnastic robot on horizontal bar
    Xie, J
    Li, ZS
    2003 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS, INTELLIGENT SYSTEMS AND SIGNAL PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2003, : 83 - 87
  • [44] Control strategy for a novel class of three-link underactuated manipulators named PPA robot
    Lai, Xu-Zhi
    Pan, Chang-Zhong
    Wu, Min
    Kongzhi yu Juece/Control and Decision, 2011, 26 (07): : 1004 - 1008
  • [45] Energy-efficient Bio-inspired Gait Planning and Control for Biped Robot Based on Human Locomotion Analysis
    Zhu, Hongbo
    Luo, Minzhou
    Mei, Tao
    Zhao, Jianghai
    Li, Tao
    Guo, Fayong
    JOURNAL OF BIONIC ENGINEERING, 2016, 13 (02) : 271 - 282
  • [46] A Disturbance Rejection Control Method Based on Deep Reinforcement Learning for a Biped Robot
    Liu, Chuzhao
    Gao, Junyao
    Tian, Dingkui
    Zhang, Xuefeng
    Liu, Huaxin
    Meng, Libo
    APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 17
  • [47] Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning
    Xi, Ao
    Chen, Chao
    SENSORS, 2020, 20 (16) : 1 - 21
  • [48] Energy-Efficient Reinforcement Learning for Motion Planning of AUV
    Wen, Jiayi
    Zhu, Jingwei
    Lin, Yejin
    Zhang, Guichen
    2022 IEEE 9TH INTERNATIONAL CONFERENCE ON UNDERWATER SYSTEM TECHNOLOGY: THEORY AND APPLICATIONS, USYS, 2022,
  • [49] Energy-efficient heating control for nearly zero energy residential buildings with deep reinforcement learning
    Qin, Haosen
    Yu, Zhen
    Li, Tailu
    Liu, Xueliang
    Li, Li
    ENERGY, 2023, 264
  • [50] Fast Reinforcement Learning for Energy-Efficient Wireless Communication
    Mastronarde, Nicholas
    van der Schaar, Mihaela
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2011, 59 (12) : 6262 - 6266