Reinforcement learning control for a three-link biped robot with energy-efficient periodic gaits

被引:2
|
作者
Pan, Zebang [1 ]
Yin, Shan [1 ]
Wen, Guilin [2 ]
Tan, Zhao [1 ]
机构
[1] Hunan Univ, State Key Lab Adv Design & Manufacture Vehicle Bod, Changsha 410082, Peoples R China
[2] Yanshan Univ, Sch Mech Engn, Qinhuangdao 066004, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Three-link biped robot; Deep Reinforcement learning; Periodic gaits; Energy optimization; STABLE WALKING; LOCOMOTION; COST; FEET;
D O I
10.1007/s10409-022-22304-x
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Designing a high-performance controller for the walking gaits of biped robots remains an open research area due to their strong nonlinearity and non-smooth responses. To overcome such challenges, a humanoid robot with a torso, i.e., a three-link biped robot involving both impact and friction, is developed firstly. Then, the twin delayed deep deterministic policy gradient algorithm is adopted to design the reinforcement learning controller for the proposed biped robot. For the specified control targets, i.e., energy-efficient periodic gaits for both the downhill and uphill cases, a reward function utilizing the Poincare map and the power function is constructed to provide guidelines for the controller. Thus, the proposed controller can learn to adaptively output accurate cosine torques to achieve the goal without relying on the pre-designed reference trajectories or embedded unstable periodic gaits. A comparative study between the proposed reinforcement learning and neural network proportion differentiation controllers demonstrates the proposed controller can lead to accurate and energy-efficient periodic gaits and provide strong adaptability and robustness within a wide variety of walking slopes.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] Analysis and control of three-link planar robot with active last link: property and energy-based approach
    Xin Xin
    Yongjia Wang
    Nonlinear Dynamics, 2024, 112 : 5269 - 5289
  • [22] Reinforcement learning for energy-efficient control of parallel and identical machines
    Loffredo, Alberto
    May, Marvin Carl
    Schaefer, Louis
    Matta, Andrea
    Lanza, Gisela
    CIRP JOURNAL OF MANUFACTURING SCIENCE AND TECHNOLOGY, 2023, 44 : 91 - 103
  • [23] Position control of a three-link shape memory alloy actuated robot
    Ashrafiuon, H
    Eshraghi, M
    Elahinia, MH
    JOURNAL OF INTELLIGENT MATERIAL SYSTEMS AND STRUCTURES, 2006, 17 (05) : 381 - 392
  • [24] Decoupling Robust Control of Three-Link Direct Drive Robot Arm
    Zhang, C.
    Zhao, G.
    Xiao, Y.
    Yang, X.
    MANUFACTURING SCIENCE AND MATERIALS ENGINEERING, PTS 1 AND 2, 2012, 443-444 : 258 - +
  • [25] Reinforcement learning control for biped robot walking on uneven surfaces
    Wang, Shouyi
    Braaksma, Jelmer
    Babuska, Robert
    Hobbelen, Daan
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 4173 - +
  • [26] Deep reinforcement learning method for biped robot gait control
    Feng C.
    Zhang Y.
    Huang C.
    Jiang W.
    Wu Z.
    1600, CIMS (27): : 2341 - 2349
  • [27] Hybrid reinforcement learning and its application to biped robot control
    Yamada, S
    Watanabe, A
    Nakashima, M
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 1071 - 1077
  • [28] Modelling and Control of Three-link Planar Robot for Stable Walking Gait
    Kochuvila, Sreeja
    Vardhan, Nandi H. R.
    2018 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT - 2018), 2018, : 851 - 855
  • [29] Autonomous swing-up control of a three-link robot gymnast
    Eldukhri, E. E.
    Pham, D. T.
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2010, 224 (I7) : 825 - 833
  • [30] Energy-efficient heating control for smart buildings with deep reinforcement learning
    Gupta, Anchal
    Badr, Youakim
    Negahban, Ashkan
    Qiu, Robin G.
    JOURNAL OF BUILDING ENGINEERING, 2021, 34