Adaptive stiffness control of passivity-based biped robot on compliant ground using double deep Q network

被引:10
作者
Wu, Yao [1 ]
Yao, Daojin [1 ]
Guo, Zhao [1 ]
Xiao, Xiaohui [1 ]
机构
[1] Wuhan Univ, Sch Power & Mech Engn, Wuhan 430072, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Biped robot; deep reinforcement learning; double deep Q network; stiffness control; passive dynamic walking; WALKING; ACTUATORS; GAITS;
D O I
10.1177/0954406218781402
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Passive dynamic walking exhibits human-like and energy-efficient gait. Biologically inspired compliance introduced to flexible passivity-based robot would be helpful to generate stable locomotion. However, designing adaptive controller for flexible biped on compliant ground still remains a challenge. This paper aims to design an adaptive and model-free stiffness controller for passivity-based flexible biped on compliant ground, where the hip stiffness is modulated by double deep Q network. One benefit of the double deep Q network is that adaptive stiffness control policy could be directly learned from inputs. At first, passive dynamic walking gait is utilized as a reference trajectory during double deep Q network training. Then the trained double deep Q network is used as adaptive stiffness controller for biped on compliant ground. Simulation results show that the passivity-based biped robot could walk in such walking cases as disturbed initial condition, level compliant ground, downslope slippery compliant surface, and varying compliance environments. The adaptive stiffness controller would be used to make the passivity-based biped robot adapt to the environmental changes.
引用
收藏
页码:2177 / 2189
页数:13
相关论文
共 40 条
  • [1] Abadi M., 2015, TENSORFLOW LARGE SCA, DOI DOI 10.48550/ARXIV.1603.04467
  • [2] [Anonymous], 1993, P TECHN REP DTIC DOC, DOI 10.5555/168871
  • [3] [Anonymous], 1990, FINITE ELEMENT PROCE
  • [4] Deep Reinforcement Learning A brief survey
    Arulkumaran, Kai
    Deisenroth, Marc Peter
    Brundage, Miles
    Bharath, Anil Anthony
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
  • [5] Efficient bipedal robots based on passive-dynamic walkers
    Collins, S
    Ruina, A
    Tedrake, R
    Wisse, M
    [J]. SCIENCE, 2005, 307 (5712) : 1082 - 1085
  • [6] Development of multi-phase dynamic equations for a seven-link biped robot with improved foot rotation in the double support phase
    Farzadpour, Farsam
    Danesh, Mohammad
    TorkLarki, Seyed M.
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2015, 229 (01) : 3 - 17
  • [7] Literature survey of contact dynamics modelling
    Gilardi, G
    Sharf, I
    [J]. MECHANISM AND MACHINE THEORY, 2002, 37 (10) : 1213 - 1239
  • [8] Adaptive Neural Network Control of Serial Variable Stiffness Actuators
    Guo, Zhao
    Pan, Yongping
    Sun, Tairen
    Zhang, Yubing
    Xiao, Xiaohui
    [J]. COMPLEXITY, 2017,
  • [9] Design and control of a novel compliant differential shape memory alloy actuator
    Guo, Zhao
    Pan, Yongping
    Wee, Liang Boon
    Yu, Haoyong
    [J]. SENSORS AND ACTUATORS A-PHYSICAL, 2015, 225 : 71 - 80
  • [10] He Frank S, 2016, ARXIV161101606