Adaptive stiffness control of passivity-based biped robot on compliant ground using double deep Q network

被引：10

作者：

Wu, Yao ^{[1
]}

Yao, Daojin ^{[1
]}

Guo, Zhao ^{[1
]}

Xiao, Xiaohui ^{[1
]}

机构：

[1] Wuhan Univ, Sch Power & Mech Engn, Wuhan 430072, Hubei, Peoples R China

来源：

PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE | 2019年 / 233卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Biped robot; deep reinforcement learning; double deep Q network; stiffness control; passive dynamic walking; WALKING; ACTUATORS; GAITS;

D O I：

10.1177/0954406218781402

中图分类号：

TH [机械、仪表工业];

学科分类号：

0802 ;

摘要：

Passive dynamic walking exhibits human-like and energy-efficient gait. Biologically inspired compliance introduced to flexible passivity-based robot would be helpful to generate stable locomotion. However, designing adaptive controller for flexible biped on compliant ground still remains a challenge. This paper aims to design an adaptive and model-free stiffness controller for passivity-based flexible biped on compliant ground, where the hip stiffness is modulated by double deep Q network. One benefit of the double deep Q network is that adaptive stiffness control policy could be directly learned from inputs. At first, passive dynamic walking gait is utilized as a reference trajectory during double deep Q network training. Then the trained double deep Q network is used as adaptive stiffness controller for biped on compliant ground. Simulation results show that the passivity-based biped robot could walk in such walking cases as disturbed initial condition, level compliant ground, downslope slippery compliant surface, and varying compliance environments. The adaptive stiffness controller would be used to make the passivity-based biped robot adapt to the environmental changes.

引用

页码：2177 / 2189

页数：13

共 40 条

[1] Abadi M., 2015, TENSORFLOW LARGE SCA, DOI DOI 10.48550/ARXIV.1603.04467
[2] [Anonymous], 1993, P TECHN REP DTIC DOC, DOI 10.5555/168871
[3] [Anonymous], 1990, FINITE ELEMENT PROCE
[4] Deep Reinforcement Learning A brief survey
Arulkumaran, Kai
Deisenroth, Marc Peter
Brundage, Miles
Bharath, Anil Anthony
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
[5] Efficient bipedal robots based on passive-dynamic walkers
Collins, S
Ruina, A
Tedrake, R
Wisse, M
[J]. SCIENCE, 2005, 307 (5712) : 1082 - 1085
[6] Development of multi-phase dynamic equations for a seven-link biped robot with improved foot rotation in the double support phase
Farzadpour, Farsam
Danesh, Mohammad
TorkLarki, Seyed M.
[J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2015, 229 (01) : 3 - 17
[7] Literature survey of contact dynamics modelling
Gilardi, G
Sharf, I
[J]. MECHANISM AND MACHINE THEORY, 2002, 37 (10) : 1213 - 1239
[8] Adaptive Neural Network Control of Serial Variable Stiffness Actuators
Guo, Zhao
Pan, Yongping
Sun, Tairen
Zhang, Yubing
Xiao, Xiaohui
[J]. COMPLEXITY, 2017,
[9] Design and control of a novel compliant differential shape memory alloy actuator
Guo, Zhao
Pan, Yongping
Wee, Liang Boon
Yu, Haoyong
[J]. SENSORS AND ACTUATORS A-PHYSICAL, 2015, 225 : 71 - 80
[10] He Frank S, 2016, ARXIV161101606

← 1 2 3 4 →