Neural Gaits: Learning Bipedal Locomotion via Control Barrier Functions and Zero Dynamics Policies

被引：0

作者：

Rodriguez, Ivan Dario Jimenez ^{[1
]}

Csomay-Shanklin, Noel ^{[1
]}

Yue, Yisong ^{[1
,2
]}

Ames, Aaron D. ^{[1
]}

机构：

[1] CALTECH, Pasadena, CA 91125 USA

[2] Argo AI, Pittsburgh, PA USA

来源：

LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168 | 2022年 / 168卷

关键词：

bipedal locomotion; zero dynamics; safety; robotics; QUADRATIC PROGRAMS;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This work presents Neural Gaits, a method for learning dynamic walking gaits through the enforcement of set invariance that can be refined episodically using experimental data from the robot. We frame walking as a set invariance problem enforceable via control barrier functions (CBFs) defined on the reduced-order dynamics quantifying the underactuated component of the robot: the zero dynamics. Our approach contains two learning modules: one for learning a policy that satisfies the CBF condition, and another for learning a residual dynamics model to refine imperfections of the nominal model. Importantly, learning only over the zero dynamics significantly reduces the dimensionality of the learning problem while using CBFs allows us to still make guarantees for the full-order system. The method is demonstrated experimentally on an underactuated bipedal robot, where we are able to show agile and dynamic locomotion, even with partially unknown dynamics.

引用

页数：13

共 32 条

[1] Ambrose E, 2017, 2017 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (CCTA 2017), P1270, DOI 10.1109/CCTA.2017.8062633
[2] Ames AD, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P3420, DOI [10.23919/ECC.2019.8796030, 10.23919/ecc.2019.8796030]
[3] Control Barrier Function Based Quadratic Programs for Safety Critical Systems
Ames, Aaron D.
Xu, Xiangru
Grizzle, Jessy W.
Tabuada, Paulo
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (08) : 3861 - 3876
[4] First steps toward formal controller synthesis for bipedal robots with experimental implementation
Ames, Aaron D.
Tabuada, Paulo
Jones, Austin
Ma, Wen-Loong
Rungger, Matthias
Schuermann, Bastian
Kolathaya, Shishir
Grizzle, Jessy W.
[J]. NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2017, 25 : 155 - 173
[5] [Anonymous], Supplementary Video
[6] Apgar T, 2018, ROBOTICS: SCIENCE AND SYSTEMS XIV
[7] Robust Feedback Motion Policy Design Using Reinforcement Learning on a 3D Digit Bipedal Robot
Castillo, Guillermo A.
Weng, Bowen
Zhang, Wei
Hereid, Ayonga
[J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 5136 - 5143
[8] Chen R T Q., 2018, ADV NEURAL INFORM PR, Vvol 31
[9] Choi J, 2020, Arxiv, DOI arXiv:2004.07584
[10] Csomay-Shanklin N., 2021, LEARNING DYNAMICS CO, P1041

← 1 2 3 4 →