Neural Gaits: Learning Bipedal Locomotion via Control Barrier Functions and Zero Dynamics Policies

被引:0
作者
Rodriguez, Ivan Dario Jimenez [1 ]
Csomay-Shanklin, Noel [1 ]
Yue, Yisong [1 ,2 ]
Ames, Aaron D. [1 ]
机构
[1] CALTECH, Pasadena, CA 91125 USA
[2] Argo AI, Pittsburgh, PA USA
来源
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168 | 2022年 / 168卷
关键词
bipedal locomotion; zero dynamics; safety; robotics; QUADRATIC PROGRAMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work presents Neural Gaits, a method for learning dynamic walking gaits through the enforcement of set invariance that can be refined episodically using experimental data from the robot. We frame walking as a set invariance problem enforceable via control barrier functions (CBFs) defined on the reduced-order dynamics quantifying the underactuated component of the robot: the zero dynamics. Our approach contains two learning modules: one for learning a policy that satisfies the CBF condition, and another for learning a residual dynamics model to refine imperfections of the nominal model. Importantly, learning only over the zero dynamics significantly reduces the dimensionality of the learning problem while using CBFs allows us to still make guarantees for the full-order system. The method is demonstrated experimentally on an underactuated bipedal robot, where we are able to show agile and dynamic locomotion, even with partially unknown dynamics.
引用
收藏
页数:13
相关论文
共 32 条
  • [1] Ambrose E, 2017, 2017 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (CCTA 2017), P1270, DOI 10.1109/CCTA.2017.8062633
  • [2] Ames AD, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P3420, DOI [10.23919/ECC.2019.8796030, 10.23919/ecc.2019.8796030]
  • [3] Control Barrier Function Based Quadratic Programs for Safety Critical Systems
    Ames, Aaron D.
    Xu, Xiangru
    Grizzle, Jessy W.
    Tabuada, Paulo
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (08) : 3861 - 3876
  • [4] First steps toward formal controller synthesis for bipedal robots with experimental implementation
    Ames, Aaron D.
    Tabuada, Paulo
    Jones, Austin
    Ma, Wen-Loong
    Rungger, Matthias
    Schuermann, Bastian
    Kolathaya, Shishir
    Grizzle, Jessy W.
    [J]. NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2017, 25 : 155 - 173
  • [5] [Anonymous], Supplementary Video
  • [6] Apgar T, 2018, ROBOTICS: SCIENCE AND SYSTEMS XIV
  • [7] Robust Feedback Motion Policy Design Using Reinforcement Learning on a 3D Digit Bipedal Robot
    Castillo, Guillermo A.
    Weng, Bowen
    Zhang, Wei
    Hereid, Ayonga
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 5136 - 5143
  • [8] Chen R T Q., 2018, ADV NEURAL INFORM PR, Vvol 31
  • [9] Choi J, 2020, Arxiv, DOI arXiv:2004.07584
  • [10] Csomay-Shanklin N., 2021, LEARNING DYNAMICS CO, P1041