Understanding the stability of deep control policies for biped locomotion

Cited by: 3
Authors
Park, Hwangpil [1, 3]
Yu, Ri [1]
Lee, Yoonsang [4]
Lee, Kyungho [5]
Lee, Jehee [2]
Affiliations
[1] Seoul Natl Univ, Seoul, South Korea
[2] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul, South Korea
[3] Samsung Elect, Suwon, South Korea
[4] Hanyang Univ, Comp Sci, Seoul, South Korea
[5] NC Soft, Sungnam, South Korea
Source
VISUAL COMPUTER | 2023 / Vol. 39 / Issue 01
Keywords
Biped locomotion; Deep reinforcement learning; Gait analysis; Physically based simulation; Push-recovery stability; RECOVERY;
DOI
10.1007/s00371-021-02342-9
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Achieving stability and robustness is the primary goal of biped locomotion control. Recently, deep reinforcement learning (DRL) has attracted great attention as a general methodology for constructing biped control policies and demonstrated significant improvements over the previous state-of-the-art control methods. Although deep control policies are more advantageous compared with previous controller design approaches, many questions remain: Are deep control policies as robust as human walking? Does simulated walking involve strategies similar to human walking for maintaining balance? Does a particular gait pattern affect human and simulated walking similarly? What do deep policies learn to achieve improved gait stability? The goal of this study is to address these questions by evaluating the push-recovery stability of deep policies compared with those of human subjects and a previous feedback controller. Furthermore, we conducted experiments to evaluate the effectiveness of variants of DRL algorithms.
Pages: 473-487
Page count: 15