Understanding the stability of deep control policies for biped locomotion

被引:0
作者
Hwangpil Park
Ri Yu
Yoonsang Lee
Kyungho Lee
Jehee Lee
机构
[1] Seoul National University,
[2] Samsung Electronics,undefined
[3] Hanyang University,undefined
[4] NC Soft,undefined
来源
The Visual Computer | 2023年 / 39卷
关键词
Biped locomotion; Deep reinforcement learning; Gait analysis; Physically based simulation; Push-recovery stability;
D O I
暂无
中图分类号
学科分类号
摘要
Achieving stability and robustness is the primary goal of biped locomotion control. Recently, deep reinforcement learning (DRL) has attracted great attention as a general methodology for constructing biped control policies and demonstrated significant improvements over the previous state-of-the-art control methods. Although deep control policies are more advantageous compared with previous controller design approaches, many questions remain: Are deep control policies as robust as human walking? Does simulated walking involve strategies similar to human walking for maintaining balance? Does a particular gait pattern affect human and simulated walking similarly? What do deep policies learn to achieve improved gait stability? The goal of this study is to address these questions by evaluating the push-recovery stability of deep policies compared with those of human subjects and a previous feedback controller. Furthermore, we conducted experiments to evaluate the effectiveness of variants of DRL algorithms.
引用
收藏
页码:473 / 487
页数:14
相关论文
共 146 条
[1]  
Al Borno M(2013)Trajectory optimization for full-body movements with complex contacts IEEE Trans. Visual Comput. Gr. 19 1405-1414
[2]  
De Lasa M(2019)Drecon: data-driven responsive control of physics-based characters ACM Trans. Gr. 38 1-11
[3]  
Hertzmann A(2001)The interacting effects of cognitive demand and recovery of postural stability in balance-impaired elderly persons J. Gerontol. A Biol. Sci. Med. Sci. 56 489-496
[4]  
Bergamin K(2007)Assistive devices for gait in Parkinson’s disease Parkinsonism Related Disorders 13 133-138
[5]  
Clavet S(2010)Generalized biped walking control ACM Trans. Gr. 29 1-9
[6]  
Holden D(2008)Simulation of human motion data using short-horizon model-predictive control Comput. Gr. Forum 27 371-380
[7]  
Forbes JR(2000)Local dynamic stability versus kinematic variability of continuous overground and treadmill walking J. Biomech. Eng. 123 27-32
[8]  
Brauer SG(2020)Learned motion matching ACM Trans. Gr. 39 1-13
[9]  
Woollacott M(2019)Physics-based full-body soccer motion control for dribbling and shooting ACM Trans. Gr. 38 1-12
[10]  
Shumway-Cook A(2018)Style-based biped walking control Vis. Comput. 34 359-375