DeepLoco: Dynamic Locomotion Skills Using Hierarchical Deep Reinforcement Learning

被引：360

作者：

Peng, Xue Bin ^{[1
]}

Berseth, Glen ^{[1
]}

Yin, Kangkang ^{[2
]}

Van De Panne, Michiel ^{[1
]}

机构：

[1] Univ British Columbia, Vancouver, BC, Canada

[2] Natl Univ Singapore, Singapore, Singapore

来源：

ACM TRANSACTIONS ON GRAPHICS | 2017年 / 36卷 / 04期

关键词：

physics-based character animation; motion control; locomotion skills;

D O I：

10.1145/3072959.3073602

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Learning physics-based locomotion skills is a difficult problem, leading to solutions that typically exploit prior knowledge of various forms. In this paper we aim to learn a variety of environment-aware locomotion skills with a limited amount of prior knowledge. We adopt a two-level hierarchical control framework. First, low-level controllers are learned that operate at a fine timescale and which achieve robust walking gaits that satisfy stepping-target and style objectives. Second, high-level controllers are then learned which plan at the timescale of steps by invoking desired step targets for the low-level controller. The high-level controller makes decisions directly based on high-dimensional inputs, including terrain maps or other suitable representations of the surroundings. Both levels of the control policy are trained using deep reinforcement learning. Results are demonstrated on a simulated 3D biped. Low-level controllers are learned for a variety of motion styles and demonstrate robustness with respect to force-based disturbances, terrain variations, and style interpolation. High-level controllers are demonstrated that are capable of following trails through terrains, dribbling a soccer ball towards a target location, and navigating through static or dynamic obstacles.

引用

页数：13

共 56 条

[1] Trajectory Optimization for Full-Body Movements with Complex Contacts [J].

Al Borno, Mazen ;

de Lasa, Martin ;

Hertzmann, Aaron .

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2013, 19 (08) :1405-1414

[2]

[Anonymous], 2014, Robotics: Science and Systems

[3]

[Anonymous], CORR

[4]

[Anonymous], 2010, ACM T GRAPHIC

[5]

[Anonymous], 2016, P 4 INT C LEARN REPR

[6]

[Anonymous], 2016, CORR

[7]

[Anonymous], 2015, CoRR

[8]

[Anonymous], 2016, CORR

[9]

[Anonymous], 2005, P ACM SIGGRAPH EUR S

[10]

[Anonymous], 2016, P INT C LEARNING REP

← 1 2 3 4 5 6 →