Physics-based Motion Capture Imitation with Deep Reinforcement Learning

被引：49

作者：

Chentanez, Nuttapong ^{[1
,2
]}

Muller, Matthias ^{[2
]}

Macklin, Miles ^{[2
]}

Makoviychuk, Viktor ^{[2
]}

Jeschke, Stefan ^{[2
]}

机构：

[1] Chulalongkorn Univ, Fac Engn, Dept Comp Engn, Bangkok, Thailand

[2] NVIDIA Res, Santa Clara, CA 95051 USA

来源：

ACM SIGGRAPH CONFERENCE ON MOTION, INTERACTION, AND GAMES (MIG 2018) | 2018年

关键词：

Mocap; Character Controller; Neural Network; Deep Learning; Simulation;

D O I：

10.1145/3274247.3274506

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

We introduce a deep reinforcement learning method that learns to control articulated humanoid bodies to imitate given target motions closely when simulated in a physics simulator. The target motion, which may not have been seen by the agent and can be noisy, is supplied at runtime. Our method can recover balance from moderate external disturbances and keep imitating the target motion. When subjected to large disturbances that cause the humanoid to fall down, our method can control the character to get up and recover to track the motion. Our method is trained to imitate the mocap clips from the CMU motion capture database and a number of other publicly available databases. We use a state-of-the-art deep reinforcement learning algorithm to learn to dynamically control the gain of PD controllers, whose target angles are derived from the mocap clip and to apply corrective torques with the goal of imitating the provided motion clip as closely as possible. Both the simulation and the learning algorithms are parallelized and run on the GPU. We demonstrate that the proposed method can control the character to imitate a wide variety of motions such as running, walking, dancing, jumping, kicking, punching, standing up, and so on.

引用

页数：10

共 50 条

[1]

Abadi M., 2016, TENSORFLOW LARGESCAL

[2] Trajectory Optimization for Full-Body Movements with Complex Contacts [J].

Al Borno, Mazen ;

de Lasa, Martin ;

Hertzmann, Aaron .

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2013, 19 (08) :1405-1414

[3]

[Anonymous], 2015, CoRR

[4]

[Anonymous], 2014, Robotics: Science and Systems (RSS)

[5]

[Anonymous], ICML 14

[6]

Beamer S, 2010, CONF PROC INT SYMP C, P129, DOI 10.1145/1816038.1815978

[7]

Berseth Glen, 2018, INT C LEARN REPR

[8] Video Deblurring for Hand-held Cameras Using Patch-based Synthesis [J].

Cho, Sunghyun ;

Wang, Jue ;

Lee, Seungyong .

ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04)

[9]

Cooper Joseph L., 2012, Motion in Games. 5th International Conference (MIG 2012). Proceedings, P350, DOI 10.1007/978-3-642-34710-8_32

[10] Generalized Biped Walking Control [J].

Coros, Stelian ;

Beaudoin, Philippe ;

van de Panne, Michiel .

ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04)

← 1 2 3 4 5 →