SFV: Reinforcement Learning of Physical Skills from Videos

被引：0

作者：

Peng, Xue Bin ^{[1
]}

Kanazawa, Angjoo ^{[1
]}

Malik, Jitendra ^{[1
]}

Abbeel, Pieter ^{[1
]}

Levine, Sergey ^{[1
]}

机构：

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

来源：

SIGGRAPH ASIA'18: SIGGRAPH ASIA 2018 TECHNICAL PAPERS | 2018年

基金：

加拿大自然科学与工程研究理事会;

关键词：

physics-based character animation; computer vision; video imitation; reinforcement learning; motion reconstruction;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Data-driven character animation based on motion capture can produce highly naturalistic behaviors and, when combined with physics simulation, can provide for natural procedural responses to physical perturbations, environmental changes, and morphological discrepancies. Motion capture remains the most popular source of motion data, but collecting mocap data typically requires heavily instrumented environments and actors. In this paper, we propose a method that enables physically simulated characters to learn skills from videos (SFV). Our approach, based on deep pose estimation and deep reinforcement learning, allows data-driven animation to leverage the abundance of publicly available video clips from the web, such as those from YouTube. This has the potential to enable fast and easy design of character controllers simply by querying for video recordings of the desired behavior. The resulting controllers are robust to perturbations, can be adapted to new settings, can perform basic object interactions, and can be retargeted to new morphologies via reinforcement learning. We further demonstrate that our method can predict potential human motions from still images, by forward simulation of learned controllers initialized from the observed pose. Our framework is able to learn a broad range of dynamic skills, including locomotion, acrobatics, and martial arts. (Video(1))

引用

页数：14

共 63 条

[1] [Anonymous], 2015, CoRR
[2] [Anonymous], B PHYS LIB
[3] [Anonymous], 2017, CoRR
[4] Aslam S., 2018, Youtube by the numbers
[5] Terrain-Adaptive Locomotion Skills Using Deep Reinforcement Learning
Bin Peng, Xue
Berseth, Glen
van de Panne, Michiel
[J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (04):
[6] Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image
Bogo, Federica
Kanazawa, Angjoo
Lassner, Christoph
Gehler, Peter
Romero, Javier
Black, Michael J.
[J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 561 - 578
[7] Tracking people with twists and exponential maps
Bregler, C
Malik, J
[J]. 1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, : 8 - 15
[8] Brockman Greg, 2016, arXiv
[9] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
Cao, Zhe
Simon, Tomas
Wei, Shih-En
Sheikh, Yaser
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310
[10] Locomotion Skills for Simulated Quadrupeds
Coros, Stelian
Karpathy, Andrej
Jones, Ben
Reveret, Lionel
van de Panne, Michiel
[J]. ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (04):

← 1 2 3 4 5 6 7 →