SFV: Reinforcement Learning of Physical Skills from Videos

被引:0
作者
Peng, Xue Bin [1 ]
Kanazawa, Angjoo [1 ]
Malik, Jitendra [1 ]
Abbeel, Pieter [1 ]
Levine, Sergey [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
SIGGRAPH ASIA'18: SIGGRAPH ASIA 2018 TECHNICAL PAPERS | 2018年
基金
加拿大自然科学与工程研究理事会;
关键词
physics-based character animation; computer vision; video imitation; reinforcement learning; motion reconstruction;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data-driven character animation based on motion capture can produce highly naturalistic behaviors and, when combined with physics simulation, can provide for natural procedural responses to physical perturbations, environmental changes, and morphological discrepancies. Motion capture remains the most popular source of motion data, but collecting mocap data typically requires heavily instrumented environments and actors. In this paper, we propose a method that enables physically simulated characters to learn skills from videos (SFV). Our approach, based on deep pose estimation and deep reinforcement learning, allows data-driven animation to leverage the abundance of publicly available video clips from the web, such as those from YouTube. This has the potential to enable fast and easy design of character controllers simply by querying for video recordings of the desired behavior. The resulting controllers are robust to perturbations, can be adapted to new settings, can perform basic object interactions, and can be retargeted to new morphologies via reinforcement learning. We further demonstrate that our method can predict potential human motions from still images, by forward simulation of learned controllers initialized from the observed pose. Our framework is able to learn a broad range of dynamic skills, including locomotion, acrobatics, and martial arts. (Video(1))
引用
收藏
页数:14
相关论文
共 63 条
  • [1] [Anonymous], 2015, CoRR
  • [2] [Anonymous], B PHYS LIB
  • [3] [Anonymous], 2017, CoRR
  • [4] Aslam S., 2018, Youtube by the numbers
  • [5] Terrain-Adaptive Locomotion Skills Using Deep Reinforcement Learning
    Bin Peng, Xue
    Berseth, Glen
    van de Panne, Michiel
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (04):
  • [6] Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image
    Bogo, Federica
    Kanazawa, Angjoo
    Lassner, Christoph
    Gehler, Peter
    Romero, Javier
    Black, Michael J.
    [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 561 - 578
  • [7] Tracking people with twists and exponential maps
    Bregler, C
    Malik, J
    [J]. 1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, : 8 - 15
  • [8] Brockman Greg, 2016, arXiv
  • [9] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
    Cao, Zhe
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310
  • [10] Locomotion Skills for Simulated Quadrupeds
    Coros, Stelian
    Karpathy, Andrej
    Jones, Ben
    Reveret, Lionel
    van de Panne, Michiel
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (04):