LiveCap: Real-Time Human Performance Capture From Monocular Video4

被引:152
|
作者
Habermann, Marc [1 ]
Xu, Weipeng [1 ]
Zollhofer, Michael [2 ]
Pons-Moll, Gerard [1 ]
Theobalt, Christian [1 ]
机构
[1] Max Planck Inst Informat, Campus E1 4,Stuhlsatzenhausweg, D-66123 Saarbrucken, Germany
[2] Stanford Univ, 353 Serra Mall,RM 386, Stanford, CA 94305 USA
来源
ACM TRANSACTIONS ON GRAPHICS | 2019年 / 38卷 / 02期
关键词
Monocular performance capture; 3D pose estimation; human body; non-rigid surface deformation; INTERACTING CHARACTERS; MOTION CAPTURE; SHAPE; POSE; TRACKING;
D O I
10.1145/3311970
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present the first real-time human performance capture approach that reconstructs dense, space-time coherent deforming geometry of entire humans in general everyday clothing from just a single RGB video. We propose a novel two-stage analysis-by-synthesis optimization whose formulation and implementation are designed for high performance. In the first stage, a skinned template model is jointly fitted to background subtracted input video, 2D and 3D skeleton joint positions found using a deep neural network, and a set of sparse facial landmark detections. In the second stage, dense non-rigid 3D deformations of skin and even loose apparel are captured based on a novel real-time capable algorithm for non-rigid tracking using dense photometric and silhouette constraints. Our novel energy formulation leverages automatically identified material regions on the template to model the differing non-rigid deformation behavior of skin and apparel. The two resulting non-linear optimization problems per frame are solved with specially tailored data-parallel Gauss-Newton solvers. To achieve real-time performance of over 25Hz, we design a pipelined parallel architecture using the CPU and two commodity GPUs. Our method is the first real-time monocular approach for full-body performance capture. Our method yields comparable accuracy with off-line performance capture techniques while being orders of magnitude faster.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Human Performance Capture from Monocular Video in the Wild
    Guo, Chen
    Chen, Xu
    Song, Jie
    Hilliges, Otmar
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 889 - 898
  • [2] MonoPerfCap: Human Performance Capture From Monocular Video
    Xu, Weipeng
    Chatterjee, Avishek
    Zollhoefer, Michael
    Rhodin, Helge
    Mehta, Dushyant
    Seidel, Hans-Peter
    Theobalt, Christian
    ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (02):
  • [3] Real-Time 3D Pose Reconstruction of Human Body from Monocular Video Sequences
    Zhu, LiangJia
    Hwang, Jenq-Neng
    Chen, Chih-Chang
    Lin, Ming-Hui
    Yen, Chen-Lan
    ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 717 - +
  • [4] Monocular Real-Time Human Geometry Reconstruction
    Feng, Qiao
    Liu, Yebin
    Lai, Yu-Kun
    Yang, Jingyu
    Li, Kun
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 594 - 598
  • [5] Real-Time Facial Expression Transformation for Monocular RGB Video
    Ma, L.
    Deng, Z.
    COMPUTER GRAPHICS FORUM, 2019, 38 (01) : 470 - 481
  • [6] NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video
    Sun, Jiaming
    Xie, Yiming
    Chen, Linghao
    Zhou, Xiaowei
    Bao, Hujun
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15593 - 15602
  • [7] PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video
    Wu, Dong
    Yan, Zike
    Zha, Hongbin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 21507 - 21518
  • [8] REAL TIME SPEED ESTIMATION FROM MONOCULAR VIDEO
    Temiz, M. S.
    Kulur, S.
    Dogan, S.
    XXII ISPRS CONGRESS, TECHNICAL COMMISSION III, 2012, 39-B3 : 427 - 432
  • [9] Automatic real-time capture and segmentation of endoscopy video
    Stanek, Sean R.
    Tavanapong, Wallapak
    Wong, Johnny S.
    Oh, JungHwan
    de Groen, Piet C.
    MEDICAL IMAGING 2008: PACS AND IMAGING INFORMATICS, 2008, 6919
  • [10] Real-time recognition of violent acts in monocular colour video sequences
    Mecocci, Alessandro
    Micheli, Francesco
    2007 IEEE WORKSHOP ON SIGNAL PROCESSING APPLICATIONS FOR PUBLIC SECURITY AND FORENSICS, 2007, : 2 - +