Neural Body Fitting: Unifying Deep Learning and Model Based Human Pose and Shape Estimation

被引:363
作者
Omran, Mohamed [1 ]
Lassner, Christoph [2 ]
Pons-Moll, Gerard [1 ]
Gehler, Peter V. [2 ]
Schiele, Bernt [1 ]
机构
[1] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany
[2] Amazon, Tubingen, Germany
来源
2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV) | 2018年
关键词
D O I
10.1109/3DV.2018.00062
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Direct prediction of 3D body pose and shape remains a challenge even for highly parameterized deep learning models. Mapping from the 2D image space to the prediction space is difficult: perspective ambiguities make the loss function noisy and training data is scarce. In this paper, we propose a novel approach (Neural Body Fitting (NBF)). It integrates a statistical body model within a CNN, leveraging reliable bottom-up semantic body part segmentation and robust top-down body model constraints. NBF is fully differentiable and can be trained using 2D and 3D annotations. In detailed experiments, we analyze how the components of our model affect performance, especially the use of part segmentations as an explicit intermediate representation, and present a robust, efficiently trainable framework for 3D human pose estimation from 2D images with competitive results on standard benchmarks. Code will be made available at http://github.com/mohomran/neural_body_fitting
引用
收藏
页码:484 / 494
页数:11
相关论文
共 70 条
[1]  
Akhter I, 2015, PROC CVPR IEEE, P1446, DOI 10.1109/CVPR.2015.7298751
[2]  
Alldieck Thiemo, 2018, 3DV
[3]   SCAPE: Shape Completion and Animation of People [J].
Anguelov, D ;
Srinivasan, P ;
Koller, D ;
Thrun, S ;
Rodgers, J ;
Davis, J .
ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03) :408-416
[4]  
[Anonymous], 2018, P IEEE C COMP VIS PA
[5]  
[Anonymous], 2010, BMVC
[6]  
[Anonymous], ARXIV180101615
[7]  
[Anonymous], 2015, Very Deep Convolu- tional Networks for Large-Scale Image Recognition
[8]  
[Anonymous], EUR C COMP VIS ECCV
[9]  
[Anonymous], 2015 IEEE C COMP VIS
[10]  
[Anonymous], 2018, ARXIV180300455