Photorealistic Monocular 3D Reconstruction of Humans Wearing Clothing

被引：75

作者：

Alldieck, Thiemo ^{[1
]}

Zanfir, Mihai ^{[1
]}

Sminchisescu, Cristian ^{[1
]}

机构：

[1] Google Res, Mountain View, CA 94043 USA

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年

关键词：

D O I：

10.1109/CVPR52688.2022.00156

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present PHORHUM, a novel, end-to-end trainable, deep neural network methodology for photorealistic 3D human reconstruction given just a monocular RGB image. Our pixel-aligned method estimates detailed 3D geometry and, for the first time, the unshaded surface color together with the scene illumination. Observing that 3D supervision alone is not sufficient for high fidelity color reconstruction, we introduce patch-based rendering losses that enable reliable color reconstruction on visible parts of the human, and detailed and plausible color estimation for the non-visible parts. Moreover, our method specifically addresses methodological and practical limitations of prior work in terms of representing geometry, albedo, and illumination effects, in an end-to-end model where factors can be effectively disentangled. In extensive experiments, we demonstrate the versatility and robustness of our approach. Our state-of-the-art results validate the method qualitatively and for different metrics, for both geometric and color reconstruction.

引用

页码：1496 / 1505

页数：10

共 47 条

[1] Tex2Shape: Detailed Full Human Body Geometry From a Single Image [J].

Alldieck, Thiemo ;

Pons-Moll, Gerard ;

Theobalt, Christian ;

Magnor, Marcus .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2293-2303

[2] Learning to Reconstruct People in Clothing from a Single RGB Camera [J].

Alldieck, Thiemo ;

Magnor, Marcus ;

Bhatnagar, Bharat Lal ;

Theobalt, Christian ;

Pons-Moll, Gerard .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1175-1186

[3] Video Based Reconstruction of 3D People Models [J].

Alldieck, Thiemo ;

Magnor, Marcus ;

Xu, Weipeng ;

Theobalt, Christian ;

Pons-Moll, Gerard .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8387-8397

[4]

Alldieck Thiemo, 2021, P IEEE CVF INT C COM

[5]

[Anonymous], 2019, INT C MACH LEARN

[6]

Bhatnagar Bharat Lal, 2019, INT C COMP VIS, P2

[7] Detailed Full-Body Reconstructions of Moving People from Monocular RGB-D Sequences [J].

Bogo, Federica ;

Black, Michael J. ;

Loper, Matthew ;

Romero, Javier .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2300-2308

[8] Integrated Optimization of Train Speed Profile and Timetable Considering the Location of Substations [J].

Chen, Mo ;

Fang, Qian ;

He, Tong ;

Guo, Youxing ;

Wang, Qingyuan ;

Sun, Pengfei .

2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, :460-465

[9] Photographic Image Synthesis with Cascaded Refinement Networks [J].

Chen, Qifeng ;

Koltun, Vladlen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1520-1529

[10] Learning Implicit Fields for Generative Shape Modeling [J].

Chen, Zhiqin ;

Zhang, Hao .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5932-5941

← 1 2 3 4 5 →