Data-Driven 3D Reconstruction of Dressed Humans From Sparse Views

被引：12

作者：

Zins, Pierre ^{[1
]}

Xu, Yuanlu ^{[2
]}

Boyer, Edmond ^{[1
]}

Wuhrer, Stefanie ^{[1
]}

Tung, Tony ^{[2
]}

机构：

[1] Univ Grenoble Alpes, INRIA, LJK, CNRS,Grenoble INP,Inst Engn, F-38000 Grenoble, France

[2] Facebook Real Labs, Sausalito, CA USA

来源：

2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021) | 2021年

关键词：

SHAPE; TRACKING; CAPTURE; MOTION; POSE;

D O I：

10.1109/3DV53792.2021.00059

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, data-driven single-view reconstruction methods have shown great progress in modeling 3D dressed humans. However, such methods suffer heavily from depth ambiguities and occlusions inherent to single view inputs. In this paper, we tackle this problem by considering a small set of input views and investigate the best strategy to suitably exploit information from these views. We propose a data-driven end-to-end approach that reconstructs an implicit 3D representation of dressed humans from sparse camera views. Specifically, we introduce three key components: first a spatially consistent reconstruction that allows for arbitrary placement of the person in the input views using a perspective camera model; second an attention-based fusion layer that learns to aggregate visual information from several viewpoints; and third a mechanism that encodes local 3D patterns under the multi-view context. In the experiments, we show the proposed approach outperforms the state of the art on standard data both quantitatively and qualitatively. To demonstrate the spatially consistent reconstruction, we apply our approach to dynamic scenes. Additionally, we apply our method on real data acquired with a multi-camera platform and demonstrate our approach can obtain results comparable to multi-view stereo with dramatically less views.

引用

页码：494 / 504

页数：11

共 64 条

[1] Video Based Reconstruction of 3D People Models [J].

Alldieck, Thiemo ;

Magnor, Marcus ;

Xu, Weipeng ;

Theobalt, Christian ;

Pons-Moll, Gerard .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8387-8397

[2] SCAPE: Shape Completion and Animation of People [J].

Anguelov, D ;

Srinivasan, P ;

Koller, D ;

Thrun, S ;

Rodgers, J ;

Davis, J .

ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03) :408-416

[3]

Balan AO, 2008, LECT NOTES COMPUT SC, V5303, P15, DOI 10.1007/978-3-540-88688-4_2

[4]

Balan AO, 2007, IEEE I CONF COMP VIS, P1379

[5] Combining Implicit Function Learning and Parametric Models for 3D Human Reconstruction [J].

Bhatnagar, Bharat Lal ;

Sminchisescu, Cristian ;

Theobalt, Christian ;

Pons-Moll, Gerard .

COMPUTER VISION - ECCV 2020, PT II, 2020, 12347 :311-329

[6] Multi-Garment Net: Learning to Dress 3D People from Images [J].

Bhatnagar, Bharat Lal ;

Tiwari, Garvita ;

Theobalt, Christian ;

Pons-Moll, Gerard .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5419-5429

[7] Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image [J].

Bogo, Federica ;

Kanazawa, Angjoo ;

Lassner, Christoph ;

Gehler, Peter ;

Romero, Javier ;

Black, Michael J. .

COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :561-578

[8] Detailed Full-Body Reconstructions of Moving People from Monocular RGB-D Sequences [J].

Bogo, Federica ;

Black, Michael J. ;

Loper, Matthew ;

Romero, Javier .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2300-2308

[9] Performance capture from sparse multi-view video [J].

de Aguiar, Edilson ;

Stoll, Carsten ;

Theobalt, Christian ;

Ahmed, Naveed ;

Seidel, Hans-Peter ;

Thrun, Sebastian .

ACM TRANSACTIONS ON GRAPHICS, 2008, 27 (03)

[10] HS-Nets : Estimating Human Body Shape from Silhouettes with Convolutional Neural Networks [J].

Dibra, Endri ;

Jain, Himanshu ;

Oeztireli, Cengiz ;

Ziegler, Remo ;

Gross, Markus .

PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, :108-117

← 1 2 3 4 5 6 7 →