Neural Dense Non-Rigid Structure from Motion with Latent Space Constraints

Cited by: 37

Authors:

Sidhu, Vikramjit [1,2]

Tretschk, Edgar [1]

Golyanik, Vladislav [1]

Agudo, Antonio [3]

Theobalt, Christian [1]

Affiliations:

[1] Max Planck Inst Informat, SIC, Saarbrucken, Germany

[2] Saarland Univ, SIC, Saarbrucken, Germany

[3] CSIC UPC, Inst Robot & Informat Ind, Barcelona, Spain

Source:

COMPUTER VISION - ECCV 2020, PT XVI | 2020 / Vol. 12361

Keywords:

Neural Non-Rigid Structure from Motion; Sequence period detection; Latent space constraints; Deformation auto-decoder; SHAPE;

DOI:

10.1007/978-3-030-58517-4_13

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We introduce the first dense neural non-rigid structure from motion (N-NRSfM) approach, which can be trained end-to-end in an unsupervised manner from 2D point tracks. Compared to competing methods, our combination of loss functions is fully differentiable and can be readily integrated into deep-learning systems. We formulate the deformation model by an auto-decoder and impose subspace constraints on the recovered latent space function in the frequency domain. Thanks to the state recurrence cue, we classify the reconstructed non-rigid surfaces based on their similarity and recover the period of the input sequence. Our N-NRSfM approach achieves competitive accuracy on widely-used benchmark sequences and high visual quality on various real videos. Apart from being a standalone technique, our method enables multiple applications including shape compression, completion and interpolation, among others. Combined with an encoder trained directly on 2D images, we perform scenario-specific monocular 3D shape reconstruction at interactive frame rates. To facilitate the reproducibility of the results and boost the new research direction, we open-source our code and provide trained models for research purposes (http://gvv.mpi-inf.mpg.de/projects/Neural_NRSfM/).
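The abstract mentions recovering the period of the input sequence from latent codes analysed in the frequency domain. As a rough illustration of that general idea only, the sketch below estimates the dominant period of a per-frame latent sequence with a plain discrete Fourier transform. It is not the authors' implementation: the function name `estimate_period`, the synthetic two-dimensional latent codes, and the choice of picking the strongest non-DC frequency bin are all assumptions made for this example.

```python
import numpy as np


def estimate_period(latents: np.ndarray) -> float:
    """Estimate the dominant period (in frames) of a (T, D) latent sequence.

    Hypothetical helper for illustration; not taken from the paper's code.
    """
    # Remove the per-dimension mean so the DC component does not dominate.
    centered = latents - latents.mean(axis=0, keepdims=True)
    # Power spectrum along the time axis, summed over latent dimensions.
    power = (np.abs(np.fft.rfft(centered, axis=0)) ** 2).sum(axis=1)
    # Frequencies in cycles per frame for each rfft bin.
    freqs = np.fft.rfftfreq(centered.shape[0], d=1.0)
    # Strongest bin, skipping bin 0 (zero frequency).
    k = 1 + int(np.argmax(power[1:]))
    return 1.0 / freqs[k]


# Synthetic latent codes (an assumption): 200 frames repeating every 25 frames.
T = 200
t = np.arange(T)
latents = np.stack(
    [np.sin(2 * np.pi * t / 25), np.cos(2 * np.pi * t / 25)], axis=1
)
period = estimate_period(latents)
print(f"estimated period: {period:.2f} frames")
```

On this synthetic input the estimate comes out at about 25 frames. Real latent trajectories are noisier and the true period rarely falls on an exact frequency bin, so a practical system would need a more robust estimator than this single-peak pick.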

Pages: 204-222

Number of pages: 19

