Neural Dense Non-Rigid Structure from Motion with Latent Space Constraints

被引:37
作者
Sidhu, Vikramjit [1 ,2 ]
Tretschk, Edgar [1 ]
Golyanik, Vladislav [1 ]
Agudo, Antonio [3 ]
Theobalt, Christian [1 ]
机构
[1] Max Planck Inst Informat, SIC, Saarbrucken, Germany
[2] Saarland Univ, SIC, Saarbrucken, Germany
[3] CSIC UPC, Inst Robot & Informat Ind, Barcelona, Spain
来源
COMPUTER VISION - ECCV 2020, PT XVI | 2020年 / 12361卷
关键词
Neural Non-Rigid Structure from Motion; Sequence period detection; Latent space constraints; Deformation auto-decoder; SHAPE;
D O I
10.1007/978-3-030-58517-4_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce the first dense neural non-rigid structure from motion (N-NRSfM) approach, which can be trained end-to-end in an unsupervised manner from 2D point tracks. Compared to the competing methods, our combination of loss functions is fully-differentiable and can be readily integrated into deep-learning systems. We formulate the deformation model by an auto-decoder and impose subspace constraints on the recovered latent space function in a frequency domain. Thanks to the state recurrence cue, we classify the reconstructed non-rigid surfaces based on their similarity and recover the period of the input sequence. Our N-NRSfM approach achieves competitive accuracy on widely-used benchmark sequences and high visual quality on various real videos. Apart from being a standalone technique, our method enables multiple applications including shape compression, completion and interpolation, among others. Combined with an encoder trained directly on 2D images, we perform scenario-specific monocular 3D shape reconstruction at interactive frame rates. To facilitate the reproducibility of the results and boost the new research direction, we open-source our code and provide trained models for research purposes (http://gvv.mpi-inf.mpg.de/projects/Neural_NRSfM/).
引用
收藏
页码:204 / 222
页数:19
相关论文
共 69 条
[1]   Force-Based Representation for Non-Rigid Shape and Elastic Model Estimation [J].
Agudo, Antonio ;
Moreno-Noguer, Francesc .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (09) :2137-2150
[2]   A scalable, efficient, and accurate solution to non-rigid structure from motion [J].
Agudo, Antonio ;
Moreno-Noguer, Francesc .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 167 :121-133
[3]   DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction [J].
Agudo, Antonio ;
Moreno-Noguer, Francesc .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1513-1521
[4]   Global Model with Local Interpretation for Dynamic Shape Reconstruction [J].
Agudo, Antonio ;
Moreno-Noguer, Francesc .
2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, :264-272
[5]  
Agudo A, 2016, IEEE WINT CONF APPL
[6]   Trajectory Space: A Dual Representation for Nonrigid Structure from Motion [J].
Akhter, Ijaz ;
Sheikh, Yaser ;
Khan, Sohaib ;
Kanade, Takeo .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (07) :1442-1456
[7]  
[Anonymous], 2014, BRIT MACH VIS C BMVC
[8]   Scalable Dense Monocular Surface Reconstruction [J].
Ansari, Mohammad Dawud ;
Golyani, Vladislav ;
Stricker, Didier .
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, :78-87
[9]   A Database and Evaluation Methodology for Optical Flow [J].
Baker, Simon ;
Scharstein, Daniel ;
Lewis, J. P. ;
Roth, Stefan ;
Black, Michael J. ;
Szeliski, Richard .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 92 (01) :1-31
[10]  
Bregler C, 2000, PROC CVPR IEEE, P690, DOI 10.1109/CVPR.2000.854941