Neural Dense Non-Rigid Structure from Motion with Latent Space Constraints

被引:37
作者
Sidhu, Vikramjit [1 ,2 ]
Tretschk, Edgar [1 ]
Golyanik, Vladislav [1 ]
Agudo, Antonio [3 ]
Theobalt, Christian [1 ]
机构
[1] Max Planck Inst Informat, SIC, Saarbrucken, Germany
[2] Saarland Univ, SIC, Saarbrucken, Germany
[3] CSIC UPC, Inst Robot & Informat Ind, Barcelona, Spain
来源
COMPUTER VISION - ECCV 2020, PT XVI | 2020年 / 12361卷
关键词
Neural Non-Rigid Structure from Motion; Sequence period detection; Latent space constraints; Deformation auto-decoder; SHAPE;
D O I
10.1007/978-3-030-58517-4_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce the first dense neural non-rigid structure from motion (N-NRSfM) approach, which can be trained end-to-end in an unsupervised manner from 2D point tracks. Compared to the competing methods, our combination of loss functions is fully-differentiable and can be readily integrated into deep-learning systems. We formulate the deformation model by an auto-decoder and impose subspace constraints on the recovered latent space function in a frequency domain. Thanks to the state recurrence cue, we classify the reconstructed non-rigid surfaces based on their similarity and recover the period of the input sequence. Our N-NRSfM approach achieves competitive accuracy on widely-used benchmark sequences and high visual quality on various real videos. Apart from being a standalone technique, our method enables multiple applications including shape compression, completion and interpolation, among others. Combined with an encoder trained directly on 2D images, we perform scenario-specific monocular 3D shape reconstruction at interactive frame rates. To facilitate the reproducibility of the results and boost the new research direction, we open-source our code and provide trained models for research purposes (http://gvv.mpi-inf.mpg.de/projects/Neural_NRSfM/).
引用
收藏
页码:204 / 222
页数:19
相关论文
共 69 条
[41]   C3DPO: Canonical 3D Pose Networks for Non-Rigid Structure From Motion [J].
Novotny, David ;
Ravi, Nikhila ;
Graham, Benjamin ;
Neverova, Natalia ;
Vedaldi, Andrea .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7687-7696
[42]  
Östlund J, 2012, LECT NOTES COMPUT SC, V7574, P412, DOI 10.1007/978-3-642-33712-3_30
[43]   Optimal Metric Projections for Deformable and Articulated Structure-from-Motion [J].
Paladini, Marco ;
Del Bue, Alessio ;
Xavier, Joao ;
Agapito, Lourdes ;
Stosic, Marko ;
Dodig, Marija .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 96 (02) :252-276
[44]   DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation [J].
Park, Jeong Joon ;
Florence, Peter ;
Straub, Julian ;
Newcombe, Richard ;
Lovegrove, Steven .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :165-174
[45]  
Paszke A, 2019, ADV NEUR IN, V32
[46]   On lines and planes of closest fit to systems of points in space. [J].
Pearson, Karl .
PHILOSOPHICAL MAGAZINE, 1901, 2 (7-12) :559-572
[47]   Geometry-Aware Network for Non-Rigid Shape Prediction from a Single View [J].
Pumarola, A. ;
Agudo, A. ;
Porzi, L. ;
Sanfeliu, A. ;
Lepetit, V. ;
Moreno-Noguer, F. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4681-4690
[48]  
RIEDMILLER M, 1993, 1993 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, P586, DOI 10.1109/ICNN.1993.298623
[49]   LEARNING REPRESENTATIONS BY BACK-PROPAGATING ERRORS [J].
RUMELHART, DE ;
HINTON, GE ;
WILLIAMS, RJ .
NATURE, 1986, 323 (6088) :533-536
[50]  
Russell C., 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3009, DOI 10.1109/CVPR.2011.5995383