Neural Dense Non-Rigid Structure from Motion with Latent Space Constraints

被引：37

作者：

Sidhu, Vikramjit ^{[1
,2
]}

Tretschk, Edgar ^{[1
]}

Golyanik, Vladislav ^{[1
]}

Agudo, Antonio ^{[3
]}

Theobalt, Christian ^{[1
]}

机构：

[1] Max Planck Inst Informat, SIC, Saarbrucken, Germany

[2] Saarland Univ, SIC, Saarbrucken, Germany

[3] CSIC UPC, Inst Robot & Informat Ind, Barcelona, Spain

来源：

COMPUTER VISION - ECCV 2020, PT XVI | 2020年 / 12361卷

关键词：

Neural Non-Rigid Structure from Motion; Sequence period detection; Latent space constraints; Deformation auto-decoder; SHAPE;

D O I：

10.1007/978-3-030-58517-4_13

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We introduce the first dense neural non-rigid structure from motion (N-NRSfM) approach, which can be trained end-to-end in an unsupervised manner from 2D point tracks. Compared to the competing methods, our combination of loss functions is fully-differentiable and can be readily integrated into deep-learning systems. We formulate the deformation model by an auto-decoder and impose subspace constraints on the recovered latent space function in a frequency domain. Thanks to the state recurrence cue, we classify the reconstructed non-rigid surfaces based on their similarity and recover the period of the input sequence. Our N-NRSfM approach achieves competitive accuracy on widely-used benchmark sequences and high visual quality on various real videos. Apart from being a standalone technique, our method enables multiple applications including shape compression, completion and interpolation, among others. Combined with an encoder trained directly on 2D images, we perform scenario-specific monocular 3D shape reconstruction at interactive frame rates. To facilitate the reproducibility of the results and boost the new research direction, we open-source our code and provide trained models for research purposes (http://gvv.mpi-inf.mpg.de/projects/Neural_NRSfM/).

引用

页码：204 / 222

页数：19

共 69 条

[41] C3DPO: Canonical 3D Pose Networks for Non-Rigid Structure From Motion [J].

Novotny, David ;

Ravi, Nikhila ;

Graham, Benjamin ;

Neverova, Natalia ;

Vedaldi, Andrea .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7687-7696

[42]

Östlund J, 2012, LECT NOTES COMPUT SC, V7574, P412, DOI 10.1007/978-3-642-33712-3_30

[43] Optimal Metric Projections for Deformable and Articulated Structure-from-Motion [J].

Paladini, Marco ;

Del Bue, Alessio ;

Xavier, Joao ;

Agapito, Lourdes ;

Stosic, Marko ;

Dodig, Marija .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 96 (02) :252-276

[44] DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation [J].

Park, Jeong Joon ;

Florence, Peter ;

Straub, Julian ;

Newcombe, Richard ;

Lovegrove, Steven .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :165-174

[45]

Paszke A, 2019, ADV NEUR IN, V32

[46] On lines and planes of closest fit to systems of points in space. [J].

Pearson, Karl .

PHILOSOPHICAL MAGAZINE, 1901, 2 (7-12) :559-572

[47] Geometry-Aware Network for Non-Rigid Shape Prediction from a Single View [J].

Pumarola, A. ;

Agudo, A. ;

Porzi, L. ;

Sanfeliu, A. ;

Lepetit, V. ;

Moreno-Noguer, F. .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4681-4690

[48]

RIEDMILLER M, 1993, 1993 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, P586, DOI 10.1109/ICNN.1993.298623

[49] LEARNING REPRESENTATIONS BY BACK-PROPAGATING ERRORS [J].

RUMELHART, DE ;

HINTON, GE ;

WILLIAMS, RJ .

NATURE, 1986, 323 (6088) :533-536

[50]

Russell C., 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P3009, DOI 10.1109/CVPR.2011.5995383

← 1 2 3 4 5 6 7 →