Deep Non-Rigid Structure From Motion With Missing Data

被引:5
作者
Kong, Chen [1 ]
Lucey, Simon [1 ]
机构
[1] Carnegie Mellon Univ, Robot Inst, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
Three-dimensional displays; Shape; Two dimensional displays; Encoding; Neural networks; Image reconstruction; Structure from motion; Nonrigid structure from motion; hierarchical sparse coding; deep neural network; reconstructability; missing data; THRESHOLDING ALGORITHM; SHAPE;
D O I
10.1109/TPAMI.2020.2997026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Non-rigid structure from motion (NRSfM) refers to the problem of reconstructing cameras and the 3D point cloud of a non-rigid object from an ensemble of images with 2D correspondences. Current NRSfM algorithms are limited from two perspectives: (i) the number of images, and (ii) the type of shape variability they can handle. These difficulties stem from the inherent conflict between the condition of the system and the degrees of freedom needing to be modeled - which has hampered its practical utility for many applications within vision. In this paper we propose a novel hierarchical sparse coding model for NRSFM which can overcome (i) and (ii) to such an extent, that NRSFM can be applied to problems in vision previously thought too ill posed. Our approach is realized in practice as the training of an unsupervised deep neural network (DNN) auto-encoder with a unique architecture that is able to disentangle pose from 3D structure. Using modern deep learning computational platforms allows us to solve NRSfM problems at an unprecedented scale and shape complexity. Our approach has no 3D supervision, relying solely on 2D point correspondences. Further, our approach is also able to handle missing/occluded 2D points without the need for matrix completion. Extensive experiments demonstrate the impressive performance of our approach where we exhibit superior precision and robustness against all available state-of-the-art works in some instances by an order of magnitude. We further propose a new quality measure (based on the network weights) which circumvents the need for 3D ground-truth to ascertain the confidence we have in the reconstructability. We believe our work to be a significant advance over state-of-the-art in NRSFM.
引用
收藏
页码:4365 / 4377
页数:13
相关论文
共 50 条
[21]   A Benchmark and Evaluation of Non-Rigid Structure from Motion [J].
Sebastian Hoppe Nesgaard Jensen ;
Mads Emil Brix Doest ;
Henrik Aanæs ;
Alessio Del Bue .
International Journal of Computer Vision, 2021, 129 :882-899
[22]   Online Non-rigid Structure-from-Motion based on a Keyframe Representation of History [J].
Donne, Simon ;
Jovanov, Ljubomir ;
Goossens, Bart ;
Philips, Wilfried ;
Pizurica, Aleksandra .
PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, :723-731
[23]   Multi-body Non-rigid Structure-from-Motion [J].
Kumar, Suryansh ;
Dai, Yuchao ;
Li, Hongdong .
PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, :148-156
[24]   NON-RIGID STRUCTURE FROM MOTION VIA SPARSE SELF-EXPRESSIVE REPRESENTATION [J].
Hu, Junjie ;
Aoki, Terumasa .
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, :4537-4541
[25]   Model Evolution: An Incremental Approach to Non-Rigid Structure from Motion [J].
Zhu, Shengqi ;
Zhang, Li ;
Smith, Brandon M. .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :1165-1172
[26]   Local Non-Rigid Structure-from-Motion from Diffeomorphic Mappings [J].
Parashar, Shaifali ;
Salzmann, Mathieu ;
Fua, Pascal .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2056-2064
[27]   Non-Rigid Shape From Water [J].
Kuo, Meng-Yu Jennifer ;
Kawahara, Ryo ;
Nobuhara, Shohei ;
Nishino, Ko .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (07) :2220-2232
[28]   A Simple Prior-Free Method for Non-rigid Structure-from-Motion Factorization [J].
Dai, Yuchao ;
Li, Hongdong ;
He, Mingyi .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 107 (02) :101-122
[29]   An Accelerated Procrustean Markov Process Model With Coherent Constraint for Non-Rigid Structure From Motion [J].
Zhang, Ying ;
Chen, Xia ;
Sun, Zhan-Li ;
Lam, Kin-Man ;
Zeng, Zhigang .
IEEE ACCESS, 2019, 7 :145013-145021
[30]   NON-RIGID STRUCTURE FROM MOTION WITH INCREMENTAL SHAPE PRIOR [J].
Tao, Lili ;
Matuszewski, Bogdan J. ;
Mein, Stephen J. .
2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, :1753-1756