Learning Reliable Gradients From Undersampled Circular Light Field for 3D Reconstruction

被引:5
作者
Song, Zhengxi [1 ]
Wang, Xue [1 ]
Zhu, Hao [2 ]
Zhou, Guoqing [1 ]
Wang, Qing [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[2] Nanjing Univ, Sch Elect Sci & Engn, Nanjing 210023, Jiangsu, Peoples R China
关键词
Three-dimensional displays; Trajectory; Light fields; Reliability; Cameras; Image reconstruction; Estimation; 3d reconstruction; circular light field; CNN plus LSTM; circular epipolar plane volume (CEPV); NETWORK; REGISTRATION; STEREO; DEPTH;
D O I
10.1109/TVCG.2022.3206207
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The paper presents a 3D reconstruction algorithm from an undersampled circular light field (LF). With an ultra-dense angular sampling rate, every scene point captured by a circular LF corresponds to a smooth trajectory in the circular epipolar plane volume (CEPV). Thus per-pixel disparities can be calculated by retrieving the local gradients of the CEPV-trajectories. However, the continuous curve will be broken up into discrete segments in an undersampled circular LF, which leads to a noticeable deterioration of the 3D reconstruction accuracy. We observe that the coherent structure is still embedded in the discrete segments. With less noise and ambiguity, the scene points can be reconstructed using gradients from reliable epipolar plane image (EPI) regions. By analyzing the geometric characteristics of the coherent structure in the CEPV, both the trajectory itself and its gradients could be modeled as 3D predictable series. Thus a mask-guided CNN+LSTM network is proposed to learn the mapping from the CEPV with a lower angular sampling rate to the gradients under a higher angular sampling rate. To segment the reliable regions, the reliable-mask-based loss that assesses the difference between learned gradients and ground truth gradients is added to the loss function. We construct a synthetic circular LF dataset with ground truth for depth and foreground/background segmentation to train the network. Moreover, a real-scene circular LF dataset is collected for performance evaluation. Experimental results on both public and self-constructed datasets demonstrate the superiority of the proposed method over existing state-of-the-art methods.
引用
收藏
页码:5194 / 5207
页数:14
相关论文
共 47 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
B. Foundation, 2017, Blender
[3]   When is the shape of a scene unique given its light-field: A fundamental theorem of 3D vision? [J].
Baker, S ;
Sim, T ;
Kanade, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (01) :100-109
[4]   A METHOD FOR REGISTRATION OF 3-D SHAPES [J].
BESL, PJ ;
MCKAY, ND .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (02) :239-256
[5]   EPIPOLAR-PLANE IMAGE-ANALYSIS - AN APPROACH TO DETERMINING STRUCTURE FROM MOTION [J].
BOLLES, RC ;
BAKER, HH ;
MARIMONT, DH .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1987, 1 (01) :7-55
[6]   DISTANCE TRANSFORMATIONS IN ARBITRARY DIMENSIONS [J].
BORGEFORS, G .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1984, 27 (03) :321-345
[7]   Light Field Stereo Matching Using Bilateral Statistics of Surface Cameras [J].
Chen, Can ;
Lin, Haiting ;
Yu, Zhan ;
Kang, Sing Bing ;
Yu, Jingyi .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1518-1525
[8]   MVSNet plus plus : Learning Depth-Based Attention Pyramid Features for Multi-View Stereo [J].
Chen, Po-Heng ;
Yang, Hsiao-Chien ;
Chen, Kuan-Wen ;
Chen, Yong-Sheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :7261-7273
[9]   OBJECT MODELING BY REGISTRATION OF MULTIPLE RANGE IMAGES [J].
CHEN, Y ;
MEDIONI, G .
IMAGE AND VISION COMPUTING, 1992, 10 (03) :145-155
[10]  
Cignoni Paolo., 2008, EUR IT CHAPT C, V2008, P129, DOI [DOI 10.2312/LOCALCHAPTEREVENTS/ITALCHAP/ITALIANCHAPCONF2008/129-136, 10.2312/LocalChapterEvents/ItalChap/ItalianChapConf2008, DOI 10.2312/LOCALCHAPTEREVENTS]