Deep learning integral imaging for three-dimensional visualization, object detection, and segmentation

被引:13
作者
Yi, Faliu [1 ]
Jeong, Ongee [2 ]
Moon, Inkyu [2 ]
Javidi, Bahram [3 ]
机构
[1] Spectral MD Inc, 2521 Mckinney Ave 1000, Dallas, TX 75201 USA
[2] DGIST, Dept Robot Engn, Daegu 42988, South Korea
[3] Univ Connecticut, Dept Elect & Comp Engn, U 2157, Storrs, CT 06269 USA
基金
新加坡国家研究基金会;
关键词
3D integral imaging; 3D image reconstruction; Target visualization; Instance segmentation; Convolutional neural networks; OCCLUDED OBJECTS; 3-D; RECONSTRUCTION; RECOGNITION; DISPLAY;
D O I
10.1016/j.optlaseng.2021.106695
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
A depth slice image that is computationally reconstructed from an integral imaging system consists of focused and out of focus areas. The unfocused areas affect three-dimensional (3D) image analyses and visualization including 3D object detection, extraction, and tracking. In this work, we present a deep learning integral imaging system that can reconstruct a 3D image without the out of focus areas and can accomplish target detection and segmentation at the same time. A Mask-Regional Convolutional Neural Network (Mask-RCNN) deep learning algorithm was trained using a public dataset and applied to detect and segment multiple targets in two-dimensional (2D) elemental images in the integral imaging system. The 3D images were then reconstructed using segmented elemental images with the target detected. The proposed method works well in the presence of partial occlusions. Experimental results show the performance of the proposed scheme.
引用
收藏
页数:8
相关论文
共 43 条
[1]   Automatic 3D object localization and isolation using computational integral imaging [J].
Aloni, Doron ;
Yitzhaky, Yitzhak .
APPLIED OPTICS, 2015, 54 (22) :6717-6724
[2]  
[Anonymous], 2017, IEEE INT C COMPUT VI, DOI [10.1109/iccv.201, DOI 10.1109/ICCV.2017.322]
[3]   Integral Three-Dimensional Television Using a 33-Megapixel Imaging System [J].
Arai, Jun ;
Okano, Fumio ;
Kawakita, Masahiro ;
Okui, Makoto ;
Haino, Yasuyuki ;
Yoshimura, Makoto ;
Furuya, Masato ;
Sato, Masahito .
JOURNAL OF DISPLAY TECHNOLOGY, 2010, 6 (10) :422-430
[4]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[5]   Deep Learning for Improving the Robustness of Image Encryption [J].
Chen, Jing ;
Li, Xiao-Wei ;
Wang, Qiong-Hua .
IEEE ACCESS, 2019, 7 :181083-181091
[6]   Three dimensional imaging with randomly distributed sensors [J].
DaneshPanah, Mehdi ;
Javidi, Bahram ;
Watson, Edward A. .
OPTICS EXPRESS, 2008, 16 (09) :6368-6377
[7]   Cross-talk-free integral imaging three-dimensional display based on a pyramid pinhole array [J].
Deng, Huan ;
Wang, Qiong-Hua ;
Wu, Fei ;
Luo, Cheng-Gao ;
Liu, Yao .
PHOTONICS RESEARCH, 2015, 3 (04) :173-176
[8]   Deep Learning: Methods and Applications [J].
Deng, Li ;
Yu, Dong .
FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING, 2013, 7 (3-4) :I-387
[9]   Dermatologist-level classification of skin cancer with deep neural networks [J].
Esteva, Andre ;
Kuprel, Brett ;
Novoa, Roberto A. ;
Ko, Justin ;
Swetter, Susan M. ;
Blau, Helen M. ;
Thrun, Sebastian .
NATURE, 2017, 542 (7639) :115-+
[10]   Comparison of passive ranging integral imaging and active imaging digital holography for three-dimensional object recognition [J].
Frauel, Y ;
Tajahuerce, E ;
Matoba, O ;
Castro, A ;
Javidi, B .
APPLIED OPTICS, 2004, 43 (02) :452-462