Learnable Polarization-multiplexed Modulation Imager for Depth from Defocus

被引:1
作者
Huang, Zhiwei [1 ]
Dai, Mingyou [1 ]
Yue, Tao [1 ]
Hu, Xuemei [1 ]
机构
[1] Nanjing Univ, Sch Elect Sci & Engn, Nanjing 210023, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL PHOTOGRAPHY, ICCP | 2023年
关键词
Computational Photography; Polarization-multiplexed; Depth from Defocus; SHAPE;
D O I
10.1109/ICCP56744.2023.10233777
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Estimating depth from a single snapshot image with defocus information is still a tricky problem for the ill-posedness introduced by the limited depth cues implied in the defocus images. This paper proposes a Polarization-multiplexed Modulation Imager (PoMI) to fully utilize the multiplexed polarization channels for capturing more depth cues with a single snapshot image. The polarization-dependent modulator, i.e., Liquid Crystal Spatial Light Modulator (LC-SLM), is applied to modulate the depth information into polarization channels. A differentiable polarization-dependent modulation camera model is proposed, combined with the Polarization-Driven Attention Network, to enable the joint system optimization by end-to-end training. Extensive tests have been applied to the synthetic datasets to verify the effectiveness of the proposed method. A system prototype is built to conduct real experiments demonstrating the feasibility of the proposed method for natural scenes.
引用
收藏
页数:12
相关论文
共 68 条
[31]   Deep Polarization Cues for Transparent Object Segmentation [J].
Kalra, Agastya ;
Taamazyan, Vage ;
Rao, Supreeth Krishna ;
Venkataraman, Kartik ;
Raskar, Ramesh ;
Kadambi, Achuta .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8599-8608
[32]   Shape from Polarization for Complex Scenes in the Wild [J].
Lei, Chenyang ;
Qi, Chenyang ;
Xie, Jiaxin ;
Fan, Na ;
Koltun, Vladlen ;
Chen, Qifeng .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :12622-12631
[33]   Image and depth from a conventional camera with a coded aperture [J].
Levin, Anat ;
Fergus, Rob ;
Durand, Fredo ;
Freeman, William T. .
ACM TRANSACTIONS ON GRAPHICS, 2007, 26 (03)
[34]   Prediction of optical modulation properties of twisted-nematic liquid-crystal display by improved measurement of Jones matrix [J].
Ma, Baiheng ;
Yao, Baoli ;
Ye, Tong ;
Lei, Ming .
JOURNAL OF APPLIED PHYSICS, 2010, 107 (07)
[35]   Compressive Light Field Photography using Overcomplete Dictionaries and Optimized Projections [J].
Marwah, Kshitij ;
Wetzstein, Gordon ;
Bando, Yosuke ;
Raskar, Ramesh .
ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (04)
[36]   A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation [J].
Mayer, Nikolaus ;
Ilg, Eddy ;
Hausser, Philip ;
Fischer, Philipp ;
Cremers, Daniel ;
Dosovitskiy, Alexey ;
Brox, Thomas .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4040-4048
[37]   Glass Segmentation using Intensity and Spectral Polarization Cues [J].
Mei, Haiyang ;
Dong, Bo ;
Dong, Wen ;
Yang, Jiaxi ;
Baek, Seung-Hwan ;
Heide, Felix ;
Peers, Pieter ;
Wei, Xiaopeng ;
Yang, Xin .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :12612-12621
[38]   On deep learning techniques to boost monocular depth estimation for autonomous navigation [J].
Mendes, Raul de Queiroz ;
Ribeiro, Eduardo Godinho ;
Rosa, Nicolas dos Santos ;
Grassi, Valdir, Jr. .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2021, 136
[39]  
Metni N, 2005, IEEE DECIS CONTR P, P6078
[40]   Jones matrix method for predicting and optimizing the optical modulation properties of a liquid-crystal display [J].
Moreno, I ;
Velásquez, P ;
Fernández-Pousa, CR ;
Sánchez-López, MM .
JOURNAL OF APPLIED PHYSICS, 2003, 94 (06) :3697-3702