Learnable Polarization-multiplexed Modulation Imager for Depth from Defocus

被引：1

作者：

Huang, Zhiwei ^{[1
]}

Dai, Mingyou ^{[1
]}

Yue, Tao ^{[1
]}

Hu, Xuemei ^{[1
]}

机构：

[1] Nanjing Univ, Sch Elect Sci & Engn, Nanjing 210023, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL PHOTOGRAPHY, ICCP | 2023年

关键词：

Computational Photography; Polarization-multiplexed; Depth from Defocus; SHAPE;

D O I：

10.1109/ICCP56744.2023.10233777

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Estimating depth from a single snapshot image with defocus information is still a tricky problem for the ill-posedness introduced by the limited depth cues implied in the defocus images. This paper proposes a Polarization-multiplexed Modulation Imager (PoMI) to fully utilize the multiplexed polarization channels for capturing more depth cues with a single snapshot image. The polarization-dependent modulator, i.e., Liquid Crystal Spatial Light Modulator (LC-SLM), is applied to modulate the depth information into polarization channels. A differentiable polarization-dependent modulation camera model is proposed, combined with the Polarization-Driven Attention Network, to enable the joint system optimization by end-to-end training. Extensive tests have been applied to the synthetic datasets to verify the effectiveness of the proposed method. A system prototype is built to conduct real experiments demonstrating the feasibility of the proposed method for natural scenes.

引用

页数：12

共 68 条

[31] Deep Polarization Cues for Transparent Object Segmentation [J].

Kalra, Agastya ;

Taamazyan, Vage ;

Rao, Supreeth Krishna ;

Venkataraman, Kartik ;

Raskar, Ramesh ;

Kadambi, Achuta .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8599-8608

[32] Shape from Polarization for Complex Scenes in the Wild [J].

Lei, Chenyang ;

Qi, Chenyang ;

Xie, Jiaxin ;

Fan, Na ;

Koltun, Vladlen ;

Chen, Qifeng .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :12622-12631

[33] Image and depth from a conventional camera with a coded aperture [J].

Levin, Anat ;

Fergus, Rob ;

Durand, Fredo ;

Freeman, William T. .

ACM TRANSACTIONS ON GRAPHICS, 2007, 26 (03)

[34] Prediction of optical modulation properties of twisted-nematic liquid-crystal display by improved measurement of Jones matrix [J].

Ma, Baiheng ;

Yao, Baoli ;

Ye, Tong ;

Lei, Ming .

JOURNAL OF APPLIED PHYSICS, 2010, 107 (07)

[35] Compressive Light Field Photography using Overcomplete Dictionaries and Optimized Projections [J].

Marwah, Kshitij ;

Wetzstein, Gordon ;

Bando, Yosuke ;

Raskar, Ramesh .

ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (04)

[36] A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation [J].

Mayer, Nikolaus ;

Ilg, Eddy ;

Hausser, Philip ;

Fischer, Philipp ;

Cremers, Daniel ;

Dosovitskiy, Alexey ;

Brox, Thomas .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4040-4048

[37] Glass Segmentation using Intensity and Spectral Polarization Cues [J].

Mei, Haiyang ;

Dong, Bo ;

Dong, Wen ;

Yang, Jiaxi ;

Baek, Seung-Hwan ;

Heide, Felix ;

Peers, Pieter ;

Wei, Xiaopeng ;

Yang, Xin .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :12612-12621

[38] On deep learning techniques to boost monocular depth estimation for autonomous navigation [J].

Mendes, Raul de Queiroz ;

Ribeiro, Eduardo Godinho ;

Rosa, Nicolas dos Santos ;

Grassi, Valdir, Jr. .

ROBOTICS AND AUTONOMOUS SYSTEMS, 2021, 136

[39]

Metni N, 2005, IEEE DECIS CONTR P, P6078

[40] Jones matrix method for predicting and optimizing the optical modulation properties of a liquid-crystal display [J].

Moreno, I ;

Velásquez, P ;

Fernández-Pousa, CR ;

Sánchez-López, MM .

JOURNAL OF APPLIED PHYSICS, 2003, 94 (06) :3697-3702

← 1 2 3 4 5 6 7 →