All-in-depth via Cross-baseline Light Field Camera

Cited by: 0
Authors
Jin, Dingjian [1 ]
Zhang, Anke [1 ]
Wu, Jiamin [1 ]
Wu, Gaochang [2 ]
Wang, Haoqian [1 ]
Fang, Lu [1 ]
Affiliations
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Northeastern Univ, Shenyang, Peoples R China
Source
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA | 2020
Keywords
light field; depth map; EPI domain; cross-baseline; PATCHMATCH;
DOI
10.1145/3394171.3413974
CLC number
TP18 [Theory of Artificial Intelligence];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The light-field (LF) camera holds great promise for passive, general-purpose depth estimation thanks to its high angular resolution, yet it suffers from a small baseline for distant regions. Conversely, while a stereo solution with a large baseline is superior for distant scenarios, its limited angular resolution becomes problematic for near objects. Aiming at an all-in-depth solution, we propose a cross-baseline LF camera that pairs a commercial LF camera with a monocular camera, naturally forming a 'stereo camera' that compensates for the LF camera's short baseline. The idea is simple yet non-trivial, owing to the significant angular-resolution gap and baseline gap between the LF and stereo cameras. Fusing the two depth maps from the LF and stereo modules in the spatial domain is unreliable, since it depends on the imprecisely predicted depth both to distinguish the close and distant ranges and to determine the fusion weights. Instead, adopting a unified representation of the LF and monocular sub-aperture views in the epipolar plane image (EPI) domain, we show that for each pixel, the minimum variance over different shearing degrees in the EPI domain estimates its depth with the highest fidelity. By minimizing this minimum variance, the depth error is minimized accordingly. The insight is that the computed minimum variance in the EPI domain has higher fidelity than the predicted depth in the spatial domain. Extensive experiments demonstrate the superiority of our cross-baseline LF camera in providing high-quality all-in-depth maps from 0.2 m to 100 m.
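The minimum-variance shearing test described in the abstract can be sketched as follows. This is a minimal illustration on a single synthetic EPI, not the paper's implementation: it assumes linear interpolation for the shear and a brute-force sweep over candidate disparities, and the function name `epi_depth` and its parameters are hypothetical.

```python
import numpy as np

def epi_depth(epi, disparities):
    """Per-pixel disparity from one EPI via minimum variance.

    epi: (V, X) array -- V angular views (rows), X spatial samples.
    disparities: candidate shear slopes (pixels per view step).
    For each candidate, the EPI is sheared so a scene point at that
    disparity lines up vertically; photo-consistency is measured as
    the variance along the angular axis, and the disparity with the
    minimum variance is kept for each pixel.
    """
    V, X = epi.shape
    center = V // 2
    xs = np.arange(X)
    best_var = np.full(X, np.inf)
    best_d = np.zeros(X)
    for d in disparities:
        # Shear: sample view v at x + d * (v - center).
        sheared = np.empty_like(epi)
        for v in range(V):
            sheared[v] = np.interp(xs + d * (v - center), xs, epi[v])
        var = sheared.var(axis=0)  # low variance = consistent line
        mask = var < best_var
        best_var[mask] = var[mask]
        best_d[mask] = d
    return best_d
```

On a synthetic EPI built from a smooth texture shifted by a constant disparity per view, the sweep recovers that disparity at interior pixels (boundary pixels are affected by interpolation clamping).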
Pages: 3559-3567
Page count: 9
Cited references
34 records
[1]
[Anonymous], DOI 10.1109/CVPR.2016.614
[2]   Fast Edge-Preserving PatchMatch for Large Displacement Optical Flow [J].
Bao, Linchao ;
Yang, Qingxiong ;
Jin, Hailin .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3534-3541
[3]   The Fast Bilateral Solver [J].
Barron, Jonathan T. ;
Poole, Ben .
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :617-632
[4]   The Light Field Camera: Extended Depth of Field, Aliasing, and Superresolution [J].
Bishop, Tom E. ;
Favaro, Paolo .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (05) :972-986
[5]   The robust estimation of multiple motions: Parametric and piecewise-smooth flow fields [J].
Black, MJ ;
Anandan, P .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1996, 63 (01) :75-104
[6]   Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation [J].
Brox, Thomas ;
Malik, Jitendra .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :500-513
[7]   Lucas/Kanade meets Horn/Schunck: Combining local and global optic flow methods [J].
Bruhn A. ;
Weickert J. ;
Schnörr C. .
International Journal of Computer Vision, 2005, 61 (3) :1-21
[8]   Pyramid Stereo Matching Network [J].
Chang, Jia-Ren ;
Chen, Yong-Sheng .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5410-5418
[9]   Light Field Stereo Matching Using Bilateral Statistics of Surface Cameras [J].
Chen, Can ;
Lin, Haiting ;
Yu, Zhan ;
Kang, Sing Bing ;
Yu, Jingyi .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1518-1525
[10]   Accurate Light Field Depth Estimation With Superpixel Regularization Over Partially Occluded Regions [J].
Chen, Jie ;
Hou, Junhui ;
Ni, Yun ;
Chau, Lap-Pui .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (10) :4889-4900