Deep Learning-based Semantic Analysis of Sparse Light Field Ray Sets

被引：0

作者：

Chelli, Kelvin ^{[1
]}

Tamboli, Roopak R. ^{[1
]}

Herfet, Thorsten ^{[1
]}

机构：

[1] Saarland Informat Campus, D-66123 Saarbrucken, Germany

来源：

IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP) | 2021年

基金：

美国国家科学基金会;

关键词：

Light fields; light field analysis; deep learning; froxels; froxel histograms; IMAGE;

D O I：

10.1109/MMSP53017.2021.9733489

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

With the emergence of various light field (LF) acquisition systems and of novel techniques for processing and visualizing LFs, end-to-end LF systems start to head for the consumer market. Towards this, the semantic analysis of LFs can play a crucial role in LF processing (e.g. compression, storage and transmission), and in the standardization of LF representation schemes across various use cases. In this regard, we earlier have introduced fristograms as a tool to integrate semantics into LF processing. Fristograms collect sets of rays within a volume of a number of pixels in all 3 directions (horizontal, vertical and disparity) and thus enable semantic analysis based on the ray sets, and consequently semantic processing of LFs. Consequently, fristograms enable the application of filtering techniques considering the underlying characteristic of the scene (e.g. differentiate between Lambertian and non-Lambertian, occluded and dis-occluded regions in the scene). Motivated by the earlier results through statistical analysis of froxels enabling a significant reduction in number of rays while maintaining quality, in this paper, we explore learning-based analysis of froxels. Specifically, we propose to use a deep learning network to classify material properties (such as Lambertian, non-Lambertian, and outliers). Once the classification is done, the LF is filtered semantically. Preliminary results show that compared to the statistical ray analysis of froxels, a learning-based approach can reduce the number of rays even further, yet maintain the visual quality of the LF as measured by well-known quality metrics.

引用

页数：6

共 24 条

[1] Alain M, 2017, IEEE INT WORKSH MULT
[2] Light field intrinsics with a deep encoder-decoder network
Alperovich, Anna
Johannsen, Ole
Strecke, Michael
Goldluecke, Bastian
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 9145 - 9154
[3] [Anonymous], 2021, COD REPR IMM 2
[4] Broxton M., 2020, ACM SIGGRAPH 2020 Immersive Pavilion, P1
[5] Chelli K., 2019, P 16 ACM SIGGRAPH EU
[6] Cserkaszky A., 2017, P INT C 3D IMM IC3D, P1
[7] LiFF: Light Field Features in Scale and Depth
Dansereau, Donald G.
Girod, Bernd
Wetzstein, Gordon
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8034 - 8043
[8] Decoding, Calibration and Rectification for Lenselet-Based Plenoptic Cameras
Dansereau, Donald G.
Pizarro, Oscar
Williams, Stefan B.
[J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1027 - 1034
[9] Evans A., SIGGRAPH 2015
[10] Fujita S, 2018, IEEE IMAGE PROC, P2600, DOI 10.1109/ICIP.2018.8451419

← 1 2 3 →