FuRA: Fully Random Access Light Field Image Compression

被引:5
作者
Amirpour, Hadi [1 ]
Guillemot, Christine [2 ]
Timmerer, Christian [1 ]
机构
[1] Alpen Adria Univ, Christian Doppler Lab ATHENA, Klagenfurt, Austria
[2] Inria Rennes Bretagne Atlantique, 263 Ave Gen Leclerc, F-35042 Rennes, France
来源
2022 10TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP) | 2022年
基金
欧盟地平线“2020”;
关键词
Light field; coding; image representation; neural representation;
D O I
10.1109/EUVIP53989.2022.9922749
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Light fields are typically represented by multi-view images, and enable post-capture actions such as refocusing and perspective shift. To compress a light field image, its view images are typically converted into a pseudo video sequence (PVS) and the generated PVS is compressed using a video codec. However, when using the inter-coding tool of a video codec to exploit the redundancy among view images, the possibility to randomly access any view image is lost. On the other hand, when video codecs independently encode view images using the intra-coding tool, random access to view images is enabled, however, at the expense of a significant drop in the compression efficiency. To address this trade-off, we propose to use neural representations to represent 4D light fields. For each light field, a multi-layer perceptron (MLP) is trained to map the light field four dimensions to the color space, thus enabling random access even to pixels. To achieve higher compression efficiency, neural network compression techniques are deployed. The proposed method outperforms the compression efficiency of HEVC inter-coding, while providing random access to view images and even pixel values.
引用
收藏
页数:6
相关论文
共 27 条
[1]   Efficient Light Field Image Compression with Enhanced Random Access [J].
Amirpour, Hadi ;
Pinheiro, Antonio ;
Pereira, Manuela ;
Lopes, Fernando J. P. ;
Ghanbari, Mohammad .
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)
[2]  
Amirpour H, 2019, INT CONF ACOUST SPEE, P2402, DOI [10.1109/ICASSP.2019.8683215, 10.1109/icassp.2019.8683215]
[3]   Light field image compression with random access [J].
Amirpour, Hadi ;
Pinheiro, Antonio ;
Pereira, Manuela ;
Lopes, Fernando J. P. ;
Ghanbari, Mohammad .
2019 DATA COMPRESSION CONFERENCE (DCC), 2019, :553-553
[4]   High efficient snake order pseudo-sequence based light field image compression [J].
Amirpour, Hadi ;
Pereira, Manuela ;
Pinheiro, Antonio M. G. .
2018 DATA COMPRESSION CONFERENCE (DCC 2018), 2018, :397-397
[5]   Random access prediction structures for light field video coding with MV-HEVC [J].
Avramelos, Vasileios ;
De Praeter, Johan ;
Van Wallendael, Glenn ;
Lambert, Peter .
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (19-20) :12847-12867
[6]   X-Fields: Implicit Neural View-, Light- and Time-Image Interpolation [J].
Bemana, Mojtaba ;
Myszkowski, Karol ;
Seidel, Hans-Peter ;
Ritschel, Tobias .
ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (06)
[7]   NeRD: Neural Reflectance Decomposition from Image Collections [J].
Boss, Mark ;
Braun, Raphael ;
Jampani, Varun ;
Barron, Jonathan T. ;
Liu, Ce ;
Lensch, Hendrik P. A. .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :12664-12674
[8]  
Cha Zhang, 2000, Proceedings DCC 2000. Data Compression Conference, P253, DOI 10.1109/DCC.2000.838165
[9]   MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo [J].
Chen, Anpei ;
Xu, Zexiang ;
Zhao, Fuqiang ;
Zhang, Xiaoshuai ;
Xiang, Fanbo ;
Yu, Jingyi ;
Su, Hao .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :14104-14113
[10]  
de Carvalho MB, 2018, IEEE IMAGE PROC, P435, DOI 10.1109/ICIP.2018.8451684