FuRA: Fully Random Access Light Field Image Compression

Cited by: 5
Authors
Amirpour, Hadi [1]
Guillemot, Christine [2]
Timmerer, Christian [1]
Affiliations
[1] Alpen Adria Univ, Christian Doppler Lab ATHENA, Klagenfurt, Austria
[2] Inria Rennes Bretagne Atlantique, 263 Ave Gen Leclerc, F-35042 Rennes, France
Source
2022 10TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP) | 2022
Funding
European Union Horizon 2020
Keywords
Light field; coding; image representation; neural representation
DOI
10.1109/EUVIP53989.2022.9922749
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Light fields are typically represented by multi-view images and enable post-capture operations such as refocusing and perspective shift. To compress a light field image, its view images are usually converted into a pseudo video sequence (PVS), which is then compressed with a video codec. However, when the codec's inter-coding tools are used to exploit the redundancy among view images, random access to an arbitrary view image is lost. Conversely, when the codec encodes each view image independently with intra-coding, random access to view images is preserved, but at the cost of a significant drop in compression efficiency. To address this trade-off, we propose representing 4D light fields with neural representations. For each light field, a multi-layer perceptron (MLP) is trained to map the four light field dimensions to color space, which enables random access down to individual pixels. To achieve higher compression efficiency, neural network compression techniques are applied. The proposed method achieves higher compression efficiency than HEVC inter-coding while providing random access to view images and even to individual pixel values.
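The coordinate-to-color mapping described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the layer sizes, the ReLU and sigmoid activations, the normalization of the coordinates (u, v, x, y) to [0, 1], the helper names `init_mlp`, `query`, and `view_coords`, and the use of random, untrained weights. The sketch only shows why such a representation allows random access: a single pixel or a single view is decoded by evaluating the network at the requested coordinates, without decoding any other view.

```python
import numpy as np

def init_mlp(layer_sizes, seed=0):
    """Create random (untrained) weights; a real codec would train these per light field."""
    rng = np.random.default_rng(seed)
    return [
        (rng.normal(0.0, np.sqrt(2.0 / n_in), size=(n_in, n_out)), np.zeros(n_out))
        for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:])
    ]

def query(params, coords):
    """Map normalized (u, v, x, y) coordinates of shape (N, 4) to RGB values in [0, 1]."""
    h = np.asarray(coords, dtype=np.float64)
    for w, b in params[:-1]:
        h = np.maximum(h @ w + b, 0.0)          # hidden layers with ReLU (assumed)
    w, b = params[-1]
    return 1.0 / (1.0 + np.exp(-(h @ w + b)))   # sigmoid keeps colors in [0, 1] (assumed)

def view_coords(u, v, height, width):
    """All (u, v, x, y) coordinates of one view, in row-major pixel order."""
    ys, xs = np.meshgrid(
        np.linspace(0.0, 1.0, height), np.linspace(0.0, 1.0, width), indexing="ij"
    )
    grid = np.stack([np.full_like(xs, u), np.full_like(xs, v), xs, ys], axis=-1)
    return grid.reshape(-1, 4)

params = init_mlp([4, 64, 64, 3])

# Random access to one pixel: a single forward pass at one coordinate.
pixel = query(params, [[0.5, 0.5, 0.25, 0.75]])

# Random access to one view: evaluate only that view's coordinates.
view = query(params, view_coords(0.5, 0.5, 8, 8)).reshape(8, 8, 3)
```

In this sketch the compressed bitstream would correspond to the (pruned or quantized) network weights rather than to coded frames, which is why no inter-view decoding dependency arises.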
Pages: 6