Learning-based Image Coding: Early Solutions Reviewing and Subjective Quality Evaluation

Cited by: 16
Authors
Ascenso, Joao [1]
Akyazi, Pinar [2]
Pereira, Fernando [1]
Ebrahimi, Touradj [2]
Affiliations
[1] Univ Lisbon, Inst Telecomunicacoes, Inst Super Tecn, Lisbon, Portugal
[2] Ecole Polytech Fed Lausanne EPFL, Multimedia Signal Proc Grp, Lausanne, Switzerland
Source
OPTICS, PHOTONICS AND DIGITAL TECHNOLOGIES FOR IMAGING APPLICATIONS VI | 2021 / Vol. 11353
Keywords
deep learning; image compression; performance assessment; subjective quality
DOI
10.1117/12.2555368
Chinese Library Classification (CLC) number
O43 [Optics]
Discipline classification code
070207; 0803
Abstract
Nowadays, image and video are the data types that consume most of the resources of modern communication channels, in both fixed and wireless networks. Thus, it is vital to compress visual data as much as possible, while maintaining some target quality level, to enable efficient storage and transmission. Deep learning (DL) image coding solutions, typically using an auto-encoder architecture, promise significant improvements in compression efficiency. These methods adopt a novel coding approach where the encoder-decoder architecture is mostly based on neural networks, notably with analysis and synthesis transforms learned from a large amount of training data and an appropriate loss function. Only a limited number of works target the subjective evaluation of the compression performance of learning-based image coding solutions. Since learning-based image codecs use complex and highly non-linear generative models, the artifacts present in the decoded images are very different from conventional artifacts such as the blockiness, blurring and ringing distortions typical of traditional DCT block-based and wavelet image coding. In this context, the main objective of this paper is to review, characterize and evaluate some of the most relevant learning-based image coding solutions in the literature. Regarding the subjective quality evaluation, the assessment tests were conducted during the 84th JPEG meeting in Brussels, Belgium, by a mix of expert and naive observers. These subjective tests evaluated the performance of five state-of-the-art learning-based image coding solutions against four conventional, standard image codecs (HEVC, WebP, JPEG 2000 and JPEG), applied to eight natural images at four different coding bitrates. The experimental results show that the subjective quality obtained with the selected learning-based image coding solutions is competitive with that of conventional codecs. Moreover, a thorough inspection of the visual results has revealed some of the typical artifacts encountered in learning-based image coding.
Pages: 13