End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform

被引:119
作者
Ma, Haichuan [1 ]
Liu, Dong [1 ]
Yan, Ning [1 ]
Li, Houqiang [1 ]
Wu, Feng [1 ]
机构
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China
关键词
Image coding; Quantization (signal); Wavelet transforms; Bit rate; Entropy coding; Rate-distortion; Deep network; end-to-end optimization; image compression; lossless compression; lossy compression; wavelet transform;
D O I
10.1109/TPAMI.2020.3026003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Built on deep networks, end-to-end optimized image compression has made impressive progress in the past few years. Previous studies usually adopt a compressive auto-encoder, where the encoder part first converts image into latent features, and then quantizes the features before encoding them into bits. Both the conversion and the quantization incur information loss, resulting in a difficulty to optimally achieve arbitrary compression ratio. We propose iWave++ as a new end-to-end optimized image compression scheme, in which iWave, a trained wavelet-like transform, converts images into coefficients without any information loss. Then the coefficients are optionally quantized and encoded into bits. Different from the previous schemes, iWave++ is versatile: a single model supports both lossless and lossy compression, and also achieves arbitrary compression ratio by simply adjusting the quantization scale. iWave++ also features a carefully designed entropy coding engine to encode the coefficients progressively, and a de-quantization module for lossy compression. Experimental results show that lossy iWave++ achieves state-of-the-art compression efficiency compared with deep network-based methods; on the Kodak dataset, lossy iWave++ leads to 17.34 percent bits saving over BPG; lossless iWave++ achieves comparable or better performance than FLIF. Our code and models are available at https://github.com/mahaichuan/Versatile-Image-Compression.
引用
收藏
页码:1247 / 1263
页数:17
相关论文
共 51 条
[1]  
Agustsson E, 2017, ADV NEUR IN, V30
[2]   Generative Adversarial Networks for Extreme Learned Image Compression [J].
Agustsson, Eirikur ;
Tschannen, Michael ;
Mentzer, Fabian ;
Timofte, Radu ;
Van Gool, Luc .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :221-231
[3]   NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].
Agustsson, Eirikur ;
Timofte, Radu .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131
[4]  
[Anonymous], 2014, arXiv
[5]  
[Anonymous], 2015, Variable rate image compression with recurrent neural networks
[6]  
[Anonymous], 2016, NEURAL INFORM PROCES
[7]  
Asuni N., 2013, Journal of Graphics Tools, V17, P113, DOI DOI 10.1080/2165347X.2015.1024298
[8]  
Ball J., 2018, PROC INT C LEARN REP
[9]  
Balle J., 2016, End-to-end Optimized Image Compression
[10]   End-to-end optimization of nonlinear transform codes for perceptual quality [J].
Balle, Johannes ;
Laparra, Valero ;
Simoncelli, Eero P. .
2016 PICTURE CODING SYMPOSIUM (PCS), 2016,