End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform

被引:104
作者
Ma, Haichuan [1 ]
Liu, Dong [1 ]
Yan, Ning [1 ]
Li, Houqiang [1 ]
Wu, Feng [1 ]
机构
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China
关键词
Image coding; Quantization (signal); Wavelet transforms; Bit rate; Entropy coding; Rate-distortion; Deep network; end-to-end optimization; image compression; lossless compression; lossy compression; wavelet transform;
D O I
10.1109/TPAMI.2020.3026003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Built on deep networks, end-to-end optimized image compression has made impressive progress in the past few years. Previous studies usually adopt a compressive auto-encoder, where the encoder part first converts image into latent features, and then quantizes the features before encoding them into bits. Both the conversion and the quantization incur information loss, resulting in a difficulty to optimally achieve arbitrary compression ratio. We propose iWave++ as a new end-to-end optimized image compression scheme, in which iWave, a trained wavelet-like transform, converts images into coefficients without any information loss. Then the coefficients are optionally quantized and encoded into bits. Different from the previous schemes, iWave++ is versatile: a single model supports both lossless and lossy compression, and also achieves arbitrary compression ratio by simply adjusting the quantization scale. iWave++ also features a carefully designed entropy coding engine to encode the coefficients progressively, and a de-quantization module for lossy compression. Experimental results show that lossy iWave++ achieves state-of-the-art compression efficiency compared with deep network-based methods; on the Kodak dataset, lossy iWave++ leads to 17.34 percent bits saving over BPG; lossless iWave++ achieves comparable or better performance than FLIF. Our code and models are available at https://github.com/mahaichuan/Versatile-Image-Compression.
引用
收藏
页码:1247 / 1263
页数:17
相关论文
共 51 条
  • [1] Agustsson E, 2017, ADV NEUR IN, V30
  • [2] Generative Adversarial Networks for Extreme Learned Image Compression
    Agustsson, Eirikur
    Tschannen, Michael
    Mentzer, Fabian
    Timofte, Radu
    Van Gool, Luc
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 221 - 231
  • [3] NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study
    Agustsson, Eirikur
    Timofte, Radu
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1122 - 1131
  • [4] [Anonymous], 2016, ADV NEURAL INFORM PR
  • [5] Asuni N., 2013, J GRAPH TOOLS, V17, P113, DOI DOI 10.1080/2165347X.2015.1024298
  • [6] Ball J., 2018, Variational image compression with a scale hyperprior
  • [7] Balle J., 2016, End-to-end optimized image compression
  • [8] End-to-end optimization of nonlinear transform codes for perceptual quality
    Balle, Johannes
    Laparra, Valero
    Simoncelli, Eero P.
    [J]. 2016 PICTURE CODING SYMPOSIUM (PCS), 2016,
  • [9] Efficient Variable Rate Image Compression With Multi-Scale Decomposition Network
    Cai, Chunlei
    Chen, Li
    Zhang, Xiaoyun
    Gao, Zhiyong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (12) : 3687 - 3700
  • [10] Variable Rate Deep Image Compression With a Conditional Autoencoder
    Choi, Yoojin
    El-Khamy, Mostafa
    Lee, Jungwon
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3146 - 3154