End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform

被引:104
|
作者
Ma, Haichuan [1 ]
Liu, Dong [1 ]
Yan, Ning [1 ]
Li, Houqiang [1 ]
Wu, Feng [1 ]
机构
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China
关键词
Image coding; Quantization (signal); Wavelet transforms; Bit rate; Entropy coding; Rate-distortion; Deep network; end-to-end optimization; image compression; lossless compression; lossy compression; wavelet transform;
D O I
10.1109/TPAMI.2020.3026003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Built on deep networks, end-to-end optimized image compression has made impressive progress in the past few years. Previous studies usually adopt a compressive auto-encoder, where the encoder part first converts image into latent features, and then quantizes the features before encoding them into bits. Both the conversion and the quantization incur information loss, resulting in a difficulty to optimally achieve arbitrary compression ratio. We propose iWave++ as a new end-to-end optimized image compression scheme, in which iWave, a trained wavelet-like transform, converts images into coefficients without any information loss. Then the coefficients are optionally quantized and encoded into bits. Different from the previous schemes, iWave++ is versatile: a single model supports both lossless and lossy compression, and also achieves arbitrary compression ratio by simply adjusting the quantization scale. iWave++ also features a carefully designed entropy coding engine to encode the coefficients progressively, and a de-quantization module for lossy compression. Experimental results show that lossy iWave++ achieves state-of-the-art compression efficiency compared with deep network-based methods; on the Kodak dataset, lossy iWave++ leads to 17.34 percent bits saving over BPG; lossless iWave++ achieves comparable or better performance than FLIF. Our code and models are available at https://github.com/mahaichuan/Versatile-Image-Compression.
引用
收藏
页码:1247 / 1263
页数:17
相关论文
共 50 条
  • [21] Towards End-to-End Image Compression and Analysis with Transformers
    Bai, Yuanchao
    Yang, Xu
    Liu, Xianming
    Jiang, Junjun
    Wang, Yaowei
    Ji, Xiangyang
    Gao, Wen
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 104 - 112
  • [22] End-to-End Image Classification and Compression With Variational Autoencoders
    Chamain, Lahiru D.
    Qi, Siyu
    Ding, Zhi
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (21): : 21916 - 21931
  • [23] End-to-end Image Compression with Swin-Transformer
    Wang, Meng
    Zhang, Kai
    Zhang, Li
    Li, Yue
    Li, Junru
    Wang, Yue
    Wang, Shiqi
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [24] Fully Integerized End-to-End Learned Image Compression
    Fang, Yimian
    Fei, Wen
    Li, Shaohui
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 337 - 337
  • [25] Learning End-to-End Lossy Image Compression: A Benchmark
    Hu, Yueyu
    Yang, Wenhan
    Ma, Zhan
    Liu, Jiaying
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4194 - 4211
  • [26] Spatial-Channel Context-Based Entropy Modeling for End-to-end Optimized Image Compression
    Li, Chongxin
    Luo, Jixiang
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 222 - 225
  • [27] Bi-directional prediction for end-to-end optimized video compression
    Racape, Fabien
    Begaint, Jean
    Feltman, Simon
    Pushparaja, Akshay
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
  • [28] End-to-end Optimized Video Compression with MV-Residual Prediction
    Wu, XiangJi
    Zhang, Ziwen
    Feng, Jie
    Zhou, Lei
    Wu, Junmin
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 611 - 614
  • [29] End-to-End Learning-Based Image Compression: A Review
    Chen Jimin
    Lin Zehao
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
  • [30] End-to-end image compression method based on perception metric
    Shuai Liu
    Yingcong Huang
    Huoxiang Yang
    Yongsheng Liang
    Wei Liu
    Signal, Image and Video Processing, 2022, 16 : 1803 - 1810