End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform

被引:104
|
作者
Ma, Haichuan [1 ]
Liu, Dong [1 ]
Yan, Ning [1 ]
Li, Houqiang [1 ]
Wu, Feng [1 ]
机构
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China
关键词
Image coding; Quantization (signal); Wavelet transforms; Bit rate; Entropy coding; Rate-distortion; Deep network; end-to-end optimization; image compression; lossless compression; lossy compression; wavelet transform;
D O I
10.1109/TPAMI.2020.3026003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Built on deep networks, end-to-end optimized image compression has made impressive progress in the past few years. Previous studies usually adopt a compressive auto-encoder, where the encoder part first converts image into latent features, and then quantizes the features before encoding them into bits. Both the conversion and the quantization incur information loss, resulting in a difficulty to optimally achieve arbitrary compression ratio. We propose iWave++ as a new end-to-end optimized image compression scheme, in which iWave, a trained wavelet-like transform, converts images into coefficients without any information loss. Then the coefficients are optionally quantized and encoded into bits. Different from the previous schemes, iWave++ is versatile: a single model supports both lossless and lossy compression, and also achieves arbitrary compression ratio by simply adjusting the quantization scale. iWave++ also features a carefully designed entropy coding engine to encode the coefficients progressively, and a de-quantization module for lossy compression. Experimental results show that lossy iWave++ achieves state-of-the-art compression efficiency compared with deep network-based methods; on the Kodak dataset, lossy iWave++ leads to 17.34 percent bits saving over BPG; lossless iWave++ achieves comparable or better performance than FLIF. Our code and models are available at https://github.com/mahaichuan/Versatile-Image-Compression.
引用
收藏
页码:1247 / 1263
页数:17
相关论文
共 50 条
  • [1] End-to-end optimized image compression with the frequency-oriented transform
    Yuefeng Zhang
    Kai Lin
    Machine Vision and Applications, 2024, 35
  • [2] End-to-end optimized image compression with the frequency-oriented transform
    Zhang, Yuefeng
    Lin, Kai
    MACHINE VISION AND APPLICATIONS, 2024, 35 (02)
  • [3] End-to-End Optimized ROI Image Compression
    Cai, Chunlei
    Chen, Li
    Zhang, Xiaoyun
    Gao, Zhiyong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3442 - 3457
  • [4] End-to-End Optimized 360° Image Compression
    Li, Mu
    Li, Jinxing
    Gu, Shuhang
    Wu, Feng
    Zhang, David
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6267 - 6281
  • [5] End-to-end optimized image compression for machines, a study
    Chamain, Lahiru D.
    Racape, Fabien
    Begaint, Jean
    Pushparaja, Akshay
    Feltman, Simon
    2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 163 - 172
  • [6] End-to-end optimized image compression with competition of prior distributions
    Brummer, Benoit
    De Vleeschouwer, Christophe
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1890 - 1894
  • [7] iWave: CNN-Based Wavelet-Like Transform for Image Compression
    Ma, Haichuan
    Liu, Dong
    Xiong, Ruiqin
    Wu, Feng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (07) : 1667 - 1679
  • [8] TRANSFORM SKIP INSPIRED END-TO-END COMPRESSION FOR SCREEN CONTENT IMAGE
    Wang, Meng
    Zhang, Kai
    Zhang, Li
    Wu, Yaojun
    Li, Yue
    Li, Junru
    Wang, Shiqi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3848 - 3852
  • [9] Improved Transform Structures For Learned Wavelet-like Fully Scalable Image Compression
    Li, Xinyue
    Naman, Aous
    Taubman, David
    2023 IEEE 25TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, MMSP, 2023,
  • [10] Wavelet-like Transform with Subbands Fusion in Decoupled Structure for Deep Image Compression
    Ma, Ke
    Wu, Yaojun
    Zhang, Zhaobin
    Esenlik, Semih
    Sun, Xiaoyan
    Zhang, Kai
    Zhang, Li
    2024 PICTURE CODING SYMPOSIUM, PCS 2024, 2024,