End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform

被引:104
|
作者
Ma, Haichuan [1 ]
Liu, Dong [1 ]
Yan, Ning [1 ]
Li, Houqiang [1 ]
Wu, Feng [1 ]
机构
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China
关键词
Image coding; Quantization (signal); Wavelet transforms; Bit rate; Entropy coding; Rate-distortion; Deep network; end-to-end optimization; image compression; lossless compression; lossy compression; wavelet transform;
D O I
10.1109/TPAMI.2020.3026003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Built on deep networks, end-to-end optimized image compression has made impressive progress in the past few years. Previous studies usually adopt a compressive auto-encoder, where the encoder part first converts image into latent features, and then quantizes the features before encoding them into bits. Both the conversion and the quantization incur information loss, resulting in a difficulty to optimally achieve arbitrary compression ratio. We propose iWave++ as a new end-to-end optimized image compression scheme, in which iWave, a trained wavelet-like transform, converts images into coefficients without any information loss. Then the coefficients are optionally quantized and encoded into bits. Different from the previous schemes, iWave++ is versatile: a single model supports both lossless and lossy compression, and also achieves arbitrary compression ratio by simply adjusting the quantization scale. iWave++ also features a carefully designed entropy coding engine to encode the coefficients progressively, and a de-quantization module for lossy compression. Experimental results show that lossy iWave++ achieves state-of-the-art compression efficiency compared with deep network-based methods; on the Kodak dataset, lossy iWave++ leads to 17.34 percent bits saving over BPG; lossless iWave++ achieves comparable or better performance than FLIF. Our code and models are available at https://github.com/mahaichuan/Versatile-Image-Compression.
引用
收藏
页码:1247 / 1263
页数:17
相关论文
共 50 条
  • [31] End-to-End Learning-Based Image Compression: A Review
    Chen Jimin
    Lin Zehao
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
  • [32] End-to-End Learned Image Compression with Augmented Normalizing Flows
    Ho, Yung-Han
    Chan, Chih-Chun
    Peng, Wen-Hsiao
    Hang, Hsueh-Ming
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1931 - 1935
  • [33] An end-to-end spike-based image compression architecture
    Doutsi, Effrosyni
    Antonini, Marc
    Tsakalides, Panagiotis
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 818 - 820
  • [34] End-to-end image compression method based on perception metric
    Liu, Shuai
    Huang, Yingcong
    Yang, Huoxiang
    Liang, Yongsheng
    Liu, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (07) : 1803 - 1810
  • [35] Estimating the resize parameter in end-to-end learned image compression
    Chen, Li-Heng
    Bampis, Christos G.
    Li, Zhi
    Krasula, Lukas
    Bovik, Alan C.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 135
  • [36] A Reference Resource Based End-to-End Image Compression Scheme
    Yin, Wenbin
    Fan, Xiaopeng
    Shi, Yunhui
    Zuo, Wangmeng
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 534 - 544
  • [37] End-to-End Image Patch Quality Assessment for Image/Video With Compression Artifacts
    Tung Thanh Pham
    Xiem Van Hoang
    Nghia Trung Nguyen
    Duong Trieu Dinh
    Le Thanh Ha
    IEEE ACCESS, 2020, 8 : 215157 - 215172
  • [38] End-to-end optimized nonlinear Fourier transform-based coherent communications
    Gaiarin, Simone
    Jones, Rasmus T.
    Da Ros, Francesco
    Zibar, Darko
    2020 CONFERENCE ON LASERS AND ELECTRO-OPTICS (CLEO), 2020,
  • [39] Compression of End-to-End Models
    Pang, Ruoming
    Sainath, Tara N.
    Prabhavalkar, Rohit
    Gupta, Suyog
    Wu, Yonghui
    Zhang, Shuyuan
    Chiu, Chung-cheng
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 27 - 31
  • [40] Exploration of Learned Lifting-Based Transform Structures for Fully Scalable and Accessible Wavelet-Like Image Compression
    Li, Xinyue
    Naman, Aous
    Taubman, David
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6173 - 6188