End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform

被引：104

作者：

Ma, Haichuan ^{[1
]}

Liu, Dong ^{[1
]}

Yan, Ning ^{[1
]}

Li, Houqiang ^{[1
]}

Wu, Feng ^{[1
]}

机构：

[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2022年 / 44卷 / 03期

关键词：

Image coding; Quantization (signal); Wavelet transforms; Bit rate; Entropy coding; Rate-distortion; Deep network; end-to-end optimization; image compression; lossless compression; lossy compression; wavelet transform;

D O I：

10.1109/TPAMI.2020.3026003

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Built on deep networks, end-to-end optimized image compression has made impressive progress in the past few years. Previous studies usually adopt a compressive auto-encoder, where the encoder part first converts image into latent features, and then quantizes the features before encoding them into bits. Both the conversion and the quantization incur information loss, resulting in a difficulty to optimally achieve arbitrary compression ratio. We propose iWave++ as a new end-to-end optimized image compression scheme, in which iWave, a trained wavelet-like transform, converts images into coefficients without any information loss. Then the coefficients are optionally quantized and encoded into bits. Different from the previous schemes, iWave++ is versatile: a single model supports both lossless and lossy compression, and also achieves arbitrary compression ratio by simply adjusting the quantization scale. iWave++ also features a carefully designed entropy coding engine to encode the coefficients progressively, and a de-quantization module for lossy compression. Experimental results show that lossy iWave++ achieves state-of-the-art compression efficiency compared with deep network-based methods; on the Kodak dataset, lossy iWave++ leads to 17.34 percent bits saving over BPG; lossless iWave++ achieves comparable or better performance than FLIF. Our code and models are available at https://github.com/mahaichuan/Versatile-Image-Compression.

引用

页码：1247 / 1263

页数：17

共 50 条

[31] End-to-End Learning-Based Image Compression: A Review
Chen Jimin
Lin Zehao
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
[32] End-to-End Learned Image Compression with Augmented Normalizing Flows
Ho, Yung-Han
Chan, Chih-Chun
Peng, Wen-Hsiao
Hang, Hsueh-Ming
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1931 - 1935
[33] An end-to-end spike-based image compression architecture
Doutsi, Effrosyni
Antonini, Marc
Tsakalides, Panagiotis
2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 818 - 820
[34] End-to-end image compression method based on perception metric
Liu, Shuai
Huang, Yingcong
Yang, Huoxiang
Liang, Yongsheng
Liu, Wei
SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (07) : 1803 - 1810
[35] Estimating the resize parameter in end-to-end learned image compression
Chen, Li-Heng
Bampis, Christos G.
Li, Zhi
Krasula, Lukas
Bovik, Alan C.
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 135
[36] A Reference Resource Based End-to-End Image Compression Scheme
Yin, Wenbin
Fan, Xiaopeng
Shi, Yunhui
Zuo, Wangmeng
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 534 - 544
[37] End-to-End Image Patch Quality Assessment for Image/Video With Compression Artifacts
Tung Thanh Pham
Xiem Van Hoang
Nghia Trung Nguyen
Duong Trieu Dinh
Le Thanh Ha
IEEE ACCESS, 2020, 8 : 215157 - 215172
[38] End-to-end optimized nonlinear Fourier transform-based coherent communications
Gaiarin, Simone
Jones, Rasmus T.
Da Ros, Francesco
Zibar, Darko
2020 CONFERENCE ON LASERS AND ELECTRO-OPTICS (CLEO), 2020,
[39] Compression of End-to-End Models
Pang, Ruoming
Sainath, Tara N.
Prabhavalkar, Rohit
Gupta, Suyog
Wu, Yonghui
Zhang, Shuyuan
Chiu, Chung-cheng
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 27 - 31
[40] Exploration of Learned Lifting-Based Transform Structures for Fully Scalable and Accessible Wavelet-Like Image Compression
Li, Xinyue
Naman, Aous
Taubman, David
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6173 - 6188

← 1 2 3 4 5 →