End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform

Cited by: 104

Authors:

Ma, Haichuan [1]

Liu, Dong [1]

Yan, Ning [1]

Li, Houqiang [1]

Wu, Feng [1]

Affiliation:

[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2022 / Vol. 44 / No. 03

Keywords:

Image coding; Quantization (signal); Wavelet transforms; Bit rate; Entropy coding; Rate-distortion; Deep network; end-to-end optimization; image compression; lossless compression; lossy compression; wavelet transform;

DOI:

10.1109/TPAMI.2020.3026003

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Built on deep networks, end-to-end optimized image compression has made impressive progress in the past few years. Previous studies usually adopt a compressive auto-encoder, where the encoder part first converts image into latent features, and then quantizes the features before encoding them into bits. Both the conversion and the quantization incur information loss, resulting in a difficulty to optimally achieve arbitrary compression ratio. We propose iWave++ as a new end-to-end optimized image compression scheme, in which iWave, a trained wavelet-like transform, converts images into coefficients without any information loss. Then the coefficients are optionally quantized and encoded into bits. Different from the previous schemes, iWave++ is versatile: a single model supports both lossless and lossy compression, and also achieves arbitrary compression ratio by simply adjusting the quantization scale. iWave++ also features a carefully designed entropy coding engine to encode the coefficients progressively, and a de-quantization module for lossy compression. Experimental results show that lossy iWave++ achieves state-of-the-art compression efficiency compared with deep network-based methods; on the Kodak dataset, lossy iWave++ leads to 17.34 percent bits saving over BPG; lossless iWave++ achieves comparable or better performance than FLIF. Our code and models are available at https://github.com/mahaichuan/Versatile-Image-Compression.
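The idea of a transform that loses no information, followed by optional quantization, can be illustrated with a small sketch. The code below is not the trained iWave transform from the paper: it assumes a fixed, hand-written integer Haar-like lifting step on a 1-D signal, and the names lifting_forward, lifting_inverse, quantize, dequantize and roundtrip are hypothetical helpers introduced only for this illustration. It shows that a lifting structure is exactly invertible on integers, so a quantization scale of 1 gives lossless reconstruction while a larger scale trades accuracy for coarser coefficients.

```python
import numpy as np

def lifting_forward(x):
    """One level of an integer Haar-like lifting step on an even-length 1-D signal."""
    x = np.asarray(x, dtype=np.int64)
    even, odd = x[0::2], x[1::2]
    high = odd - even           # predict step: detail (high-pass) coefficients
    low = even + (high >> 1)    # update step: floor(high / 2) keeps everything integer
    return low, high

def lifting_inverse(low, high):
    """Undo lifting_forward exactly by reversing the two steps."""
    even = low - (high >> 1)
    odd = high + even
    x = np.empty(low.size + high.size, dtype=np.int64)
    x[0::2], x[1::2] = even, odd
    return x

def quantize(coeffs, scale):
    """Uniform scalar quantization; scale == 1 leaves integer coefficients unchanged."""
    return np.round(coeffs / scale).astype(np.int64)

def dequantize(q, scale):
    """Plain rescaling (the paper uses a learned de-quantization module instead)."""
    return q * scale

def roundtrip(x, scale):
    """Transform, quantize, dequantize, and inverse-transform a signal."""
    low, high = lifting_forward(x)
    low_hat = dequantize(quantize(low, scale), scale)
    high_hat = dequantize(quantize(high, scale), scale)
    return lifting_inverse(low_hat, high_hat)

signal = np.array([52, 55, 61, 66, 70, 61, 64, 73])
print(roundtrip(signal, 1))  # scale 1: reconstruction equals the input
print(roundtrip(signal, 4))  # scale 4: approximate reconstruction
```

In this sketch only the quantization scale changes between the lossless and lossy cases, which mirrors the single-model behaviour described in the abstract; the entropy coding stage is omitted entirely.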

Pages: 1247 - 1263

Page count: 17

50 items in total

[21] Towards End-to-End Image Compression and Analysis with Transformers
Bai, Yuanchao
Yang, Xu
Liu, Xianming
Jiang, Junjun
Wang, Yaowei
Ji, Xiangyang
Gao, Wen
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 104 - 112
[22] End-to-End Image Classification and Compression With Variational Autoencoders
Chamain, Lahiru D.
Qi, Siyu
Ding, Zhi
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (21) : 21916 - 21931
[23] End-to-end Image Compression with Swin-Transformer
Wang, Meng
Zhang, Kai
Zhang, Li
Li, Yue
Li, Junru
Wang, Yue
Wang, Shiqi
2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022
[24] Fully Integerized End-to-End Learned Image Compression
Fang, Yimian
Fei, Wen
Li, Shaohui
Dai, Wenrui
Li, Chenglin
Zou, Junni
Xiong, Hongkai
2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 337 - 337
[25] Learning End-to-End Lossy Image Compression: A Benchmark
Hu, Yueyu
Yang, Wenhan
Ma, Zhan
Liu, Jiaying
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) : 4194 - 4211
[26] Spatial-Channel Context-Based Entropy Modeling for End-to-end Optimized Image Compression
Li, Chongxin
Luo, Jixiang
Dai, Wenrui
Li, Chenglin
Zou, Junni
Xiong, Hongkai
2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 222 - 225
[27] Bi-directional prediction for end-to-end optimized video compression
Racape, Fabien
Begaint, Jean
Feltman, Simon
Pushparaja, Akshay
APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
[28] End-to-end Optimized Video Compression with MV-Residual Prediction
Wu, XiangJi
Zhang, Ziwen
Feng, Jie
Zhou, Lei
Wu, Junmin
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 611 - 614
[29] End-to-End Learning-Based Image Compression: A Review
Chen Jimin
Lin Zehao
LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
[30] End-to-end image compression method based on perception metric
Shuai Liu
Yingcong Huang
Huoxiang Yang
Yongsheng Liang
Wei Liu
Signal, Image and Video Processing, 2022, 16 : 1803 - 1810