End-to-end optimized image compression with the frequency-oriented transform

被引：0

作者：

Zhang, Yuefeng ^{[1
]}

Lin, Kai ^{[2
]}

机构：

[1] Beijing Inst Comp Technol & Applicat, 51th Yongding Rd, Beijing 100039, Peoples R China

[2] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China

来源：

MACHINE VISION AND APPLICATIONS | 2024年 / 35卷 / 02期

关键词：

Image compression; Image processing; Computer vision; Machine learning;

D O I：

10.1007/s00138-023-01507-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Image compression constitutes a significant challenge amid the era of information explosion. Recent studies employing deep learning methods have demonstrated the superior performance of learning-based image compression methods over traditional codecs. However, an inherent challenge associated with these methods lies in their lack of interpretability. Following an analysis of the varying degrees of compression degradation across different frequency bands, we propose the end-to-end optimized image compression model facilitated by the frequency-oriented transform. The proposed end-to-end image compression model consists of four components: spatial sampling, frequency-oriented transform, entropy estimation, and frequency-aware fusion. The frequency-oriented transform separates the original image signal into distinct frequency bands, aligning with the human-interpretable concept. Leveraging the non-overlapping hypothesis, the model enables scalable coding through the selective transmission of arbitrary frequency components. Extensive experiments are conducted to demonstrate that our model outperforms all traditional codecs including next-generation standard H.266/VVC on MS-SSIM metric. Moreover, visual analysis tasks (i.e., object detection and semantic segmentation) are conducted to verify the proposed compression method that could preserve semantic fidelity besides signal-level precision.

引用

页数：14

共 50 条

[1] End-to-end optimized image compression with the frequency-oriented transform
Yuefeng Zhang
Kai Lin
Machine Vision and Applications, 2024, 35
[2] End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform
Ma, Haichuan
Liu, Dong
Yan, Ning
Li, Houqiang
Wu, Feng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1247 - 1263
[3] End-to-End Optimized ROI Image Compression
Cai, Chunlei
Chen, Li
Zhang, Xiaoyun
Gao, Zhiyong
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3442 - 3457
[4] End-to-End Optimized 360° Image Compression
Li, Mu
Li, Jinxing
Gu, Shuhang
Wu, Feng
Zhang, David
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6267 - 6281
[5] End-to-end optimized image compression for machines, a study
Chamain, Lahiru D.
Racape, Fabien
Begaint, Jean
Pushparaja, Akshay
Feltman, Simon
2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 163 - 172
[6] End-to-end optimized image compression with competition of prior distributions
Brummer, Benoit
De Vleeschouwer, Christophe
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1890 - 1894
[7] TRANSFORM SKIP INSPIRED END-TO-END COMPRESSION FOR SCREEN CONTENT IMAGE
Wang, Meng
Zhang, Kai
Zhang, Li
Wu, Yaojun
Li, Yue
Li, Junru
Wang, Shiqi
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3848 - 3852
[8] Volumetric End-to-End Optimized Compression for Brain Images
Gao, Shuo
Zhang, Yueyi
Liu, Dong
Xiong, Zhiwei
2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 503 - 506
[9] Noise-to-Compression Variational Autoencoder for Efficient End-to-End Optimized Image Coding
Luo, Jixiang
Li, Shaohui
Dai, Wenrui
Xu, Yuhui
Cheng, De
Li, Gang
Xiong, Hongkai
2020 DATA COMPRESSION CONFERENCE (DCC 2020), 2020, : 33 - 42
[10] Efficient end-to-end multispectral image compression
Depoian, Arthur C., II
Bailey, Colleen P.
Guturu, Parthasarathy
BIG DATA VI: LEARNING, ANALYTICS, AND APPLICATIONS, 2024, 13036

← 1 2 3 4 5 →