End-to-end optimized image compression with the frequency-oriented transform

被引:0
|
作者
Zhang, Yuefeng [1 ]
Lin, Kai [2 ]
机构
[1] Beijing Inst Comp Technol & Applicat, 51th Yongding Rd, Beijing 100039, Peoples R China
[2] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
关键词
Image compression; Image processing; Computer vision; Machine learning;
D O I
10.1007/s00138-023-01507-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image compression constitutes a significant challenge amid the era of information explosion. Recent studies employing deep learning methods have demonstrated the superior performance of learning-based image compression methods over traditional codecs. However, an inherent challenge associated with these methods lies in their lack of interpretability. Following an analysis of the varying degrees of compression degradation across different frequency bands, we propose the end-to-end optimized image compression model facilitated by the frequency-oriented transform. The proposed end-to-end image compression model consists of four components: spatial sampling, frequency-oriented transform, entropy estimation, and frequency-aware fusion. The frequency-oriented transform separates the original image signal into distinct frequency bands, aligning with the human-interpretable concept. Leveraging the non-overlapping hypothesis, the model enables scalable coding through the selective transmission of arbitrary frequency components. Extensive experiments are conducted to demonstrate that our model outperforms all traditional codecs including next-generation standard H.266/VVC on MS-SSIM metric. Moreover, visual analysis tasks (i.e., object detection and semantic segmentation) are conducted to verify the proposed compression method that could preserve semantic fidelity besides signal-level precision.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] End-to-end optimized image compression with the frequency-oriented transform
    Yuefeng Zhang
    Kai Lin
    Machine Vision and Applications, 2024, 35
  • [2] End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform
    Ma, Haichuan
    Liu, Dong
    Yan, Ning
    Li, Houqiang
    Wu, Feng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1247 - 1263
  • [3] End-to-End Optimized ROI Image Compression
    Cai, Chunlei
    Chen, Li
    Zhang, Xiaoyun
    Gao, Zhiyong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3442 - 3457
  • [4] End-to-End Optimized 360° Image Compression
    Li, Mu
    Li, Jinxing
    Gu, Shuhang
    Wu, Feng
    Zhang, David
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6267 - 6281
  • [5] End-to-end optimized image compression for machines, a study
    Chamain, Lahiru D.
    Racape, Fabien
    Begaint, Jean
    Pushparaja, Akshay
    Feltman, Simon
    2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 163 - 172
  • [6] End-to-end optimized image compression with competition of prior distributions
    Brummer, Benoit
    De Vleeschouwer, Christophe
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1890 - 1894
  • [7] TRANSFORM SKIP INSPIRED END-TO-END COMPRESSION FOR SCREEN CONTENT IMAGE
    Wang, Meng
    Zhang, Kai
    Zhang, Li
    Wu, Yaojun
    Li, Yue
    Li, Junru
    Wang, Shiqi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3848 - 3852
  • [8] Volumetric End-to-End Optimized Compression for Brain Images
    Gao, Shuo
    Zhang, Yueyi
    Liu, Dong
    Xiong, Zhiwei
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 503 - 506
  • [9] Noise-to-Compression Variational Autoencoder for Efficient End-to-End Optimized Image Coding
    Luo, Jixiang
    Li, Shaohui
    Dai, Wenrui
    Xu, Yuhui
    Cheng, De
    Li, Gang
    Xiong, Hongkai
    2020 DATA COMPRESSION CONFERENCE (DCC 2020), 2020, : 33 - 42
  • [10] Efficient end-to-end multispectral image compression
    Depoian, Arthur C., II
    Bailey, Colleen P.
    Guturu, Parthasarathy
    BIG DATA VI: LEARNING, ANALYTICS, AND APPLICATIONS, 2024, 13036