End-to-end optimized image compression for machines, a study

Cited by: 42
Authors
Chamain, Lahiru D. [1, 2]
Racape, Fabien [1]
Begaint, Jean [1]
Pushparaja, Akshay [1]
Feltman, Simon [1]
Affiliations
[1] InterDigital AI Lab, 4410 El Camino Real, Los Altos, CA 94022 USA
[2] Univ Calif Davis, 1 Shields Ave, Davis, CA 95616 USA
DOI
10.1109/DCC50243.2021.00024
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
An increasing share of image and video content is analyzed by machines rather than viewed by humans, so it is becoming relevant to optimize codecs for applications in which the analysis is performed remotely. Unfortunately, conventional coding tools are difficult to specialize for machine tasks because they were originally designed for human perception. Neural-network-based codecs, however, can be trained jointly, end to end, with any convolutional neural network (CNN)-based task model. In this paper, we study an end-to-end framework for efficient image compression for remote machine task analysis, using a chain composed of a compression module and a task algorithm that can be optimized end-to-end. We show that jointly fine-tuning the codec and the task network can significantly improve task accuracy, especially at low bit-rates. Depending on training or deployment constraints, selective fine-tuning can be applied to only the encoder, the decoder, or the task network and still achieve rate-accuracy improvements over an off-the-shelf codec and task network. Our results also demonstrate the flexibility of end-to-end pipelines for practical applications.
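The two ideas in the abstract, a joint rate-accuracy objective and selective fine-tuning of the encoder, decoder, or task network, can be illustrated with a minimal sketch. This is not the authors' implementation: the additive form of the loss, the weight `lam`, and the names `rate_task_loss`, `trainable_modules`, and `MODULES` are all assumptions introduced here for illustration.

```python
from typing import Dict, Iterable

# Hypothetical names for the three parts of the chain: codec encoder,
# codec decoder, and the downstream task network.
MODULES = ("encoder", "decoder", "task")


def rate_task_loss(bits_per_pixel: float, task_loss: float, lam: float) -> float:
    """Assumed joint objective: estimated rate plus a weighted task loss.

    A larger `lam` favors task accuracy over bit-rate.
    """
    return bits_per_pixel + lam * task_loss


def trainable_modules(fine_tune: Iterable[str]) -> Dict[str, bool]:
    """Return, per module, whether its weights are updated (True) or frozen (False)."""
    selected = set(fine_tune)
    unknown = selected - set(MODULES)
    if unknown:
        raise ValueError("unknown module(s): " + ", ".join(sorted(unknown)))
    return {name: name in selected for name in MODULES}


# Example: fine-tune only the encoder, keeping the decoder and task network frozen.
flags = trainable_modules(["encoder"])
loss = rate_task_loss(bits_per_pixel=0.25, task_loss=1.5, lam=2.0)
```

In a real training loop the flags would typically control which parameter groups receive gradient updates, while the scalar loss is what the optimizer minimizes across the whole chain.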
Pages: 163-172
Page count: 10