End-to-End Optimized ROI Image Compression

被引:52
|
作者
Cai, Chunlei [1 ]
Chen, Li [1 ]
Zhang, Xiaoyun [1 ]
Gao, Zhiyong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Dept Elect Engn, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
Region of interest; lossy image compression; object segmentation; ROI coding; rate distortion optimization; convolutional neural network;
D O I
10.1109/TIP.2019.2960869
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Compressing an image with more bits automatically allocated to the region of interest (ROI) than to the background can both protect key information and reduce substantial redundancy. This paper models ROI image compression as an optimization problem of minimizing a weighted sum of the rate of the image and distortion of the ROI. The traditional framework solves this problem by cascading ROI prediction and ROI coding, through which achieving the optimized solution is impossible. To improve coding performance, we propose a novel deep-learning-based unified framework that can achieve rate distortion optimization for ROI compression. Specifically, the proposed framework includes a pair of ROI encoder and decoder convolutional neural networks and a learned entropy codec. The encoder network simultaneously generates multiscale representations that support efficient rate allocation and an implicit ROI mask that guides rate allocation. The proposed framework can automatically complete ROI image compression, and it can be optimized from data in an end-to-end manner. To effectively train the framework by back propagation, we develop a soft-to-hard ROI prediction scheme to make the entire framework differential. To improve visual quality, we propose a hierarchical distortion loss function to protect both pixel-level fidelity for ROI and structural similarity for the entire image. The proposed framework is implemented in two scenarios: salient-target and face-target ROI compression. Comparative experiments demonstrate the advantages of the proposed framework over the traditional framework, including considerably better subjective visual quality, significantly higher objective ROI compression performance and execution efficiency.
引用
收藏
页码:3442 / 3457
页数:16
相关论文
共 50 条
  • [1] End-to-End Deep ROI Image Compression
    Akutsu, Hiroaki
    Naruko, Takahiro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (05): : 1031 - 1038
  • [2] End-to-End Optimized 360° Image Compression
    Li, Mu
    Li, Jinxing
    Gu, Shuhang
    Wu, Feng
    Zhang, David
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6267 - 6281
  • [3] End-to-end optimized image compression for machines, a study
    Chamain, Lahiru D.
    Racape, Fabien
    Begaint, Jean
    Pushparaja, Akshay
    Feltman, Simon
    2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 163 - 172
  • [4] End-to-end optimized image compression with competition of prior distributions
    Brummer, Benoit
    De Vleeschouwer, Christophe
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1890 - 1894
  • [5] End-to-end optimized image compression with the frequency-oriented transform
    Yuefeng Zhang
    Kai Lin
    Machine Vision and Applications, 2024, 35
  • [6] End-to-end optimized image compression with the frequency-oriented transform
    Zhang, Yuefeng
    Lin, Kai
    MACHINE VISION AND APPLICATIONS, 2024, 35 (02)
  • [7] End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform
    Ma, Haichuan
    Liu, Dong
    Yan, Ning
    Li, Houqiang
    Wu, Feng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1247 - 1263
  • [8] Volumetric End-to-End Optimized Compression for Brain Images
    Gao, Shuo
    Zhang, Yueyi
    Liu, Dong
    Xiong, Zhiwei
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 503 - 506
  • [9] Noise-to-Compression Variational Autoencoder for Efficient End-to-End Optimized Image Coding
    Luo, Jixiang
    Li, Shaohui
    Dai, Wenrui
    Xu, Yuhui
    Cheng, De
    Li, Gang
    Xiong, Hongkai
    2020 DATA COMPRESSION CONFERENCE (DCC 2020), 2020, : 33 - 42
  • [10] Efficient end-to-end multispectral image compression
    Depoian, Arthur C., II
    Bailey, Colleen P.
    Guturu, Parthasarathy
    BIG DATA VI: LEARNING, ANALYTICS, AND APPLICATIONS, 2024, 13036