Optimized Decoupled Structure with Non-Local Attention for Deep Image Compression

Citations: 0
Authors
Zhang, Xuanye [1]
Zhang, Zhaobin [2]
Wu, Yaojun [3]
Esenlik, Semih [2]
Sun, Xiaoyan [1]
Zhang, Kai [2]
Zhang, Li [2]
Affiliations
[1] University of Science and Technology of China, Hefei, People's Republic of China
[2] Bytedance Inc, San Diego, CA 95110 USA
[3] Bytedance Inc, Beijing, People's Republic of China
Source
2024 IEEE International Conference on Image Processing (ICIP) | 2024
Funding
National Natural Science Foundation of China
Keywords
Decoupled; end-to-end; neural networks; non-local attention; IEEE 1857.11; image compression
DOI
10.1109/ICIP51287.2024.10648246
Abstract
Recently, a decoupled framework for learning-based image compression was proposed and adopted into the JPEG AI image coding standard developed by ISO/IEC WG1. The decoupled structure disentangles the sample reconstruction process from the entropy decoding process, which makes decoding extremely fast. The corresponding techniques constitute essential parts of the JPEG AI verification model software. However, its analysis and synthesis transforms are relatively simple: they are built from stacked convolution layers and may therefore lack the capacity to capture data correlations. In this work, we enhance the transform networks by introducing the non-local attention mechanism, which has proven effective in image compression tasks. The proposed framework thus combines the fast decoding of the decoupled architecture with the strong transform capability of non-local attention, making it a stronger candidate for practical end-to-end image codec deployment. Experimental results on the Kodak test set and the JPEG AI CfP test set show that our method achieves better BD-rate performance than the original Decoupled-anchor and significantly faster decoding than NIC. The proposed solution was adopted by the IEEE 1857.11 Working Subgroup (1857.11 WSG) at its 10th Meeting for the development of neural network-based image coding standards.
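The abstract does not describe the attention design in detail. As a rough illustration of what a non-local attention step computes, the sketch below implements a generic embedded-Gaussian non-local block with a residual connection, in plain Python. It is not the network from the paper: the function names, the weight matrices `w_theta`, `w_phi`, `w_g`, `w_out`, and the flattened positions-by-channels layout are all assumptions made for this example.

```python
import math

def matmul(a, b):
    """Multiply an (n x k) matrix by a (k x m) matrix, both given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def non_local_block(x, w_theta, w_phi, w_g, w_out):
    """Hypothetical non-local block.

    x is an N x C matrix: N spatial positions (a flattened H*W feature map),
    each with C channels. Returns an N x C matrix: x plus the projected
    attention output (residual connection).
    """
    theta = matmul(x, w_theta)  # queries, N x D
    phi = matmul(x, w_phi)      # keys, N x D
    g = matmul(x, w_g)          # values, N x D
    phi_t = [list(col) for col in zip(*phi)]  # D x N
    # Similarity between every pair of positions, regardless of distance.
    scores = matmul(theta, phi_t)             # N x N
    weights = [softmax(r) for r in scores]    # each row sums to 1
    y = matmul(weights, g)                    # N x D
    out = matmul(y, w_out)                    # N x C
    return [[xi + oi for xi, oi in zip(x_row, o_row)]
            for x_row, o_row in zip(x, out)]

# Toy usage: 3 positions, 2 channels, identity projections (assumed values).
identity = [[1.0, 0.0], [0.0, 1.0]]
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
enhanced = non_local_block(features, identity, identity, identity, identity)
```

Unlike a stacked convolution, whose output at a position depends only on a local neighbourhood, every output row here is a weighted mix of all positions, which is the property the abstract appeals to when it refers to capturing data correlations. The N x N score matrix is also why such blocks are costly on large feature maps.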
Pages: 3681-3687
Number of pages: 7