IMPROVING LEARNED INVERTIBLE CODING WITH INVERTIBLE ATTENTION AND BACK-PROJECTION

被引:0
作者
Yang, Zheng [1 ]
Wang, Ronggang [1 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, Sch Elect & Comp Engn, Beijing, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年
基金
中国国家自然科学基金;
关键词
Image Compression; Invertible Neural Networks; Invertible Attention; Back-Projection;
D O I
10.1109/ICIP49359.2023.10222459
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learned image compression (LIC) is developing rapidly. Due to the high-frequency information preservation property of reversible mapping, invertible neural networks (INNs) have been applied to the transformation of LIC and successfully surpassed the latest classical coding standard Versatile Video Coding (VVC) in rate-distortion performance. However, the nonlinearity of the used INNs is limited and the dimensionality reduction method before quantization is not refined enough, respectively resulting in redundancy and information loss. We use attention instead of convolution in the coupling layers of INNs to improve information extraction while maintaining reversibility. In addition, inspired by Back-Projection (BP), we design a BP mechanism in the dimensionality adjustment to reduce information loss. Combined with the advanced channel-wise autoregressive entropy model, our model achieves significant performance improvement compared to the original model and surpasses the SOTA transformation model in the high bitrate range.
引用
收藏
页码:3349 / 3353
页数:5
相关论文
共 23 条
  • [1] Balle J, 2018, INT C LEARN REPR
  • [2] Begaint J., 2020, CompressAI: a PyTorch library and evaluation platform for end-to-end compression research
  • [3] Bellard Fabrice, 2015, BPG image format
  • [4] Cheng ZX, 2020, PROC CVPR IEEE, P7936, DOI 10.1109/CVPR42600.2020.00796
  • [5] Dinh L., 2017, INT C LEARN REPR
  • [6] Dinh Laurent, 2015, ICLR
  • [7] Franzen R, 1999, Kodak lossless true color image suite
  • [8] Gao G., 2021, P IEEE CVF INT C COM, P14677
  • [9] Deep Back-Projection Networks For Super-Resolution
    Haris, Muhammad
    Shakhnarovich, Greg
    Ukita, Norimichi
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1664 - 1673
  • [10] ELIC: Efficient Learned Image Compression with Unevenly Grouped Space-Channel Contextual Adaptive Coding
    He, Dailan
    Yang, Ziming
    Peng, Weikun
    Ma, Rui
    Qin, Hongwei
    Wang, Yan
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5708 - 5717