IMPROVING LEARNED INVERTIBLE CODING WITH INVERTIBLE ATTENTION AND BACK-PROJECTION

被引：0

作者：

Yang, Zheng ^{[1
]}

Wang, Ronggang ^{[1
]}

机构：

[1] Peking Univ, Shenzhen Grad Sch, Sch Elect & Comp Engn, Beijing, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年

基金：

中国国家自然科学基金;

关键词：

Image Compression; Invertible Neural Networks; Invertible Attention; Back-Projection;

D O I：

10.1109/ICIP49359.2023.10222459

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learned image compression (LIC) is developing rapidly. Due to the high-frequency information preservation property of reversible mapping, invertible neural networks (INNs) have been applied to the transformation of LIC and successfully surpassed the latest classical coding standard Versatile Video Coding (VVC) in rate-distortion performance. However, the nonlinearity of the used INNs is limited and the dimensionality reduction method before quantization is not refined enough, respectively resulting in redundancy and information loss. We use attention instead of convolution in the coupling layers of INNs to improve information extraction while maintaining reversibility. In addition, inspired by Back-Projection (BP), we design a BP mechanism in the dimensionality adjustment to reduce information loss. Combined with the advanced channel-wise autoregressive entropy model, our model achieves significant performance improvement compared to the original model and surpasses the SOTA transformation model in the high bitrate range.

引用

页码：3349 / 3353

页数：5

共 23 条

[1] Balle J, 2018, INT C LEARN REPR
[2] Begaint J., 2020, CompressAI: a PyTorch library and evaluation platform for end-to-end compression research
[3] Bellard Fabrice, 2015, BPG image format
[4] Cheng ZX, 2020, PROC CVPR IEEE, P7936, DOI 10.1109/CVPR42600.2020.00796
[5] Dinh L., 2017, INT C LEARN REPR
[6] Dinh Laurent, 2015, ICLR
[7] Franzen R, 1999, Kodak lossless true color image suite
[8] Gao G., 2021, P IEEE CVF INT C COM, P14677
[9] Deep Back-Projection Networks For Super-Resolution
Haris, Muhammad
Shakhnarovich, Greg
Ukita, Norimichi
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1664 - 1673
[10] ELIC: Efficient Learned Image Compression with Unevenly Grouped Space-Channel Contextual Adaptive Coding
He, Dailan
Yang, Ziming
Peng, Weikun
Ma, Rui
Qin, Hongwei
Wang, Yan
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5708 - 5717

← 1 2 3 →