Dynamic Neural Networks for Adaptive Implicit Image Compression

被引：0

作者：

Huang, Binru ^{[1
]}

Zhang, Yue ^{[1
]}

Hu, Yongzhen ^{[1
]}

Dai, Shaohui ^{[1
]}

Huang, Ziyang ^{[1
]}

Chao, Fei ^{[1
,2
]}

机构：

[1] Xiamen Univ, Sch Informat, Xiamen, Peoples R China

[2] Xiamen Univ, Minist Educ China, Key Lab Multimedia Trusted Percept & Efficient Co, Xiamen 361005, Peoples R China

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XI | 2024年 / 14435卷

关键词：

implicit neural representation; dynamic neural network; multi-level image compression; low-rank matrix synthesis;

D O I：

10.1007/978-981-99-8552-4_34

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Compression with Implicit Neural Presentations (COIN) is a neural network image compression method based on multilayer perceptron (MLP). COIN encodes an image with an MLP that maps pixel positions to RGB values matching, the weights of the MLP are quantized to obtain a code stored as an image. However, this single implicit network structure performs generally when dealing with images of multiple complexities. In this paper, we propose a novel implicit dynamic neural network to process images in a dynamic and adaptive manner. Specifically, this paper uses the Sobel operator to divide the complexity of the images and use it as a criterion to select the network width and depth adaptively. To better fit the image features, this paper concludes with further quantification of the dynamic network parameters and storage matrices. Therefore, only some of the relevant network parameters with their storage matrices are required when storing the images. In training this dynamic network, this paper uses a meta-learning approach for the multi-image compression task. Experimental results show that our method outperforms COIN and JPEG in terms of image reconstruction results for the CIFAR-10 dataset.

引用

页码：427 / 443

页数：17

共 34 条

[1] Image Generators with Conditionally-Independent Pixel Synthesis [J].

Anokhin, I ;

Demochkin, K. ;

Khakhulin, T. ;

Sterkin, G. ;

Lempitsky, V ;

Korzhenkov, D. .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14273-14282

[2] A hybrid evolutionary dynamic neural network for stock market trend analysis and prediction using unscented Kalman filter [J].

Bisoi, Ranjeeta ;

Dash, P. K. .

APPLIED SOFT COMPUTING, 2014, 19 :41-56

[3]

Bolukbasi T, 2017, PR MACH LEARN RES, V70

[4] Learning Continuous Image Representation with Local Implicit Image Function [J].

Chen, Yinbo ;

Liu, Sifei ;

Wang, Xiaolong .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :8624-8634

[5] Learning Implicit Fields for Generative Shape Modeling [J].

Chen, Zhiqin ;

Zhang, Hao .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5932-5941

[6] Processing JPEG-compressed images and documents [J].

de Queiroz, RL .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (12) :1661-1672

[7]

Dupont E, 2022, Arxiv, DOI arXiv:2201.12904

[8]

Dupont E, 2021, Arxiv, DOI arXiv:2103.03123

[9]

Dupont E, 2022, Arxiv, DOI [arXiv:2102.04776, 10.1016/j.lwt.2022.113600, DOI 10.1016/J.LWT.2022.113600]

[10]

Finn C, 2017, PR MACH LEARN RES, V70

← 1 2 3 4 →