Dynamic Neural Networks for Adaptive Implicit Image Compression

被引：0

作者：

Huang, Binru ^{[1
]}

Zhang, Yue ^{[1
]}

Hu, Yongzhen ^{[1
]}

Dai, Shaohui ^{[1
]}

Huang, Ziyang ^{[1
]}

Chao, Fei ^{[1
,2
]}

机构：

[1] Xiamen Univ, Sch Informat, Xiamen, Peoples R China

[2] Xiamen Univ, Minist Educ China, Key Lab Multimedia Trusted Percept & Efficient Co, Xiamen 361005, Peoples R China

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XI | 2024年 / 14435卷

关键词：

implicit neural representation; dynamic neural network; multi-level image compression; low-rank matrix synthesis;

D O I：

10.1007/978-981-99-8552-4_34

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Compression with Implicit Neural Presentations (COIN) is a neural network image compression method based on multilayer perceptron (MLP). COIN encodes an image with an MLP that maps pixel positions to RGB values matching, the weights of the MLP are quantized to obtain a code stored as an image. However, this single implicit network structure performs generally when dealing with images of multiple complexities. In this paper, we propose a novel implicit dynamic neural network to process images in a dynamic and adaptive manner. Specifically, this paper uses the Sobel operator to divide the complexity of the images and use it as a criterion to select the network width and depth adaptively. To better fit the image features, this paper concludes with further quantification of the dynamic network parameters and storage matrices. Therefore, only some of the relevant network parameters with their storage matrices are required when storing the images. In training this dynamic network, this paper uses a meta-learning approach for the multi-image compression task. Experimental results show that our method outperforms COIN and JPEG in terms of image reconstruction results for the CIFAR-10 dataset.

引用

页码：427 / 443

页数：17

共 34 条

[21]

Pathak B., 2013, INT J ADV RES ELECT, V2, P4206

[22] Spatially-Adaptive Pixelwise Networks for Fast Image Translation [J].

Shaham, Tamar Rott ;

Gharbi, Michael ;

Zhang, Richard ;

Shechtman, Eli ;

Michaeli, Tomer .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14877-14886

[23] Adversarial Generation of Continuous Images [J].

Skorokhodov, Ivan ;

Ignatyev, Savva ;

Elhoseiny, Mohamed .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :10748-10759

[24] Implicit Neural Representations for Image Compression [J].

Strumpler, Yannick ;

Postels, Janis ;

Yang, Ren ;

Van Gool, Luc ;

Tombari, Federico .

COMPUTER VISION, ECCV 2022, PT XXVI, 2022, 13686 :74-91

[25] Dynamic Embedding Projection-Gated Convolutional Neural Networks for Text Classification [J].

Tan, Zhipeng ;

Chen, Jing ;

Kang, Qi ;

Zhou, MengChu ;

Abusorrah, Abdullah ;

Sedraoui, Khaled .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (03) :973-982

[26]

Tancik M., 2020, Advances in Neural Information Pro-cessing Systems, V33, P7537

[27] Convolutional Networks with Adaptive Inference Graphs [J].

Veit, Andreas ;

Belongie, Serge .

COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 :3-18

[28] SkipNet: Learning Dynamic Routing in Convolutional Networks [J].

Wang, Xin ;

Yu, Fisher ;

Dou, Zi-Yi ;

Darrell, Trevor ;

Gonzalez, Joseph E. .

COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :420-436

[29]

Xu XQ, 2022, Arxiv, DOI arXiv:2103.12716

[30]

Yang B, 2019, ADV NEUR IN, V32

← 1 2 3 4 →