Image compression based on octave convolution and semantic segmentation

Cited by: 9

Authors:

Liu, Zhiyuan [1]

Meng, Lili [1,2]

Tan, Yanyan [1,2]

Zhang, Jia [1]

Zhang, Huaxiang [1,2]

Affiliations:

[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250014, Peoples R China

[2] Shandong Normal Univ, Inst Data Sci & Technol, Jinan 250014, Peoples R China

Source:

KNOWLEDGE-BASED SYSTEMS | 2021 / Vol. 228

Funding:

National Natural Science Foundation of China;

Keywords:

Image compression; Deep learning; Octave convolution; Semantic segmentation map;

DOI:

10.1016/j.knosys.2021.107254

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Lossy image compression based on deep learning usually stacks convolutional layers, pooling layers, and nonlinear functions. However, the feature maps produced by convolutional layers contain a lot of redundancy, so we use octave convolution instead of vanilla convolution to improve compression efficiency. A feature map can be divided into high-frequency and low-frequency information. We use octave convolution to design an automatic codec that decomposes the feature map into high-frequency and low-frequency information, which effectively improves the quality of the generated image. First, the semantic segmentation map of the input image is obtained from a pre-trained SegNet. ComNet uses the original image and the semantic segmentation map to generate a low-dimensional representation, and GenNet uses the low-dimensional representation and the semantic segmentation map to estimate the image. Then, the residual between the reconstructed image and the original image is encoded. Finally, the reconstructed image and the decoded residual image are combined to obtain the final high-quality reconstruction. Experimental results show that our method outperforms existing image coding standards in terms of PSNR and MS-SSIM at different bit rates, and that its advantage is more pronounced when reconstructing images with complex textures and semantics. (C) 2021 Elsevier B.V. All rights reserved.
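
The layered flow described in the abstract (compact representation, generated estimate, encoded residual, final reconstruction) can be illustrated with a minimal NumPy sketch. Everything here is an assumption made for illustration: average pooling stands in for ComNet, nearest-neighbour upsampling stands in for GenNet, uniform quantization stands in for the residual codec, and the semantic segmentation input and octave convolution are omitted entirely. The function names are hypothetical and do not come from the paper.

```python
import numpy as np

def compact(image, factor=2):
    """Stand-in for ComNet (assumption): average-pool to a smaller representation."""
    h, w = image.shape
    return image.reshape(h // factor, factor, w // factor, factor).mean(axis=(1, 3))

def generate(compact_rep, factor=2):
    """Stand-in for GenNet (assumption): nearest-neighbour upsampling to full size."""
    return np.repeat(np.repeat(compact_rep, factor, axis=0), factor, axis=1)

def encode_residual(residual, step=4.0):
    """Stand-in for the residual codec (assumption): uniform quantization."""
    return np.round(residual / step).astype(np.int32)

def decode_residual(symbols, step=4.0):
    """Map quantized symbols back to residual values."""
    return symbols.astype(np.float64) * step

def layered_codec(image, factor=2, step=4.0):
    """Return the coarse estimate and the residual-corrected reconstruction."""
    compact_rep = compact(image, factor)             # low-dimensional representation
    coarse = generate(compact_rep, factor)           # estimated image
    symbols = encode_residual(image - coarse, step)  # encoded residual
    final = coarse + decode_residual(symbols, step)  # final reconstruction
    return coarse, final

rng = np.random.default_rng(0)
image = rng.uniform(0, 255, size=(8, 8))  # synthetic test image, not real data
coarse, final = layered_codec(image)
```

In this toy setting the residual layer bounds the per-pixel error of the final reconstruction by half the quantization step, which is the general motivation for adding a residual layer on top of a generated estimate.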

Pages: 11
