Image compression based on octave convolution and semantic segmentation

被引:9
|
作者
Liu, Zhiyuan [1 ]
Meng, Lili [1 ,2 ]
Tan, Yanyan [1 ,2 ]
Zhang, Jia [1 ]
Zhang, Huaxiang [1 ,2 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250014, Peoples R China
[2] Shandong Normal Univ, Inst Data Sci & Technol, Jinan 250014, Peoples R China
基金
中国国家自然科学基金;
关键词
Image compression; Deep learning; Octave convolution; Semantic segmentation map;
D O I
10.1016/j.knosys.2021.107254
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lossy image compression based on deep learning usually contains stacking convolutional layers, pooling layers, and nonlinear functions. However, the feature map is obtained by the convolutional layer, which has a lot of redundancy, so we use octave convolution instead of vanilla convolution to improve compression efficiency. The feature map can be divided into high-frequency and lowfrequency information. We use octave convolution to design an automatic codec to decompose the feature map into high-frequency and low-frequency information, which effectively improves the quality of the generated image. First, the semantic segmentation map of the input image is obtained by pre-training SegNet. The ComNet uses the original image and the semantic segmentation map to generate a low-dimensional representation, and the GenNet network utilizes the low-dimensional representation and the semantic segmentation map to estimate images. Then, the residuals between the reconstructed image and the original image are encoded. Finally, the reconstructed image and the decoded residual image are used to obtain the final high-quality reconstruction. Experimental results show that our method outperforms the existing image coding standards in terms of PSNR and MS-SSIM at different bit rates, and the reconstruction of images with complex textures and semantics has more obvious advantages. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Real-Time Semantic Segmentation Network Based on Octave Convolution
    Wang Xin
    Wu Kaijun
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (08)
  • [2] Gaussian Dilated Convolution for Semantic Image Segmentation
    Shen, Falong
    Zeng, Gang
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 324 - 334
  • [3] Image Semantic Segmentation Method Based on Atrous Algorithm and Convolution CRF
    Lv, Linjue
    Li, Xingwei
    Jin, Jiating
    Li, Xinlong
    PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 160 - 165
  • [4] A Novel Upsampling and Context Convolution for Image Semantic Segmentation
    Sediqi, Khwaja Monib
    Lee, Hyo Jong
    SENSORS, 2021, 21 (06) : 1 - 16
  • [5] An enhancement model based on dense atrous and inception convolution for image semantic segmentation
    Zhou, Erjing
    Xu, Xiang
    Xu, Baomin
    Wu, Hongwei
    APPLIED INTELLIGENCE, 2023, 53 (05) : 5519 - 5531
  • [6] A New Method for Retinal Image Semantic Segmentation Based on Fully Convolution Network
    Cao, Yuning
    Ban, Xiaojuan
    Han, Zhishuai
    Shen, Bingyang
    THEORETICAL COMPUTER SCIENCE (NCTCS 2018), 2018, 882 : 27 - 45
  • [7] An enhancement model based on dense atrous and inception convolution for image semantic segmentation
    Erjing Zhou
    Xiang Xu
    Baomin Xu
    Hongwei Wu
    Applied Intelligence, 2023, 53 : 5519 - 5531
  • [8] Image Semantic Segmentation Scheme based on XGBoost combination with Convolution Feature Extraction
    Dai, Zichen
    Liu, Xuewen
    Xu, Chi
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6334 - 6340
  • [9] DSSLIC: DEEP SEMANTIC SEGMENTATION-BASED LAYERED IMAGE COMPRESSION
    Akbari, Mohammad
    Liang, Jie
    Han, Jingning
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2042 - 2046
  • [10] Scale-equivariant convolution for semantic segmentation of depth image
    Marumo, Hidetaka
    Matsubara, Takashi
    IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2024, 15 (01): : 36 - 53