SEFANet: Semantic enhanced with feature alignment network for semantic segmentation

被引:1
作者
Wang, Dakai [1 ]
An, Wenhao [1 ]
Ma, Jianxin [1 ]
Wang, Li [2 ]
机构
[1] Henan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China
[2] Henan Univ, Eurasia Int Sch, Kaifeng 475001, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic segmentation; Perceptual enhancement; Group convolution; Feature alignment;
D O I
10.1016/j.dsp.2024.104639
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To address the issues of poor segmentation accuracy and insensitivity to details in semantic segmentation, this paper proposes a novel image semantic segmentation framework SEFANet. Specifically, SEFANet adopts encoder-decoder structure, and incorporates a novel perceptual enhancement mechanism called Multi-scale Spatial Integration Module (MSIM) at the encoder. MSIM is based on group convolution to boost spatial semantic features and refine spatial-gradient semantic features within the multi-scale structure. This module enhances feature extraction across different network levels, leading to improved edge detection and segmentation abilities. In the decoder, SEFANet introduces a pixel-level Interleaved Feature Alignment Module (IFAM), which leverages rich semantic information in low-dimensional features and the strategy of Semantic Offset Field. Meanwhile, IFAM warps the high-dimensional feature map into low-dimensional features, completing the calibration process through convolution operations. Experimental results on the Pascal VOC2012 val dataset and the Cityscapes val dataset confirm the effectiveness and generalization of the proposed semantic segmentation. Additionally, the results further demonstrate that SEFANet improves the poor segmentation accuracy and insensitivity to details, and achieves a competitive performance compared with other semantic segmentation methods.
引用
收藏
页数:12
相关论文
共 54 条
  • [1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [2] Loss Max-Pooling for Semantic Image Segmentation
    Bulo, Samuel Rota
    Neuhold, Gerhard
    Kontschieder, Peter
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7082 - 7091
  • [3] Chen L.-C., 2017, P IEEE CVF C COMP VI
  • [4] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [5] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [6] SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
    Chen, Long
    Zhang, Hanwang
    Xiao, Jun
    Nie, Liqiang
    Shao, Jian
    Liu, Wei
    Chua, Tat-Seng
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6298 - 6306
  • [7] Activating More Pixels in Image Super-Resolution Transformer
    Chen, Xiangyu
    Wang, Xintao
    Zhou, Jiantao
    Qiao, Yu
    Dong, Chao
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22367 - 22377
  • [8] Xception: Deep Learning with Depthwise Separable Convolutions
    Chollet, Francois
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1800 - 1807
  • [9] Chu XX, 2021, ADV NEUR IN
  • [10] Dual Attention Network for Scene Segmentation
    Fu, Jun
    Liu, Jing
    Tian, Haijie
    Li, Yong
    Bao, Yongjun
    Fang, Zhiwei
    Lu, Hanqing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149