SEFANet: Semantic enhanced with feature alignment network for semantic segmentation

被引:1
|
作者
Wang, Dakai [1 ]
An, Wenhao [1 ]
Ma, Jianxin [1 ]
Wang, Li [2 ]
机构
[1] Henan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China
[2] Henan Univ, Eurasia Int Sch, Kaifeng 475001, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic segmentation; Perceptual enhancement; Group convolution; Feature alignment;
D O I
10.1016/j.dsp.2024.104639
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To address the issues of poor segmentation accuracy and insensitivity to details in semantic segmentation, this paper proposes a novel image semantic segmentation framework SEFANet. Specifically, SEFANet adopts encoder-decoder structure, and incorporates a novel perceptual enhancement mechanism called Multi-scale Spatial Integration Module (MSIM) at the encoder. MSIM is based on group convolution to boost spatial semantic features and refine spatial-gradient semantic features within the multi-scale structure. This module enhances feature extraction across different network levels, leading to improved edge detection and segmentation abilities. In the decoder, SEFANet introduces a pixel-level Interleaved Feature Alignment Module (IFAM), which leverages rich semantic information in low-dimensional features and the strategy of Semantic Offset Field. Meanwhile, IFAM warps the high-dimensional feature map into low-dimensional features, completing the calibration process through convolution operations. Experimental results on the Pascal VOC2012 val dataset and the Cityscapes val dataset confirm the effectiveness and generalization of the proposed semantic segmentation. Additionally, the results further demonstrate that SEFANet improves the poor segmentation accuracy and insensitivity to details, and achieves a competitive performance compared with other semantic segmentation methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Enhanced Feature Pyramid Network for Semantic Segmentation
    Ye, Mucong
    Ouyang, Jingpeng
    Chen, Ge
    Zhang, Jing
    Yu, Xiaogang
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3209 - 3216
  • [2] Enhanced Feature Alignment for Unsupervised Domain Adaptation of Semantic Segmentation
    Chen, Tao
    Wang, Shui-Hua
    Wang, Qiong
    Zhang, Zheng
    Xie, Guo-Sen
    Tang, Zhenmin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1042 - 1054
  • [3] Enhanced-feature pyramid network for semantic segmentation
    Quyen, Van Toan
    Lee, Jong Hyuk
    Kim, Min Young
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 782 - 787
  • [4] Distilled Heterogeneous Feature Alignment Network for SAR Image Semantic Segmentation
    Gao, Mengyu
    Xu, Jiping
    Yu, Jiabin
    Dong, Qiulei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [5] Salient feature network for semantic segmentation
    Zhou, Yan
    Zhou, Zhen
    Zhou, Haibin
    Mu, Jinzhen
    Wang, Dongli
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (03) : 763 - 771
  • [6] Salient feature network for semantic segmentation
    Yan Zhou
    Zhen Zhou
    Haibin Zhou
    Jinzhen Mu
    Dongli Wang
    Signal, Image and Video Processing, 2022, 16 : 763 - 771
  • [7] Feature Enhanced Projection Network for Zero-shot Semantic Segmentation
    Lu, Hongchao
    Fang, Longwei
    Lin, Matthieu
    Deng, Zhidong
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 14011 - 14017
  • [8] FPANet: Feature-enhanced position attention network for semantic segmentation
    Haixia Xu
    Shuailong Wang
    Yunjia Huang
    Wei Zhou
    Qi Chen
    Dongbo Zhang
    Machine Vision and Applications, 2021, 32
  • [9] ESSN: Enhanced Semantic Segmentation Network by Residual Concatenation of Feature Maps
    Kim, Dong Seop
    Arsalan, Muhammad
    Owais, Muhammad
    Park, Kang Ryoung
    IEEE ACCESS, 2020, 8 : 21363 - 21379
  • [10] Learning Implicit Feature Alignment Function for Semantic Segmentation
    Hu, Hanzhe
    Chen, Yinbo
    Xu, Jiarui
    Borse, Shubhankar
    Cai, Hong
    Porikli, Fatih
    Wang, Xiaolong
    COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 487 - 505