Semantic segmentation using cross-stage feature reweighting and efficient self-attention

被引:1
作者
Ma, Yingdong [1 ]
Lan, Xiaobin [1 ]
机构
[1] Inner Mongolia Univ, Coll Comp Sci, 235 West Daxue Rd, Hohhot, Peoples R China
关键词
Semantic segmentation; Convolutional neural networks; Transformer; Feature fusion and reweighting; NETWORK;
D O I
10.1016/j.imavis.2024.104996
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, vision transformers have demonstrated strong performance in various computer vision tasks. The success of ViTs can be attribute to the ability of capturing long-range dependencies. However, transformer-based approaches often yield segmentation maps with incomplete object structures because of restricted cross-stage information propagation and lack of low-level details. To address these problems, we introduce a CNNtransformer semantic segmentation architecture which adopts a CNN backbone for multi-level feature extraction and a transformer encoder that focuses on global perception learning. Transformer embeddings of all stages are integrated to compute feature weights for dynamic cross-stage feature reweighting. As a result, high-level semantic context and low-level spatial details can be embedded into each stage to preserve multi-level information. An efficient attention-based feature fusion mechanism is developed to combine reweighted transformer embeddings with CNN features to generate segmentation maps with more complete object structure. Different from regular self-attention that has quadratic computational complexity, our efficient self-attention method achieves similar performance with linear complexity. Experimental results on ADE20K and Cityscapes datasets show that the proposed segmentation approach demonstrates superior performance against most state-of-the-art networks.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] CaSaFormer: A cross- and self-attention based lightweight network for large-scale building semantic segmentation
    Li, Jiayi
    Hu, Yuping
    Huang, Xin
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 130
  • [22] UNSUPERVISED DOMAIN ADAPTION FOR REMOTE SENSING SEMANTIC SEGMENTATION WITH SELF-ATTENTION MECHANISM
    Liu, Keming
    Liu, Fang
    Liu, Jia
    Xiao, Liang
    Tang, Xu
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6916 - 6919
  • [23] Cross-form efficient attention pyramidal network for semantic image segmentation
    Maurya, Anamika
    Chand, Satish
    AI COMMUNICATIONS, 2022, 35 (03) : 225 - 242
  • [24] 1D Self-Attention Network for Point Cloud Semantic Segmentation Using Omnidirectional LiDAR
    Suzuki, Takahiro
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 257 - 270
  • [25] Self-Attention Technology in Image Segmentation
    Cao, Fude
    Lu, Xueyun
    INTERNATIONAL CONFERENCE ON INTELLIGENT TRAFFIC SYSTEMS AND SMART CITY (ITSSC 2021), 2022, 12165
  • [26] Research of Self-Attention in Image Segmentation
    Cao, Fude
    Zheng, Chunguang
    Huang, Limin
    Wang, Aihua
    Zhang, Jiong
    Zhou, Feng
    Ju, Haoxue
    Guo, Haitao
    Du, Yuxia
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2022, 15 (01)
  • [27] Using Guided Self-Attention with Local Information for Polyp Segmentation
    Cai, Linghan
    Wu, Meijing
    Chen, Lijiang
    Bai, Wenpei
    Yang, Min
    Lyu, Shuchang
    Zhao, Qi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT IV, 2022, 13434 : 629 - 638
  • [28] A global reweighting approach for cross-domain semantic segmentation
    Zhang, Yuhang
    Tian, Shishun
    Liao, Muxin
    Hua, Guoguang
    Zou, Wenbin
    Xu, Chen
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 130
  • [29] Semantic segmentation of 3D point cloud based on self-attention feature fusion group convolutional neural network
    Yang J.
    Li B.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (07): : 840 - 853
  • [30] A Model for Sea Ice Segmentation based on Feature Pyramid Network and Multi-head Self-attention
    Xu, Yuanxiang
    Feng, Yuan
    Song, Shengyu
    Liu, Jiahao
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 97 - 102