Semantic segmentation using cross-stage feature reweighting and efficient self-attention

被引：1

作者：

Ma, Yingdong ^{[1
]}

Lan, Xiaobin ^{[1
]}

机构：

[1] Inner Mongolia Univ, Coll Comp Sci, 235 West Daxue Rd, Hohhot, Peoples R China

来源：

IMAGE AND VISION COMPUTING | 2024年 / 145卷

关键词：

Semantic segmentation; Convolutional neural networks; Transformer; Feature fusion and reweighting; NETWORK;

D O I：

10.1016/j.imavis.2024.104996

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, vision transformers have demonstrated strong performance in various computer vision tasks. The success of ViTs can be attribute to the ability of capturing long-range dependencies. However, transformer-based approaches often yield segmentation maps with incomplete object structures because of restricted cross-stage information propagation and lack of low-level details. To address these problems, we introduce a CNNtransformer semantic segmentation architecture which adopts a CNN backbone for multi-level feature extraction and a transformer encoder that focuses on global perception learning. Transformer embeddings of all stages are integrated to compute feature weights for dynamic cross-stage feature reweighting. As a result, high-level semantic context and low-level spatial details can be embedded into each stage to preserve multi-level information. An efficient attention-based feature fusion mechanism is developed to combine reweighted transformer embeddings with CNN features to generate segmentation maps with more complete object structure. Different from regular self-attention that has quadratic computational complexity, our efficient self-attention method achieves similar performance with linear complexity. Experimental results on ADE20K and Cityscapes datasets show that the proposed segmentation approach demonstrates superior performance against most state-of-the-art networks.

引用

页数：11

共 50 条

[21] CaSaFormer: A cross- and self-attention based lightweight network for large-scale building semantic segmentation
Li, Jiayi
Hu, Yuping
Huang, Xin
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 130
[22] UNSUPERVISED DOMAIN ADAPTION FOR REMOTE SENSING SEMANTIC SEGMENTATION WITH SELF-ATTENTION MECHANISM
Liu, Keming
Liu, Fang
Liu, Jia
Xiao, Liang
Tang, Xu
IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6916 - 6919
[23] Cross-form efficient attention pyramidal network for semantic image segmentation
Maurya, Anamika
Chand, Satish
AI COMMUNICATIONS, 2022, 35 (03) : 225 - 242
[24] 1D Self-Attention Network for Point Cloud Semantic Segmentation Using Omnidirectional LiDAR
Suzuki, Takahiro
Hirakawa, Tsubasa
Yamashita, Takayoshi
Fujiyoshi, Hironobu
PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 257 - 270
[25] Self-Attention Technology in Image Segmentation
Cao, Fude
Lu, Xueyun
INTERNATIONAL CONFERENCE ON INTELLIGENT TRAFFIC SYSTEMS AND SMART CITY (ITSSC 2021), 2022, 12165
[26] Research of Self-Attention in Image Segmentation
Cao, Fude
Zheng, Chunguang
Huang, Limin
Wang, Aihua
Zhang, Jiong
Zhou, Feng
Ju, Haoxue
Guo, Haitao
Du, Yuxia
JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2022, 15 (01)
[27] Using Guided Self-Attention with Local Information for Polyp Segmentation
Cai, Linghan
Wu, Meijing
Chen, Lijiang
Bai, Wenpei
Yang, Min
Lyu, Shuchang
Zhao, Qi
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT IV, 2022, 13434 : 629 - 638
[28] A global reweighting approach for cross-domain semantic segmentation
Zhang, Yuhang
Tian, Shishun
Liao, Muxin
Hua, Guoguang
Zou, Wenbin
Xu, Chen
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 130
[29] Semantic segmentation of 3D point cloud based on self-attention feature fusion group convolutional neural network
Yang J.
Li B.
Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (07): : 840 - 853
[30] A Model for Sea Ice Segmentation based on Feature Pyramid Network and Multi-head Self-attention
Xu, Yuanxiang
Feng, Yuan
Song, Shengyu
Liu, Jiahao
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 97 - 102

← 1 2 3 4 5 →