Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation

被引:0
作者
Liu, Weitao [1 ]
Wu, Junjun [1 ]
机构
[1] Foshan Univ, Guangdong Prov Key Lab Ind Intelligent Inspection, Foshan, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Semantic segmentation; Channel2DTransformer; Self-attention; Deep learning;
D O I
10.1007/s44196-024-00630-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation is a crucial technology for intelligent vehicles, enabling scene understanding in complex driving environments. However, complex real-world scenarios often contain diverse multi-scale objects, which bring challenges to the accurate semantic segmentation. To address this challenge, we propose a multi-level features self-attention fusion module called Channel2DTransformer. The module utilizes self-attention mechanisms to dynamically fuse multi-level features by computing self-attention weights between their channels, resulting in a consistent and comprehensive representation of scene features. We perform the module on the Cityscapes and NYUDepthV2 datasets, which contain a large number of multi-scale objects. The experimental results validate the positive contributions of the module in enhancing the semantic segmentation accuracy of multi-scale objects and improving the performance of semantic segmentation in complex scenes.
引用
收藏
页数:11
相关论文
共 44 条
  • [21] Criss-Cross Attention Based Multi-level Fusion Network for Gastric Intestinal Metaplasia Segmentation
    Nien, Chu-Min
    Yang, Er-Hsiang
    Chang, Wei-Lun
    Cheng, Hsiu-Chi
    Huang, Chun-Rong
    IMAGING SYSTEMS FOR GI ENDOSCOPY, AND GRAPHS IN BIOMEDICAL IMAGE ANALYSIS, ISGIE 2022, 2022, 13754 : 13 - 23
  • [22] Semantic Segmentation of Remote Sensing Image via Self-Attention-Based Multi-Scale Feature Fusion
    Guo D.
    Fu Y.
    Zhu Y.
    Wen W.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (08): : 1259 - 1268
  • [23] A lightweight network for abdominal multi-organ segmentation based on multi-scale context fusion and dual self-attention
    Liao, Miao
    Tang, Hongliang
    Li, Xiong
    Vijayakumar, P.
    Arya, Varsha
    Gupta, Brij B.
    INFORMATION FUSION, 2024, 108
  • [24] A Self-Attention-Based Multi-Level Fusion Network for Aspect Category Sentiment Analysis
    Dong Tian
    Jia Shi
    Jianying Feng
    Cognitive Computation, 2023, 15 : 1372 - 1390
  • [25] A Self-Attention-Based Multi-Level Fusion Network for Aspect Category Sentiment Analysis
    Tian, Dong
    Shi, Jia
    Feng, Jianying
    COGNITIVE COMPUTATION, 2023, 15 (04) : 1372 - 1390
  • [26] Semantic segmentation of 3D point cloud based on self-attention feature fusion group convolutional neural network
    Yang J.
    Li B.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (07): : 840 - 853
  • [27] MFAFNet: A Lightweight and Efficient Network with Multi-Level Feature Adaptive Fusion for Real-Time Semantic Segmentation
    Lu, Kai
    Cheng, Jieren
    Li, Hua
    Ouyang, Tianyu
    SENSORS, 2023, 23 (14)
  • [28] CFSA-Net: Efficient Large-Scale Point Cloud Semantic Segmentation Based on Cross-Fusion Self-Attention
    Shu, Jun
    Wang, Shuai
    Yu, Shiqi
    Zhang, Jie
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (03): : 2677 - 2697
  • [29] mTBI-DSANet: A deep self-attention model for diagnosing mild traumatic brain injury using multi-level functional connectivity networks
    Teng, Jing
    Mi, Chunlin
    Liu, Wuyi
    Shi, Jian
    Li, Na
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 152
  • [30] MR-FPN: Multi-Level Residual Feature Pyramid Text Detection Network Based on Self-Attention Environment
    Kang, Jianjun
    Ibrayim, Mayire
    Hamdulla, Askar
    SENSORS, 2022, 22 (09)