Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation

Citations: 0
Authors
Liu, Weitao [1]
Wu, Junjun [1]
Affiliations
[1] Foshan Univ, Guangdong Prov Key Lab Ind Intelligent Inspection, Foshan, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Semantic segmentation; Channel2DTransformer; Self-attention; Deep learning;
DOI
10.1007/s44196-024-00630-5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Semantic segmentation is a crucial technology for intelligent vehicles, enabling scene understanding in complex driving environments. However, real-world scenes often contain diverse objects at multiple scales, which makes accurate semantic segmentation difficult. To address this challenge, we propose a multi-level feature self-attention fusion module called Channel2DTransformer. The module uses self-attention to dynamically fuse multi-level features by computing self-attention weights between their channels, yielding a consistent and comprehensive representation of scene features. We evaluate the module on the Cityscapes and NYUDepthV2 datasets, both of which contain a large number of multi-scale objects. The experimental results show that the module improves segmentation accuracy for multi-scale objects and improves overall semantic segmentation performance in complex scenes.
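The abstract's central idea, computing self-attention weights between the channels of multi-level features and using them to fuse those features, can be illustrated with a minimal NumPy sketch. This is not the authors' Channel2DTransformer: the function name `channel_self_attention_fuse`, the absence of learned query/key/value projections, the scaling by the square root of the number of spatial positions, and the assumption that all feature maps have already been resized to a common resolution are illustrative choices only.

```python
import numpy as np


def channel_self_attention_fuse(features):
    """Fuse feature maps using attention computed between channels.

    Hypothetical helper, not the paper's module.
    features: list of arrays shaped (C_i, H, W) that share H and W
    (assumed to be resized to a common resolution beforehand).
    Returns (fused, weights): fused has shape (C, H, W) and weights has
    shape (C, C), where C is the sum of all C_i.
    """
    stacked = np.concatenate(features, axis=0)        # (C, H, W)
    c, h, w = stacked.shape
    tokens = stacked.reshape(c, h * w)                # one token per channel
    scores = tokens @ tokens.T / np.sqrt(h * w)       # (C, C) channel similarities
    scores -= scores.max(axis=1, keepdims=True)       # stabilize the softmax
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)     # row-wise softmax
    fused = (weights @ tokens).reshape(c, h, w)       # mix channels across levels
    return fused, weights


rng = np.random.default_rng(0)
low_level = rng.standard_normal((4, 8, 8))    # assumed shallow-stage features
high_level = rng.standard_normal((6, 8, 8))   # assumed deep-stage features
fused, weights = channel_self_attention_fuse([low_level, high_level])
print(fused.shape, weights.shape)  # (10, 8, 8) (10, 10)
```

Because the attention matrix is indexed by channel rather than by spatial position, its size depends only on the total channel count, and each output channel is a weighted mixture of channels drawn from every input level.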
Pages: 11