Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation

被引:0
作者
Liu, Weitao [1 ]
Wu, Junjun [1 ]
机构
[1] Foshan Univ, Guangdong Prov Key Lab Ind Intelligent Inspection, Foshan, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Semantic segmentation; Channel2DTransformer; Self-attention; Deep learning;
D O I
10.1007/s44196-024-00630-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation is a crucial technology for intelligent vehicles, enabling scene understanding in complex driving environments. However, complex real-world scenarios often contain diverse multi-scale objects, which bring challenges to the accurate semantic segmentation. To address this challenge, we propose a multi-level features self-attention fusion module called Channel2DTransformer. The module utilizes self-attention mechanisms to dynamically fuse multi-level features by computing self-attention weights between their channels, resulting in a consistent and comprehensive representation of scene features. We perform the module on the Cityscapes and NYUDepthV2 datasets, which contain a large number of multi-scale objects. The experimental results validate the positive contributions of the module in enhancing the semantic segmentation accuracy of multi-scale objects and improving the performance of semantic segmentation in complex scenes.
引用
收藏
页数:11
相关论文
共 44 条
  • [31] SE2Net: semantic segmentation of remote sensing images based on self-attention and edge enhancement modules
    Liu, Songlin
    Gao, Kai
    Qin, Jinchun
    Gong, Hui
    Wang, Haiyan
    Zhang, Li
    Gong, Danchao
    JOURNAL OF APPLIED REMOTE SENSING, 2021, 15 (02)
  • [32] UAM-Net: An Attention-Based Multi-level Feature Fusion UNet for Remote Sensing Image Segmentation
    Cao, Yiwen
    Jiang, Nanfeng
    Wang, Da-Han
    Wu, Yun
    Zhu, Shunzhi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IV, 2024, 14428 : 267 - 278
  • [33] Semantic Segmentation of Urban Airborne LiDAR Point Clouds Based on Fusion Attention Mechanism and Multi-Scale Features
    Wang, Jingxue
    Li, Huan
    Xu, Zhenghui
    Xie, Xiao
    REMOTE SENSING, 2023, 15 (21)
  • [34] A self-attention based global feature enhancing network for semantic segmentation of large-scale urban street-level point clouds
    Chen, Qi
    Zhang, Zhenxin
    Chen, Siyun
    Wen, Siyuan
    Ma, Hao
    Xu, Zhihua
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 113
  • [35] Multi-Level Branch Cross-Scale Fusion Network for High-Precision Semantic Segmentation in Complex Remote Sensing Environments
    Zeng, Junying
    Deng, Senyao
    Qin, Chuanbo
    Zhai, Yikui
    Jia, Xudong
    Gu, Yajin
    Xu, Jiahua
    LASER & OPTOELECTRONICS PROGRESS, 2025, 62 (04)
  • [36] DS-MSFF-Net: Dual-path self-attention multi-scale feature fusion network for CT image segmentation
    Zhang, Xiaoqian
    Pu, Lei
    Wan, Liming
    Wang, Xiao
    Zhou, Ying
    APPLIED INTELLIGENCE, 2024, 54 (06) : 4490 - 4506
  • [37] DS-MSFF-Net: Dual-path self-attention multi-scale feature fusion network for CT image segmentation
    Xiaoqian Zhang
    Lei Pu
    Liming Wan
    Xiao Wang
    Ying Zhou
    Applied Intelligence, 2024, 54 : 4490 - 4506
  • [38] A novel MCF-Net: Multi-level context fusion network for 2D medical image segmentation
    Liu, Lizhu
    Liu, Yexin
    Zhou, Jian
    Guo, Cheng
    Duan, Huigao
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 226
  • [39] M2FNet: multi-modality multi-level fusion network for segmentation of acute and sub-acute ischemic stroke
    Shannan Chen
    Xuanhe Zhao
    Yang Duan
    Ronghui Ju
    Peizhuo Zang
    Shouliang Qi
    Complex & Intelligent Systems, 2025, 11 (6)
  • [40] AVR (advancing video retrieval): A new framework guided by multi-level fusion of visual and semantic Features for deep learning-based concept detection
    Mohamed Hamroun
    Sonia Lajmi
    Maryam Jallouli
    Multimedia Tools and Applications, 2025, 84 (5) : 2715 - 2777