Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation

被引:0
作者
Liu, Weitao [1 ]
Wu, Junjun [1 ]
机构
[1] Foshan Univ, Guangdong Prov Key Lab Ind Intelligent Inspection, Foshan, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Semantic segmentation; Channel2DTransformer; Self-attention; Deep learning;
D O I
10.1007/s44196-024-00630-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation is a crucial technology for intelligent vehicles, enabling scene understanding in complex driving environments. However, complex real-world scenarios often contain diverse multi-scale objects, which bring challenges to the accurate semantic segmentation. To address this challenge, we propose a multi-level features self-attention fusion module called Channel2DTransformer. The module utilizes self-attention mechanisms to dynamically fuse multi-level features by computing self-attention weights between their channels, resulting in a consistent and comprehensive representation of scene features. We perform the module on the Cityscapes and NYUDepthV2 datasets, which contain a large number of multi-scale objects. The experimental results validate the positive contributions of the module in enhancing the semantic segmentation accuracy of multi-scale objects and improving the performance of semantic segmentation in complex scenes.
引用
收藏
页数:11
相关论文
共 44 条
  • [1] Self-attention feature fusion network for semantic segmentation
    Zhou, Zhen
    Zhou, Yan
    Wang, Dongli
    Mu, Jinzhen
    Zhou, Haibin
    NEUROCOMPUTING, 2021, 453 : 50 - 59
  • [2] Lunet: an enhanced upsampling fusion network with efficient self-attention for semantic segmentation
    Zhou, Yan
    Zhou, Haibin
    Yang, Yin
    Li, Jianxun
    Irampaye, Richard
    Wang, Dongli
    Zhang, Zhengpeng
    VISUAL COMPUTER, 2024, : 3109 - 3128
  • [3] Multi-layered self-attention mechanism for weakly supervised semantic segmentation
    Yaganapu, Avinash
    Kang, Mingon
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 239
  • [4] Multi-level feature re-weighted fusion for the semantic segmentation of crops and weeds
    Janneh, Lamin L.
    Zhang, Yongjun
    Cui, Zhongwei
    Yang, Yitong
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (06)
  • [5] SATSal: A Multi-Level Self-Attention Based Architecture for Visual Saliency Prediction
    Tliba, Marouane
    Kerkouri, Mohamed A.
    Ghariba, Bashir
    Chetouani, Aladine
    Coeltekin, Arzu
    Shehata, Mohamed
    Bruno, Alessandro
    IEEE ACCESS, 2022, 10 : 20701 - 20713
  • [6] Multi-type and Multi-level Feature Fusion Network for RGBD Indoor Semantic Segmentation
    Xia, Yuwen
    Gu, Chaochen
    Wu, Kaijie
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 6142 - 6148
  • [7] Small-scale Image Semantic Segmentation Method Based on Multi-level Superposition and Enhancement Fusion
    Su, Xiaodong
    Liang, Hongyu
    Yao, Guilin
    Li, Hui
    Li, Shizhou
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1502 - 1507
  • [8] MASANet: Multi-Angle Self-Attention Network for Semantic Segmentation of Remote Sensing Images
    Zeng, Fuping
    Yang, Bin
    Zhao, Mengci
    Xing, Ying
    Ma, Yiran
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2022, 29 (05): : 1567 - 1575
  • [9] Semantic Segmentation Algorithm Based Multi-headed Self-attention for Tea Picking Points
    Song Y.
    Yang S.
    Zheng Z.
    Ning J.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2023, 54 (09): : 297 - 305
  • [10] Traffic Scene Semantic Segmentation Algorithm with Knowledge Distillation of Multi-level Features Guided by Boundary Perception
    Xie, Xinlin
    Duan, Zeyun
    Luo, Chenyan
    Xie, Gang
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (09): : 770 - 785