Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation

被引：0

作者：

Liu, Weitao ^{[1
]}

Wu, Junjun ^{[1
]}

机构：

[1] Foshan Univ, Guangdong Prov Key Lab Ind Intelligent Inspection, Foshan, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS | 2024年 / 17卷 / 01期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Semantic segmentation; Channel2DTransformer; Self-attention; Deep learning;

D O I：

10.1007/s44196-024-00630-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semantic segmentation is a crucial technology for intelligent vehicles, enabling scene understanding in complex driving environments. However, complex real-world scenarios often contain diverse multi-scale objects, which bring challenges to the accurate semantic segmentation. To address this challenge, we propose a multi-level features self-attention fusion module called Channel2DTransformer. The module utilizes self-attention mechanisms to dynamically fuse multi-level features by computing self-attention weights between their channels, resulting in a consistent and comprehensive representation of scene features. We perform the module on the Cityscapes and NYUDepthV2 datasets, which contain a large number of multi-scale objects. The experimental results validate the positive contributions of the module in enhancing the semantic segmentation accuracy of multi-scale objects and improving the performance of semantic segmentation in complex scenes.

引用

页数：11

共 44 条

[1] Self-attention feature fusion network for semantic segmentation
Zhou, Zhen
Zhou, Yan
Wang, Dongli
Mu, Jinzhen
Zhou, Haibin
NEUROCOMPUTING, 2021, 453 : 50 - 59
[2] Lunet: an enhanced upsampling fusion network with efficient self-attention for semantic segmentation
Zhou, Yan
Zhou, Haibin
Yang, Yin
Li, Jianxun
Irampaye, Richard
Wang, Dongli
Zhang, Zhengpeng
VISUAL COMPUTER, 2024, : 3109 - 3128
[3] Multi-layered self-attention mechanism for weakly supervised semantic segmentation
Yaganapu, Avinash
Kang, Mingon
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 239
[4] Multi-level feature re-weighted fusion for the semantic segmentation of crops and weeds
Janneh, Lamin L.
Zhang, Yongjun
Cui, Zhongwei
Yang, Yitong
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (06)
[5] SATSal: A Multi-Level Self-Attention Based Architecture for Visual Saliency Prediction
Tliba, Marouane
Kerkouri, Mohamed A.
Ghariba, Bashir
Chetouani, Aladine
Coeltekin, Arzu
Shehata, Mohamed
Bruno, Alessandro
IEEE ACCESS, 2022, 10 : 20701 - 20713
[6] Multi-type and Multi-level Feature Fusion Network for RGBD Indoor Semantic Segmentation
Xia, Yuwen
Gu, Chaochen
Wu, Kaijie
2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 6142 - 6148
[7] Small-scale Image Semantic Segmentation Method Based on Multi-level Superposition and Enhancement Fusion
Su, Xiaodong
Liang, Hongyu
Yao, Guilin
Li, Hui
Li, Shizhou
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1502 - 1507
[8] MASANet: Multi-Angle Self-Attention Network for Semantic Segmentation of Remote Sensing Images
Zeng, Fuping
Yang, Bin
Zhao, Mengci
Xing, Ying
Ma, Yiran
TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2022, 29 (05): : 1567 - 1575
[9] Semantic Segmentation Algorithm Based Multi-headed Self-attention for Tea Picking Points
Song Y.
Yang S.
Zheng Z.
Ning J.
Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2023, 54 (09): : 297 - 305
[10] Traffic Scene Semantic Segmentation Algorithm with Knowledge Distillation of Multi-level Features Guided by Boundary Perception
Xie, Xinlin
Duan, Zeyun
Luo, Chenyan
Xie, Gang
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (09): : 770 - 785

← 1 2 3 4 5 →