Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation

被引：0

作者：

Liu, Weitao ^{[1
]}

Wu, Junjun ^{[1
]}

机构：

[1] Foshan Univ, Guangdong Prov Key Lab Ind Intelligent Inspection, Foshan, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS | 2024年 / 17卷 / 01期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Semantic segmentation; Channel2DTransformer; Self-attention; Deep learning;

D O I：

10.1007/s44196-024-00630-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semantic segmentation is a crucial technology for intelligent vehicles, enabling scene understanding in complex driving environments. However, complex real-world scenarios often contain diverse multi-scale objects, which bring challenges to the accurate semantic segmentation. To address this challenge, we propose a multi-level features self-attention fusion module called Channel2DTransformer. The module utilizes self-attention mechanisms to dynamically fuse multi-level features by computing self-attention weights between their channels, resulting in a consistent and comprehensive representation of scene features. We perform the module on the Cityscapes and NYUDepthV2 datasets, which contain a large number of multi-scale objects. The experimental results validate the positive contributions of the module in enhancing the semantic segmentation accuracy of multi-scale objects and improving the performance of semantic segmentation in complex scenes.

引用

页数：11

共 44 条

[41] MSU-Net: Multi-Scale self-attention semantic segmentation method for oil-tea camellia planting area extraction in hilly areas of southern China
Xu, Zikun
Li, Hengkai
Long, Beiping
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 263
[42] Chinese Named Entity Recognition for Dairy Cow Diseases by Fusion of Multi-Semantic Features Using Self-Attention-Based Deep Learning
Lou, Yongjun
Gao, Meng
Zhang, Shuo
Yang, Hongjun
Wang, Sicong
He, Yongqiang
Yang, Jing
Yang, Wenxia
Du, Haitao
Shen, Weizheng
ANIMALS, 2025, 15 (06):
[43] FCSU-Net: A novel full-scale Cross-dimension Self-attention U-Net with collaborative fusion of multi-scale feature for medical image segmentation
Xu, Shijie
Chen, Yufeng
Yang, Shukai
Zhang, Xiaoqian
Sun, Feng
Computers in Biology and Medicine, 2024, 180
[44] LSAM: L2-norm self-attention and latent space feature interaction for automatic 3D multi-modal head and neck tumor segmentation
Li, Laquan
Tan, Jiaxin
Yu, Lei
Li, Chunwen
Nan, Hai
Zheng, Shenhai
PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (22)

← 1 2 3 4 5 →