Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation

Cited by: 0

Authors:

Liu, Weitao [1]

Wu, Junjun [1]

Affiliations:

[1] Foshan Univ, Guangdong Prov Key Lab Ind Intelligent Inspection, Foshan, Peoples R China

Source:

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS | 2024 / Vol. 17 / Issue 01

Funding:

National Natural Science Foundation of China; National Key Research and Development Program of China;

Keywords:

Semantic segmentation; Channel2DTransformer; Self-attention; Deep learning;

DOI:

10.1007/s44196-024-00630-5

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Semantic segmentation is a crucial technology for intelligent vehicles, enabling scene understanding in complex driving environments. However, complex real-world scenes often contain diverse multi-scale objects, which make accurate semantic segmentation challenging. To address this challenge, we propose a multi-level feature self-attention fusion module called Channel2DTransformer. The module uses self-attention to dynamically fuse multi-level features by computing self-attention weights between their channels, yielding a consistent and comprehensive representation of scene features. We evaluate the module on the Cityscapes and NYUDepthV2 datasets, which contain a large number of multi-scale objects. The experimental results show that the module improves segmentation accuracy for multi-scale objects and overall semantic segmentation performance in complex scenes.
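The abstract's core idea, attention weights computed between the channels of multi-level features, can be illustrated with a minimal NumPy sketch. This is not the authors' Channel2DTransformer: the function name `channel_self_attention_fuse` is hypothetical, the learned query/key/value projections of a real Transformer block are omitted, and the feature maps are assumed to have already been resized to a common spatial size.

```python
import numpy as np


def channel_self_attention_fuse(features):
    """Fuse feature maps with attention computed between channels.

    features: list of arrays shaped (C_i, H, W) that share the same H and W
    (assumption: multi-level features were resized beforehand).
    Returns the fused map (C_total, H, W) and the channel attention
    weights (C_total, C_total).
    """
    # Stack the channels of every level into one tensor.
    x = np.concatenate(features, axis=0)
    c, h, w = x.shape

    # Treat each channel as one token whose embedding is its flattened map.
    tokens = x.reshape(c, h * w)

    # Scaled dot-product similarity between every pair of channels.
    scores = tokens @ tokens.T / np.sqrt(h * w)

    # Row-wise softmax (shifted by the row maximum for numerical stability).
    scores = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)

    # Each output channel is an attention-weighted mix of all input channels.
    fused = weights @ tokens
    return fused.reshape(c, h, w), weights


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    low_level = rng.standard_normal((4, 8, 8))   # hypothetical shallow features
    high_level = rng.standard_normal((6, 8, 8))  # hypothetical deep features
    fused, weights = channel_self_attention_fuse([low_level, high_level])
    print(fused.shape, weights.shape)
```

Because the attention matrix is indexed by channel rather than by pixel, its size depends on the total channel count and not on the image resolution, which is one plausible reason to attend across channels when fusing features from several levels.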


Pages: 11

