Dual Channel-Spatial Self-Attention Transformer and CNN synergy network for 3D medical image segmentation

被引：0

作者：

Yang, Fan ^{[1
]}

Wang, Bo ^{[1
]}

机构：

[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan 750021, Peoples R China

来源：

APPLIED SOFT COMPUTING | 2024年 / 167卷

基金：

中国国家自然科学基金;

关键词：

Convolutional neural layers; Attention collapse; Self-attention mechanism; Transformers; 3D medical image segmentation;

D O I：

10.1016/j.asoc.2024.112255

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Even though the Vision Transformer leverages the self-attention mechanism to capture long-range dependencies, showing significant potential in medical image segmentation, the limited annotations in the image dataset make it difficult for the Transformer model to extract different global features, resulting in attention collapse and generating similar or identical attention maps. Previous studies have attempted to solve the problem by integrating convolutional neural layers into Transformer-based architectures. However, improper integration may lead to the inability of the model to effectively capture local and global information in both spatial and channel dimensions. To address the above issue, we propose a hybrid architecture using the Dual Channel-Spatial SelfAttention Transformer and CNN Synergy Network (DTC-SUNETR) for medical image segmentation. Specifically, we redesigned the self-attention mechanism. A novel Channel-Spatial Self-Attention (CSSA) block is introduced to integrate the enhanced channel and spatial self-attention mechanism to capture the global relationship and local structure among image features. This helps the model to more comprehensively understand the interdependencies between different channels and capture the relationships between different pixels, thus enhancing the feature representation of the corresponding dimensions. Simultaneously, it also improves the overall computational efficiency of the network. Extensive experiments on four different medical image segmentation datasets, including Synapse, ACDC, Brain Tumor, and Lung Tumor, demonstrate the superiority of the proposed DTC-SUNETR over state-of-the-art methods.

引用

页数：12

共 43 条

[21] 3D Medical image segmentation using parallel transformers
Yan, Qingsen
Liu, Shengqiang
Xu, Songhua
Dong, Caixia
Li, Zongfang
Shi, Javen Qinfeng
Zhang, Yanning
Dai, Duwei
PATTERN RECOGNITION, 2023, 138
[22] Emotional analysis of film and television reviews based on self-attention mechanism and dual-channel neural network
Wang, Fugang
Gong, Xueliang
Wang, Xingkai
Liu, Xuan
Chen, Yu
Liu, Zirui
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MODELING, NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING, CMNM 2024, 2024, : 86 - 90
[23] Multi-View 3D Reconstruction Method Based on Self-Attention Mechanism
Zhu, Guangzhao
Bo, Wei
Yang, Afeng
Xin, Xu
LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (16)
[24] Sequential Spectral-Spatial Feature Convolution Network With Self-Attention for Remote Sensing Hyperspectral Image Classification
Liu, Jiqing
Wang, Han
Liu, Renhe
Wang, Shaochu
Liu, Yu
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
[25] SCTANet: A Spatial Attention-Guided CNN-Transformer Aggregation Network for Deep Face Image Super-Resolution
Bao, Qiqi
Liu, Yunmeng
Gang, Bowen
Yang, Wenming
Liao, Qingmin
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8554 - 8565
[26] GFA-SMT: Geometric Feature Aggregation and Self-Attention in a Multi-Head Transformer for 3D Object Detection in Autonomous Vehicles
Mushtaq, Husnain
Deng, Xiaoheng
Jiang, Ping
Wan, Shaohua
Ali, Mubashir
Ullah, Irshad
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (03) : 3557 - 3573
[27] SAT-GCN: Self-attention graph convolutional network-based 3D object detection for autonomous driving
Wang, Li
Song, Ziying
Zhang, Xinyu
Wang, Chenfei
Zhang, Guoxin
Zhu, Lei
Li, Jun
Liu, Huaping
KNOWLEDGE-BASED SYSTEMS, 2023, 259
[28] Effective Global Context Integration for Lightweight 3D Medical Image Segmentation
Qiao, Qiang
Qu, Meixia
Wang, Wenyu
Jiang, Bin
Guo, Qiang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (05) : 4661 - 4674
[29] DMCGNet: A Novel Network for Medical Image Segmentation With Dense Self-Mimic and Channel Grouping Mechanism
Xie, Linsen
Cai, Wentian
Gao, Ying
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (10) : 5013 - 5024
[30] Domain-Guided Spatio-Temporal Self-Attention for Egocentric 3D Pose Estimation
Park, Jinman
Kaai, Kimathi
Hossain, Saad
Sumi, Norikatsu
Rambhatla, Sirisha
Fieguth, Paul
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 1837 - 1849

← 1 2 3 4 5 →