Dual Channel-Spatial Self-Attention Transformer and CNN synergy network for 3D medical image segmentation

被引：0

作者：

Yang, Fan ^{[1
]}

Wang, Bo ^{[1
]}

机构：

[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan 750021, Peoples R China

来源：

APPLIED SOFT COMPUTING | 2024年 / 167卷

基金：

中国国家自然科学基金;

关键词：

Convolutional neural layers; Attention collapse; Self-attention mechanism; Transformers; 3D medical image segmentation;

D O I：

10.1016/j.asoc.2024.112255

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Even though the Vision Transformer leverages the self-attention mechanism to capture long-range dependencies, showing significant potential in medical image segmentation, the limited annotations in the image dataset make it difficult for the Transformer model to extract different global features, resulting in attention collapse and generating similar or identical attention maps. Previous studies have attempted to solve the problem by integrating convolutional neural layers into Transformer-based architectures. However, improper integration may lead to the inability of the model to effectively capture local and global information in both spatial and channel dimensions. To address the above issue, we propose a hybrid architecture using the Dual Channel-Spatial SelfAttention Transformer and CNN Synergy Network (DTC-SUNETR) for medical image segmentation. Specifically, we redesigned the self-attention mechanism. A novel Channel-Spatial Self-Attention (CSSA) block is introduced to integrate the enhanced channel and spatial self-attention mechanism to capture the global relationship and local structure among image features. This helps the model to more comprehensively understand the interdependencies between different channels and capture the relationships between different pixels, thus enhancing the feature representation of the corresponding dimensions. Simultaneously, it also improves the overall computational efficiency of the network. Extensive experiments on four different medical image segmentation datasets, including Synapse, ACDC, Brain Tumor, and Lung Tumor, demonstrate the superiority of the proposed DTC-SUNETR over state-of-the-art methods.

引用

页数：12

共 43 条

[1] TPAFNet: Transformer-Driven Pyramid Attention Fusion Network for 3D Medical Image Segmentation
Li, Zheng
Zhang, Jinhui
Wei, Siyi
Gao, Yueyang
Cao, Chengwei
Wu, Zhiwei
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (11) : 6803 - 6814
[2] Permutation invariant self-attention infused U-shaped transformer for medical image segmentation
Patil, Sanjeet S.
Ramteke, Manojkumar
Rathore, Anurag S.
NEUROCOMPUTING, 2025, 625
[3] HCA-former: Hybrid Convolution Attention Transformer for 3D Medical Image Segmentation
Yang, Fan
Wang, Fan
Dong, Pengwei
Wang, Bo
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 90
[4] SACA-UNet:Medical Image Segmentation Network Based on Self-Attention and ASPP
Fan, Gaojuan
Wang, Jie
Zhang, Chongsheng
2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 317 - 322
[5] Spatial and channel enhanced self-attention network for efficient single image super-resolution
Song, Xiaogang
Tan, Yuping
Pang, Xinchao
Zhang, Lei
Lu, Xiaofeng
Hei, Xinhong
NEUROCOMPUTING, 2025, 620
[6] 3D CATBraTS: Channel Attention Transformer for Brain Tumour Semantic Segmentation
El Badaoui, Rim
Coll, Bonmati
Psarrou, Aleka
Villarini, Barbara
2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 489 - 494
[7] Dual Self-Attention Swin Transformer for Hyperspectral Image Super-Resolution
Long, Yaqian
Wang, Xun
Xu, Meng
Zhang, Shuyu
Jiang, Shuguo
Jia, Sen
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[8] Deep 3D Neural Network for Brain Structures Segmentation Using Self-Attention Modules in MRI Images
Laiton-Bonadiez, Camilo
Sanchez-Torres, German
Branch-Bedoya, John
SENSORS, 2022, 22 (07)
[9] DCTN: Dual-Branch Convolutional Transformer Network With Efficient Interactive Self-Attention for Hyperspectral Image Classification
Zhou, Yunfei
Huang, Xiaohui
Yang, Xiaofei
Peng, Jiangtao
Ban, Yifang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 16
[10] U-Net Transformer: Self and Cross Attention for Medical Image Segmentation
Petit, Olivier
Thome, Nicolas
Rambour, Clement
Themyr, Loic
Collins, Toby
Soler, Luc
MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 267 - 276

← 1 2 3 4 5 →