Dual Channel-Spatial Self-Attention Transformer and CNN synergy network for 3D medical image segmentation

被引:0
作者
Yang, Fan [1 ]
Wang, Bo [1 ]
机构
[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan 750021, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural layers; Attention collapse; Self-attention mechanism; Transformers; 3D medical image segmentation;
D O I
10.1016/j.asoc.2024.112255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Even though the Vision Transformer leverages the self-attention mechanism to capture long-range dependencies, showing significant potential in medical image segmentation, the limited annotations in the image dataset make it difficult for the Transformer model to extract different global features, resulting in attention collapse and generating similar or identical attention maps. Previous studies have attempted to solve the problem by integrating convolutional neural layers into Transformer-based architectures. However, improper integration may lead to the inability of the model to effectively capture local and global information in both spatial and channel dimensions. To address the above issue, we propose a hybrid architecture using the Dual Channel-Spatial SelfAttention Transformer and CNN Synergy Network (DTC-SUNETR) for medical image segmentation. Specifically, we redesigned the self-attention mechanism. A novel Channel-Spatial Self-Attention (CSSA) block is introduced to integrate the enhanced channel and spatial self-attention mechanism to capture the global relationship and local structure among image features. This helps the model to more comprehensively understand the interdependencies between different channels and capture the relationships between different pixels, thus enhancing the feature representation of the corresponding dimensions. Simultaneously, it also improves the overall computational efficiency of the network. Extensive experiments on four different medical image segmentation datasets, including Synapse, ACDC, Brain Tumor, and Lung Tumor, demonstrate the superiority of the proposed DTC-SUNETR over state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 43 条
  • [31] Grid self-attention mechanism 3D object detection method based on raw point cloud
    Lu B.
    Sun Y.
    Yang Z.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (10): : 72 - 84
  • [32] UNETR plus plus : Delving Into Efficient and Accurate 3D Medical Image Segmentation
    Shaker, Abdelrahman
    Maaz, Muhammad
    Rasheed, Hanoona
    Khan, Salman
    Yang, Ming-Hsuan
    Khan, Fahad Shahbaz
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (09) : 3377 - 3390
  • [33] MSA-Net: multiple self-attention mechanism for 3D lung nodule classification in CT images
    Jiating Pan
    Lishi Liang
    Peng Sun
    Yongbo Liang
    Jianming Zhu
    Zhencheng Chen
    BMC Medical Imaging, 25 (1)
  • [34] DPKI-Net: Dual Prior Knowledge Injection Network for Multitask 3-D Medical Image Segmentation and Landmark Localization
    Li, Xiang
    Li, Like
    Zhang, Kesheng
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [35] Graph Neural Network and Spatiotemporal Transformer Attention for 3D Video Object Detection From Point Clouds
    Yin, Junbo
    Shen, Jianbing
    Gao, Xin
    Crandall, David J.
    Yang, Ruigang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9822 - 9835
  • [36] Differential Self-Feedback Dilated Convolution Network With Dual-Tree Channel Attention Mechanism for Hyperspectral Image Classification
    Xiao, Zhiqiang
    Ye, Kuntao
    Cui, Guolong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 17
  • [37] Spatiotemporal Self-Attention Mechanism Driven by 3D Pose to Guide RGB Cues for Daily Living Human Activity Recognition
    Basly, Hend
    Zayene, Mohamed Amine
    Sayadi, Fatma Ezahra
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2023, 109 (01)
  • [38] Spatiotemporal Self-Attention Mechanism Driven by 3D Pose to Guide RGB Cues for Daily Living Human Activity Recognition
    Hend Basly
    Mohamed Amine Zayene
    Fatma Ezahra Sayadi
    Journal of Intelligent & Robotic Systems, 2023, 109
  • [39] SSCFormer: Revisiting ConvNet-Transformer Hybrid Framework From Scale-Wise and Spatial-Channel-Aware Perspectives for Volumetric Medical Image Segmentation
    Xie, Qinlan
    Chen, Yong
    Liu, Shenglin
    Lu, Xuesong
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (08) : 4830 - 4841
  • [40] MFFTNet: A Novel 3D Point Cloud Segmentation Network Based on Multi-Scale Feature Fusion and Transformer Architecture
    Bai, Hao
    Li, Xiongwei
    Meng, Qing
    Zhuo, Shulong
    Yan, Lili
    IEEE ACCESS, 2025, 13 : 9462 - 9472