Dual Channel-Spatial Self-Attention Transformer and CNN synergy network for 3D medical image segmentation

被引:0
|
作者
Yang, Fan [1 ]
Wang, Bo [1 ]
机构
[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan 750021, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural layers; Attention collapse; Self-attention mechanism; Transformers; 3D medical image segmentation;
D O I
10.1016/j.asoc.2024.112255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Even though the Vision Transformer leverages the self-attention mechanism to capture long-range dependencies, showing significant potential in medical image segmentation, the limited annotations in the image dataset make it difficult for the Transformer model to extract different global features, resulting in attention collapse and generating similar or identical attention maps. Previous studies have attempted to solve the problem by integrating convolutional neural layers into Transformer-based architectures. However, improper integration may lead to the inability of the model to effectively capture local and global information in both spatial and channel dimensions. To address the above issue, we propose a hybrid architecture using the Dual Channel-Spatial SelfAttention Transformer and CNN Synergy Network (DTC-SUNETR) for medical image segmentation. Specifically, we redesigned the self-attention mechanism. A novel Channel-Spatial Self-Attention (CSSA) block is introduced to integrate the enhanced channel and spatial self-attention mechanism to capture the global relationship and local structure among image features. This helps the model to more comprehensively understand the interdependencies between different channels and capture the relationships between different pixels, thus enhancing the feature representation of the corresponding dimensions. Simultaneously, it also improves the overall computational efficiency of the network. Extensive experiments on four different medical image segmentation datasets, including Synapse, ACDC, Brain Tumor, and Lung Tumor, demonstrate the superiority of the proposed DTC-SUNETR over state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] SCANeXt: Enhancing 3D medical image segmentation with dual attention network and depth-wise convolution
    Liu, Yajun
    Zhang, Zenghui
    Yue, Jiang
    Guo, Weiwei
    HELIYON, 2024, 10 (05)
  • [32] iCNN-Transformer: An improved CNN-Transformer with Channel-spatial Attention and Keyword Prediction for Automated Audio Captioning
    Chen, Kun
    Wang, Jun
    Deng, Feng
    Wang, Xiaorui
    INTERSPEECH 2022, 2022, : 4167 - 4171
  • [33] SCA-Net: A Spatial and Channel Attention Network for Medical Image Segmentation
    Shan, Tong
    Yan, Jiayong
    IEEE ACCESS, 2021, 9 (09): : 160926 - 160937
  • [34] TT-Net: Tensorized Transformer Network for 3D medical image segmentation
    Wang, Jing
    Qu, Aixi
    Wang, Qing
    Zhao, Qibin
    Liu, Ju
    Wu, Qiang
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2023, 107
  • [35] HCA-former: Hybrid Convolution Attention Transformer for 3D Medical Image Segmentation
    Yang, Fan
    Wang, Fan
    Dong, Pengwei
    Wang, Bo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 90
  • [36] Multi-modality self-attention aware deep network for 3D biomedical segmentation
    Jia, Xibin
    Liu, Yunfeng
    Yang, Zhenghan
    Yang, Dawei
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (Suppl 3)
  • [37] Multi-modality self-attention aware deep network for 3D biomedical segmentation
    Xibin Jia
    Yunfeng Liu
    Zhenghan Yang
    Dawei Yang
    BMC Medical Informatics and Decision Making, 20
  • [38] CSANet: a channel-spatial attention network for remote sensing image change detection
    Cai, Yuyang
    Liao, Shuhong
    He, Wenxuan
    Huang, Weiliang
    Yan, Jingwen
    Liu, Lei
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (19) : 5936 - 5959
  • [39] Cross self-attention network for 3D point cloud
    Wang, Gaihua
    Zhai, Qianyu
    Liu, Hong
    KNOWLEDGE-BASED SYSTEMS, 2022, 247
  • [40] A deep integrative approach for diabetic retinopathy classification with synergistic channel-spatial and self-attention mechanism
    Madarapu, Sandeep
    Ari, Samit
    Mahapatra, K. K.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249