Dual Channel-Spatial Self-Attention Transformer and CNN synergy network for 3D medical image segmentation

被引:0
作者
Yang, Fan [1 ]
Wang, Bo [1 ]
机构
[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan 750021, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural layers; Attention collapse; Self-attention mechanism; Transformers; 3D medical image segmentation;
D O I
10.1016/j.asoc.2024.112255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Even though the Vision Transformer leverages the self-attention mechanism to capture long-range dependencies, showing significant potential in medical image segmentation, the limited annotations in the image dataset make it difficult for the Transformer model to extract different global features, resulting in attention collapse and generating similar or identical attention maps. Previous studies have attempted to solve the problem by integrating convolutional neural layers into Transformer-based architectures. However, improper integration may lead to the inability of the model to effectively capture local and global information in both spatial and channel dimensions. To address the above issue, we propose a hybrid architecture using the Dual Channel-Spatial SelfAttention Transformer and CNN Synergy Network (DTC-SUNETR) for medical image segmentation. Specifically, we redesigned the self-attention mechanism. A novel Channel-Spatial Self-Attention (CSSA) block is introduced to integrate the enhanced channel and spatial self-attention mechanism to capture the global relationship and local structure among image features. This helps the model to more comprehensively understand the interdependencies between different channels and capture the relationships between different pixels, thus enhancing the feature representation of the corresponding dimensions. Simultaneously, it also improves the overall computational efficiency of the network. Extensive experiments on four different medical image segmentation datasets, including Synapse, ACDC, Brain Tumor, and Lung Tumor, demonstrate the superiority of the proposed DTC-SUNETR over state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 43 条
  • [11] Transformer-Based Dual-Channel Self-Attention for UUV Autonomous Collision Avoidance
    Lin, Changjian
    Cheng, Yuhu
    Wang, Xuesong
    Yuan, Jianya
    Wang, Guoqing
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (03): : 2319 - 2331
  • [12] CSA-CNN: A Contrastive Self-Attention Neural Network for Pupil Segmentation in Eye Gaze Tracking
    Chugh, Soumil
    Fu, Yuqi
    Ye, Juntao
    Eizenman, Moshe
    PROCEEDINGS OF THE 2024 ACM SYMPOSIUM ON EYE TRACKING RESEARCH & APPLICATIONS, ETRA 2024, 2024,
  • [13] TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation
    Song, Pengfei
    Li, Jinjiang
    Fan, Hui
    Fan, Linwei
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 167
  • [14] GLSANet: Global-Local Self-Attention Network for Remote Sensing Image Semantic Segmentation
    Hu, Xudong
    Zhang, Penglin
    Zhang, Qi
    Yuan, Feng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [15] GLSANet: Global-Local Self-Attention Network for Remote Sensing Image Semantic Segmentation
    Hu, Xudong
    Zhang, Penglin
    Zhang, Qi
    Yuan, Feng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [16] DPCTN: Dual path context-aware transformer network for medical image segmentation
    Song, Pengfei
    Yang, Zhe
    Li, Jinjiang
    Fan, Hui
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 124
  • [17] LHDACT: Lightweight Hybrid Dual Attention CNN and Transformer Network for Remote Sensing Image Change Detection
    Song, Xinyang
    Hua, Zhen
    Li, Jinjiang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [18] MISSU: 3D Medical Image Segmentation via Self-Distilling TransUNet
    Wang, Nan
    Lin, Shaohui
    Li, Xiaoxiao
    Li, Ke
    Shen, Yunhang
    Gao, Yue
    Ma, Lizhuang
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (09) : 2740 - 2750
  • [19] 3DGTN: 3-D Dual-Attention GLocal Transformer Network for Point Cloud Classification and Segmentation
    Lu, Dening
    Gao, Kyle
    Xie, Qian
    Xu, Linlin
    Li, Jonathan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 13
  • [20] Multi-Head Self-Attention for 3D Point Cloud Classification
    Gao, Xue-Yao
    Wang, Yan-Zhao
    Zhang, Chun-Xiang
    Lu, Jia-Qi
    IEEE ACCESS, 2021, 9 : 18137 - 18147