Dual Channel-Spatial Self-Attention Transformer and CNN synergy network for 3D medical image segmentation

被引:0
|
作者
Yang, Fan [1 ]
Wang, Bo [1 ]
机构
[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan 750021, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural layers; Attention collapse; Self-attention mechanism; Transformers; 3D medical image segmentation;
D O I
10.1016/j.asoc.2024.112255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Even though the Vision Transformer leverages the self-attention mechanism to capture long-range dependencies, showing significant potential in medical image segmentation, the limited annotations in the image dataset make it difficult for the Transformer model to extract different global features, resulting in attention collapse and generating similar or identical attention maps. Previous studies have attempted to solve the problem by integrating convolutional neural layers into Transformer-based architectures. However, improper integration may lead to the inability of the model to effectively capture local and global information in both spatial and channel dimensions. To address the above issue, we propose a hybrid architecture using the Dual Channel-Spatial SelfAttention Transformer and CNN Synergy Network (DTC-SUNETR) for medical image segmentation. Specifically, we redesigned the self-attention mechanism. A novel Channel-Spatial Self-Attention (CSSA) block is introduced to integrate the enhanced channel and spatial self-attention mechanism to capture the global relationship and local structure among image features. This helps the model to more comprehensively understand the interdependencies between different channels and capture the relationships between different pixels, thus enhancing the feature representation of the corresponding dimensions. Simultaneously, it also improves the overall computational efficiency of the network. Extensive experiments on four different medical image segmentation datasets, including Synapse, ACDC, Brain Tumor, and Lung Tumor, demonstrate the superiority of the proposed DTC-SUNETR over state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] A dual-branch and dual attention transformer and CNN hybrid network for ultrasound image segmentation
    Zhang, Chong
    Wang, Lingtong
    Wei, Guohui
    Kong, Zhiyong
    Qiu, Min
    FRONTIERS IN PHYSIOLOGY, 2024, 15
  • [12] TPAFNet: Transformer-Driven Pyramid Attention Fusion Network for 3D Medical Image Segmentation
    Li, Zheng
    Zhang, Jinhui
    Wei, Siyi
    Gao, Yueyang
    Cao, Chengwei
    Wu, Zhiwei
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (11) : 6803 - 6814
  • [13] Hybrid 3D Medical Image Segmentation Using CNN and Frequency Transformer Fusion
    Labbihi, Ismayl
    Meslouhi, Othmane El
    Elassad, Zouhair Elamrani Abou
    Benaddy, Mohamed
    Kardouchi, Mustapha
    Akhloufi, Moulay
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,
  • [14] PFormer: An efficient CNN-Transformer hybrid network with content-driven P-attention for 3D medical image segmentation
    Gao, Yueyang
    Zhang, Jinhui
    Wei, Siyi
    Li, Zheng
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 101
  • [15] DMMFnet: A Dual-Branch Multimodal Medical Image Fusion Network Using Super Token and Channel-Spatial Attention
    Zhang, Yukun
    Wang, Lei
    Tahir, Muhammad
    Huang, Zizhen
    Han, Yaolong
    Yang, Shanliang
    Liu, Shilong
    Saeed, Muhammad Imran
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 696 - 705
  • [16] MedSegNet: A Lightweight Convolutional Network Combining Dual Self-Attention and Multi-Scale Attention for Medical Image Segmentation
    Bharati, Subrato
    Ahmad, M. Omair
    Swamy, M. N. S.
    2024 IEEE 67TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, MWSCAS 2024, 2024, : 965 - 969
  • [17] A Transformer-Based Network for Anisotropic 3D Medical Image Segmentation
    Guo, Danfeng
    Terzopoulos, Demetri
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8857 - 8861
  • [18] 3D Thyroid Segmentation in CT Using Self-attention Convolutional Neural Network
    He, Xiuxiu
    Guo, Bang Jun
    Lei, Yang
    Liu, Yingzi
    Wang, Tonghe
    Curran, Walter J.
    Zhang, Long Jiang
    Liu, Tian
    Yang, Xiaofeng
    MEDICAL IMAGING 2020: COMPUTER-AIDED DIAGNOSIS, 2020, 11314
  • [19] Lightweight Vision Transformer with Spatial and Channel Enhanced Self-Attention
    Zheng, Jiahao
    Yang, Longqi
    Li, Yiying
    Yang, Ke
    Wang, Zhiyuan
    Zhou, Jun
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 1484 - 1488
  • [20] 3D Residual Networks with Channel-Spatial Attention Module for Action Recognition
    Yi, Ziwen
    Sun, Zhonghua
    Feng, Jinchao
    Jia, Kebin
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5171 - 5174