Dual Channel-Spatial Self-Attention Transformer and CNN synergy network for 3D medical image segmentation

被引:0
作者
Yang, Fan [1 ]
Wang, Bo [1 ]
机构
[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan 750021, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural layers; Attention collapse; Self-attention mechanism; Transformers; 3D medical image segmentation;
D O I
10.1016/j.asoc.2024.112255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Even though the Vision Transformer leverages the self-attention mechanism to capture long-range dependencies, showing significant potential in medical image segmentation, the limited annotations in the image dataset make it difficult for the Transformer model to extract different global features, resulting in attention collapse and generating similar or identical attention maps. Previous studies have attempted to solve the problem by integrating convolutional neural layers into Transformer-based architectures. However, improper integration may lead to the inability of the model to effectively capture local and global information in both spatial and channel dimensions. To address the above issue, we propose a hybrid architecture using the Dual Channel-Spatial SelfAttention Transformer and CNN Synergy Network (DTC-SUNETR) for medical image segmentation. Specifically, we redesigned the self-attention mechanism. A novel Channel-Spatial Self-Attention (CSSA) block is introduced to integrate the enhanced channel and spatial self-attention mechanism to capture the global relationship and local structure among image features. This helps the model to more comprehensively understand the interdependencies between different channels and capture the relationships between different pixels, thus enhancing the feature representation of the corresponding dimensions. Simultaneously, it also improves the overall computational efficiency of the network. Extensive experiments on four different medical image segmentation datasets, including Synapse, ACDC, Brain Tumor, and Lung Tumor, demonstrate the superiority of the proposed DTC-SUNETR over state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 43 条
  • [21] 3D Medical image segmentation using parallel transformers
    Yan, Qingsen
    Liu, Shengqiang
    Xu, Songhua
    Dong, Caixia
    Li, Zongfang
    Shi, Javen Qinfeng
    Zhang, Yanning
    Dai, Duwei
    PATTERN RECOGNITION, 2023, 138
  • [22] Emotional analysis of film and television reviews based on self-attention mechanism and dual-channel neural network
    Wang, Fugang
    Gong, Xueliang
    Wang, Xingkai
    Liu, Xuan
    Chen, Yu
    Liu, Zirui
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MODELING, NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING, CMNM 2024, 2024, : 86 - 90
  • [23] Multi-View 3D Reconstruction Method Based on Self-Attention Mechanism
    Zhu, Guangzhao
    Bo, Wei
    Yang, Afeng
    Xin, Xu
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (16)
  • [24] Sequential Spectral-Spatial Feature Convolution Network With Self-Attention for Remote Sensing Hyperspectral Image Classification
    Liu, Jiqing
    Wang, Han
    Liu, Renhe
    Wang, Shaochu
    Liu, Yu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [25] SCTANet: A Spatial Attention-Guided CNN-Transformer Aggregation Network for Deep Face Image Super-Resolution
    Bao, Qiqi
    Liu, Yunmeng
    Gang, Bowen
    Yang, Wenming
    Liao, Qingmin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8554 - 8565
  • [26] GFA-SMT: Geometric Feature Aggregation and Self-Attention in a Multi-Head Transformer for 3D Object Detection in Autonomous Vehicles
    Mushtaq, Husnain
    Deng, Xiaoheng
    Jiang, Ping
    Wan, Shaohua
    Ali, Mubashir
    Ullah, Irshad
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (03) : 3557 - 3573
  • [27] SAT-GCN: Self-attention graph convolutional network-based 3D object detection for autonomous driving
    Wang, Li
    Song, Ziying
    Zhang, Xinyu
    Wang, Chenfei
    Zhang, Guoxin
    Zhu, Lei
    Li, Jun
    Liu, Huaping
    KNOWLEDGE-BASED SYSTEMS, 2023, 259
  • [28] Effective Global Context Integration for Lightweight 3D Medical Image Segmentation
    Qiao, Qiang
    Qu, Meixia
    Wang, Wenyu
    Jiang, Bin
    Guo, Qiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (05) : 4661 - 4674
  • [29] DMCGNet: A Novel Network for Medical Image Segmentation With Dense Self-Mimic and Channel Grouping Mechanism
    Xie, Linsen
    Cai, Wentian
    Gao, Ying
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (10) : 5013 - 5024
  • [30] Domain-Guided Spatio-Temporal Self-Attention for Egocentric 3D Pose Estimation
    Park, Jinman
    Kaai, Kimathi
    Hossain, Saad
    Sumi, Norikatsu
    Rambhatla, Sirisha
    Fieguth, Paul
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 1837 - 1849