Dual Channel-Spatial Self-Attention Transformer and CNN synergy network for 3D medical image segmentation

Cited by: 0
Authors
Yang, Fan [1]
Wang, Bo [1]
Affiliations
[1] Ningxia Univ, Sch Elect & Elect Engn, Yinchuan 750021, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Convolutional neural layers; Attention collapse; Self-attention mechanism; Transformers; 3D medical image segmentation
DOI
10.1016/j.asoc.2024.112255
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Even though the Vision Transformer leverages the self-attention mechanism to capture long-range dependencies, showing significant potential in medical image segmentation, the limited annotations in the image dataset make it difficult for the Transformer model to extract different global features, resulting in attention collapse and generating similar or identical attention maps. Previous studies have attempted to solve the problem by integrating convolutional neural layers into Transformer-based architectures. However, improper integration may lead to the inability of the model to effectively capture local and global information in both spatial and channel dimensions. To address the above issue, we propose a hybrid architecture using the Dual Channel-Spatial Self-Attention Transformer and CNN Synergy Network (DTC-SUNETR) for medical image segmentation. Specifically, we redesigned the self-attention mechanism. A novel Channel-Spatial Self-Attention (CSSA) block is introduced to integrate the enhanced channel and spatial self-attention mechanism to capture the global relationship and local structure among image features. This helps the model to more comprehensively understand the interdependencies between different channels and capture the relationships between different pixels, thus enhancing the feature representation of the corresponding dimensions. Simultaneously, it also improves the overall computational efficiency of the network. Extensive experiments on four different medical image segmentation datasets, including Synapse, ACDC, Brain Tumor, and Lung Tumor, demonstrate the superiority of the proposed DTC-SUNETR over state-of-the-art methods.
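The abstract does not give the internals of the CSSA block, so the following is only a generic sketch of what pairing spatial and channel self-attention can look like, not the authors' design. It assumes a flattened feature map of shape (N tokens, C channels), plain dot-product attention, and a simple residual sum to fuse the two branches; the function names, the weight layout, and the fusion rule are all hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def spatial_attention(x, wq, wk, wv):
    """Token-to-token attention; the attention map has shape (N, N)."""
    q, k, v = x @ wq, x @ wk, x @ wv
    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]), axis=-1)
    return attn @ v, attn


def channel_attention(x, wq, wk, wv):
    """Channel-to-channel attention; the attention map has shape (C, C)."""
    q, k, v = x @ wq, x @ wk, x @ wv
    attn = softmax(q.T @ k / np.sqrt(q.shape[0]), axis=-1)
    # Each output channel is a weighted mix of the value channels.
    return v @ attn.T, attn


def dual_attention(x, spatial_weights, channel_weights):
    """Hypothetical fusion: residual sum of both branches (an assumption)."""
    s_out, s_map = spatial_attention(x, *spatial_weights)
    c_out, c_map = channel_attention(x, *channel_weights)
    return x + s_out + c_out, s_map, c_map


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_tokens, n_channels = 8, 4  # e.g. a 2x2x2 patch grid, flattened
    x = rng.standard_normal((n_tokens, n_channels))
    weights = [rng.standard_normal((n_channels, n_channels)) for _ in range(6)]
    out, s_map, c_map = dual_attention(x, weights[:3], weights[3:])
    print(out.shape, s_map.shape, c_map.shape)
```

In this sketch the spatial map grows with the square of the token count, while the channel map depends only on the number of channels, which is one general reason channel-wise attention is cheaper on large 3D volumes.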
Pages: 12
Related papers
50 results in total
  • [1] CNN-Transformer and Channel-Spatial Attention based network for hyperspectral image classification with few samples
    Fu, Chuan
    Zhou, Tianyuan
    Guo, Tan
    Zhu, Qikui
    Luo, Fulin
    Du, Bo
    NEURAL NETWORKS, 2025, 186
  • [2] Multiscale fused network with additive channel-spatial attention for image segmentation
    Gao, Chengling
    Ye, Hailiang
    Cao, Feilong
    Wen, Chenglin
    Zhang, Qinghua
    Zhang, Feng
    KNOWLEDGE-BASED SYSTEMS, 2021, 214
  • [3] SPCTNet: A Series-Parallel CNN and Transformer Network for 3D Medical Image Segmentation
    Yu, Bin
    Zhou, Quan
    Zhang, Xuming
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 376 - 387
  • [4] VSmTrans: A hybrid paradigm integrating self-attention and convolution for 3D medical image segmentation
    Liu, Tiange
    Bai, Qingze
    Torigian, Drew A.
    Tong, Yubing
    Udupa, Jayaram K.
    MEDICAL IMAGE ANALYSIS, 2024, 98
  • [5] 3D medical image segmentation using the serial-parallel convolutional neural network and transformer based on cross-window self-attention
    Yu, Bin
    Zhou, Quan
    Yuan, Li
    Liang, Huageng
    Shcherbakov, Pavel
    Zhang, Xuming
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2025,
  • [6] FATUnetr: fully attention Transformer for 3D medical image segmentation
    Li, QingFeng
    Tong, Jigang
    Yang, Sen
    Du, Shengzhi
    2024 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, ICMA 2024, 2024, : 1415 - 1419
  • [7] CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
    Xie, Yutong
    Zhang, Jianpeng
    Shen, Chunhua
    Xia, Yong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 171 - 180
  • [8] Hybrid transformer-CNN with boundary-awareness network for 3D medical image segmentation
    He, Jianfei
    Xu, Canhui
    APPLIED INTELLIGENCE, 2023, 53 (23) : 28542 - 28554
  • [9] LW-CTrans: A lightweight hybrid network of CNN and Transformer for 3D medical image segmentation
    Kuang, Hulin
    Wang, Yahui
    Tana, Xianzhen
    Yang, Jialin
    Sun, Jiarui
    Liu, Jin
    Qiu, Wu
    Zhang, Jingyang
    Zhang, Jiulou
    Yang, Chunfeng
    Wang, Jianxin
    Chen, Yang
    MEDICAL IMAGE ANALYSIS, 2025, 102
  • [10] Hybrid transformer-CNN with boundary-awareness network for 3D medical image segmentation
    Jianfei He
    Canhui Xu
    Applied Intelligence, 2023, 53 : 28542 - 28554