CAT-Unet: An enhanced U-Net architecture with coordinate attention and skip-neighborhood attention transformer for medical image segmentation

被引:8
作者
Ding, Zhiquan [1 ,2 ]
Zhang, Yuejin [1 ,2 ]
Zhu, Chenxin [3 ]
Zhang, Guolong [1 ,2 ]
Li, Xiong [4 ]
Jiang, Nan [1 ]
Que, Yue [1 ]
Peng, Yuanyuan [5 ]
Guan, Xiaohui [6 ]
机构
[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China
[2] East China Jiaotong Univ, Inst Computat & Biomech, Nanchang 330013, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Sch Math & Phys, Suzhou 215028, Peoples R China
[4] East China Jiaotong Univ, Sch Software, Nanchang 330013, Peoples R China
[5] East China Jiaotong Univ, Sch Elect & Automat Engn, Nanchang 330013, Peoples R China
[6] Nanchang Univ, Natl Engn Res Ctr Bioengn Drugs & Technol, Nanchang, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; Neighborhood attention; Depthwise separable convolutions; Coordinate attention; PLUS PLUS; NETWORK;
D O I
10.1016/j.ins.2024.120578
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rise of deep learning, the U -Net network, based on a U-shaped architecture and skip connections, has found widespread application in various medical image segmentation tasks. However, the receptive field of the standard convolution operation is limited, because it is difficult to achieve global and long-distance semantic information interaction. Inspired by the advantages of ConvNext and Neighborhood Attention (NA), we propose CAT-Unet in this study to address the aforementioned challenges. We effectively reduce the number of parameters by utilizing large kernels and depthwise separable convolutions. Meanwhile, we introduce a Coordinate Attention (CA) module, which enables the model to learn more comprehensive and contextual information from surrounding regions. Furthermore, we introduce Skip -NAT (Neighborhood Attention Transformer) as the main algorithmic framework, replacing U-Net's original skipconnection layers, to lessen the impact of shallow features on network efficiency. Experimental results show that CAT-Unet achieves better segmentation results. On the ISIC2018 dataset, the best results for Dice (Dice Coefficient), IoU (Intersection over Union), and HD (Hausdorff Distance) are 90.26%, 83.58%, and 4.259, respectively. For the PH2 dataset, the best Dice, IoU, and HD results are 96.49%, 91.81%, and 3.971, respectively. Finally, on the DSB2018 dataset, the best Dice, IoU, and HD results are 94.58%, 88.78%, and 3.749, respectively.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] OAU-net: Outlined Attention U-net for biomedical image segmentation
    Song, Haojie
    Wang, Yuefei
    Zeng, Shijie
    Guo, Xiaoyan
    Li, Zheheng
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [22] Focusing the View: Enhancing U-Net with Convolutional Block Attention for Superior Medical Image Segmentation
    Nhu-Tai Do
    Dat Nguyen Khanh
    Tram-Tran Nguyen-Quynh
    Quoc-Huy Nguyen
    INTELLIGENCE OF THINGS: TECHNOLOGIES AND APPLICATIONS, ICIT 2024, VOL 2, 2025, 230 : 156 - 165
  • [23] Double attention U-Net for brain tumor MR image segmentation
    Li, Na
    Ren, Kai
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2021, 14 (03) : 467 - 479
  • [24] Enhancing medical image segmentation with a multi-transformer U-Net
    Dan, Yongping
    Jin, Weishou
    Yue, Xuebin
    Wang, Zhida
    PEERJ, 2024, 12
  • [25] TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation
    Song, Pengfei
    Li, Jinjiang
    Fan, Hui
    Fan, Linwei
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 167
  • [26] DCSAU-Net: A deeper and more compact split-attention U-Net for medical image segmentation
    Xu, Qing
    Ma, Zhicheng
    He, Na
    Duan, Wenting
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 154
  • [27] MAGRes-UNet: Improved Medical Image Segmentation Through a Deep Learning Paradigm of Multi-Attention Gated Residual U-Net
    Hussain, Tahir
    Shouno, Hayaru
    IEEE ACCESS, 2024, 12 : 40290 - 40310
  • [28] ANU-Net: Attention-based nested U-Net to exploit full resolution features for medical image segmentation
    Li, Chen
    Tan, Yusong
    Chen, Wei
    Luo, Xin
    He, Yulin
    Gao, Yuanming
    Li, Fei
    COMPUTERS & GRAPHICS-UK, 2020, 90 : 11 - 20
  • [29] Attention-guided hierarchical fusion U-Net for uncertainty-driven medical image segmentation
    Munia, Afsana Ahmed
    Abdar, Moloud
    Hasan, Mehedi
    Jalali, Mohammad S.
    Banerjee, Biplab
    Khosravi, Abbas
    Hossain, Ibrahim
    Fu, Huazhu
    Frangi, Alejandro F.
    INFORMATION FUSION, 2025, 115
  • [30] Cross Pyramid Transformer makes U-net stronger in medical image segmentation
    Zhu, Jinghua
    Sheng, Yue
    Cui, Hui
    Ma, Jiquan
    Wang, Jijian
    Xi, Heran
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86