CAT-Unet: An enhanced U-Net architecture with coordinate attention and skip-neighborhood attention transformer for medical image segmentation

被引:8
作者
Ding, Zhiquan [1 ,2 ]
Zhang, Yuejin [1 ,2 ]
Zhu, Chenxin [3 ]
Zhang, Guolong [1 ,2 ]
Li, Xiong [4 ]
Jiang, Nan [1 ]
Que, Yue [1 ]
Peng, Yuanyuan [5 ]
Guan, Xiaohui [6 ]
机构
[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China
[2] East China Jiaotong Univ, Inst Computat & Biomech, Nanchang 330013, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Sch Math & Phys, Suzhou 215028, Peoples R China
[4] East China Jiaotong Univ, Sch Software, Nanchang 330013, Peoples R China
[5] East China Jiaotong Univ, Sch Elect & Automat Engn, Nanchang 330013, Peoples R China
[6] Nanchang Univ, Natl Engn Res Ctr Bioengn Drugs & Technol, Nanchang, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; Neighborhood attention; Depthwise separable convolutions; Coordinate attention; PLUS PLUS; NETWORK;
D O I
10.1016/j.ins.2024.120578
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rise of deep learning, the U -Net network, based on a U-shaped architecture and skip connections, has found widespread application in various medical image segmentation tasks. However, the receptive field of the standard convolution operation is limited, because it is difficult to achieve global and long-distance semantic information interaction. Inspired by the advantages of ConvNext and Neighborhood Attention (NA), we propose CAT-Unet in this study to address the aforementioned challenges. We effectively reduce the number of parameters by utilizing large kernels and depthwise separable convolutions. Meanwhile, we introduce a Coordinate Attention (CA) module, which enables the model to learn more comprehensive and contextual information from surrounding regions. Furthermore, we introduce Skip -NAT (Neighborhood Attention Transformer) as the main algorithmic framework, replacing U-Net's original skipconnection layers, to lessen the impact of shallow features on network efficiency. Experimental results show that CAT-Unet achieves better segmentation results. On the ISIC2018 dataset, the best results for Dice (Dice Coefficient), IoU (Intersection over Union), and HD (Hausdorff Distance) are 90.26%, 83.58%, and 4.259, respectively. For the PH2 dataset, the best Dice, IoU, and HD results are 96.49%, 91.81%, and 3.971, respectively. Finally, on the DSB2018 dataset, the best Dice, IoU, and HD results are 94.58%, 88.78%, and 3.749, respectively.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Multiscale transunet +  + : dense hybrid U-Net with transformer for medical image segmentation
    Bo Wang
    ·Fan Wang
    Pengwei Dong
    ·Chongyi Li
    Signal, Image and Video Processing, 2022, 16 : 1607 - 1614
  • [32] GSAC-UFormer: Groupwise Self-Attention Convolutional Transformer-Based UNet for Medical Image Segmentation
    Garbaz, Anass
    Oukdach, Yassine
    Charfi, Said
    El Ansari, Mohamed
    Koutti, Lahcen
    Salihoun, Mouna
    COGNITIVE COMPUTATION, 2025, 17 (02)
  • [33] ST-Unet: Swin Transformer boosted U-Net with Cross-Layer Feature Enhancement for medical image segmentation
    Zhang, Jing
    Qin, Qiuge
    Ye, Qi
    Ruan, Tong
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
  • [34] Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation
    Wang, Haonan
    Cao, Peng
    Yang, Jinzhu
    Zaiane, Osmar
    NEURAL NETWORKS, 2024, 178
  • [35] MP-FocalUNet: Multiscale parallel focal self-attention U-Net for medical image segmentation
    Wang, Chuan
    Jiang, Mingfeng
    Li, Yang
    Wei, Bo
    Li, Yongming
    Wang, Pin
    Yang, Guang
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2025, 260
  • [36] Multi-scale-ResUNet: an improve u-net with multi-scale attention and hybrid dilation for medical image segmentation
    Jin, Tao
    Wang, Zhen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (18) : 28473 - 28492
  • [37] A Method of Steel Bar Image Segmentation Based on Multi-Attention U-Net
    Shi, Jie
    Wu, Kunpeng
    Yang, Chaolin
    Deng, Nenghui
    IEEE ACCESS, 2021, 9 : 13304 - 13313
  • [38] Enhancing U-Net with Spatial-Channel Attention Gate for Abnormal Tissue Segmentation in Medical Imaging
    Trinh Le Ba Khanh
    Duy-Phuong Dao
    Ngoc-Huynh Ho
    Yang, Hyung-Jeong
    Baek, Eu-Tteum
    Lee, Gueesang
    Kim, Soo-Hyung
    Yoo, Seok Bong
    APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [39] DS-TransUNet: Dual Swin Transformer U-Net for Medical Image Segmentation
    Lin, Ailiang
    Chen, Bingzhi
    Xu, Jiayu
    Zhang, Zheng
    Lu, Guangming
    Zhang, David
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [40] RefineU-Net: Improved U-Net with progressive global feedbacks and residual attention guided local refinement for medical image segmentation
    Lin, Dongyun
    Li, Yiqun
    Nwe, Tin Lay
    Dong, Sheng
    Oo, Zaw Min
    PATTERN RECOGNITION LETTERS, 2020, 138 : 267 - 275