CAT-Unet: An enhanced U-Net architecture with coordinate attention and skip-neighborhood attention transformer for medical image segmentation

被引:8
作者
Ding, Zhiquan [1 ,2 ]
Zhang, Yuejin [1 ,2 ]
Zhu, Chenxin [3 ]
Zhang, Guolong [1 ,2 ]
Li, Xiong [4 ]
Jiang, Nan [1 ]
Que, Yue [1 ]
Peng, Yuanyuan [5 ]
Guan, Xiaohui [6 ]
机构
[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China
[2] East China Jiaotong Univ, Inst Computat & Biomech, Nanchang 330013, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Sch Math & Phys, Suzhou 215028, Peoples R China
[4] East China Jiaotong Univ, Sch Software, Nanchang 330013, Peoples R China
[5] East China Jiaotong Univ, Sch Elect & Automat Engn, Nanchang 330013, Peoples R China
[6] Nanchang Univ, Natl Engn Res Ctr Bioengn Drugs & Technol, Nanchang, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; Neighborhood attention; Depthwise separable convolutions; Coordinate attention; PLUS PLUS; NETWORK;
D O I
10.1016/j.ins.2024.120578
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rise of deep learning, the U -Net network, based on a U-shaped architecture and skip connections, has found widespread application in various medical image segmentation tasks. However, the receptive field of the standard convolution operation is limited, because it is difficult to achieve global and long-distance semantic information interaction. Inspired by the advantages of ConvNext and Neighborhood Attention (NA), we propose CAT-Unet in this study to address the aforementioned challenges. We effectively reduce the number of parameters by utilizing large kernels and depthwise separable convolutions. Meanwhile, we introduce a Coordinate Attention (CA) module, which enables the model to learn more comprehensive and contextual information from surrounding regions. Furthermore, we introduce Skip -NAT (Neighborhood Attention Transformer) as the main algorithmic framework, replacing U-Net's original skipconnection layers, to lessen the impact of shallow features on network efficiency. Experimental results show that CAT-Unet achieves better segmentation results. On the ISIC2018 dataset, the best results for Dice (Dice Coefficient), IoU (Intersection over Union), and HD (Hausdorff Distance) are 90.26%, 83.58%, and 4.259, respectively. For the PH2 dataset, the best Dice, IoU, and HD results are 96.49%, 91.81%, and 3.971, respectively. Finally, on the DSB2018 dataset, the best Dice, IoU, and HD results are 94.58%, 88.78%, and 3.749, respectively.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] U-Net Transformer: Self and Cross Attention for Medical Image Segmentation
    Petit, Olivier
    Thome, Nicolas
    Rambour, Clement
    Themyr, Loic
    Collins, Toby
    Soler, Luc
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 267 - 276
  • [2] GA-UNet: A Lightweight Ghost and Attention U-Net for Medical Image Segmentation
    Pang, Bo
    Chen, Lianghong
    Tao, Qingchuan
    Wang, Enhui
    Yu, Yanmei
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (04): : 1874 - 1888
  • [3] Residual-Attention UNet plus plus : A Nested Residual-Attention U-Net for Medical Image Segmentation
    Li, Zan
    Zhang, Hong
    Li, Zhengzhen
    Ren, Zuyue
    APPLIED SCIENCES-BASEL, 2022, 12 (14):
  • [4] Hybrid dilation and attention residual U-Net for medical image segmentation
    Wang, Zekun
    Zou, Yanni
    Liu, Peter X.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 134
  • [5] Multiresolution Aggregation Transformer UNet Based on Multiscale Input and Coordinate Attention for Medical Image Segmentation
    Chen, Shaolong
    Qiu, Changzhen
    Yang, Weiping
    Zhang, Zhiyong
    SENSORS, 2022, 22 (10)
  • [6] Hybrid Swin Deformable Attention U-Net for Medical Image Segmentation
    Wang, Lichao
    Huang, Jiahao
    Xing, Xiaodan
    Yang, Guang
    2023 19TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, SIPAIM, 2023,
  • [7] TransAttUnet: Multi-Level Attention-Guided U-Net With Transformer for Medical Image Segmentation
    Chen, Bingzhi
    Liu, Yishu
    Zhang, Zheng
    Lu, Guangming
    Kong, Adams Wai Kin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 55 - 68
  • [8] Enhanced medical image segmentation using U-Net with residual connections and dual attention mechanism
    Xiao, Leyi
    Song, Jiaojiao
    Xie, Xia
    Fan, Chaodong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 153
  • [9] Comparative Analysis of U-Net with Transfer Learning and Attention Mechanism for Enhanced Medical Image Segmentation
    El Abassi, Fouzia
    Darouichi, Aziz
    Ouaarab, Aziz
    DIGITAL TECHNOLOGIES AND APPLICATIONS, ICDTA 2024, VOL 2, 2024, 1099 : 551 - 560
  • [10] MIXED TRANSFORMER U-NET FOR MEDICAL IMAGE SEGMENTATION
    Wang, Hongyi
    Xie, Shiao
    Lin, Lanfen
    Iwamoto, Yutaro
    Han, Xian-Hua
    Chen, Yen-Wei
    Tong, Ruofeng
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2390 - 2394