CAT-Unet: An enhanced U-Net architecture with coordinate attention and skip-neighborhood attention transformer for medical image segmentation

被引:8
作者
Ding, Zhiquan [1 ,2 ]
Zhang, Yuejin [1 ,2 ]
Zhu, Chenxin [3 ]
Zhang, Guolong [1 ,2 ]
Li, Xiong [4 ]
Jiang, Nan [1 ]
Que, Yue [1 ]
Peng, Yuanyuan [5 ]
Guan, Xiaohui [6 ]
机构
[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China
[2] East China Jiaotong Univ, Inst Computat & Biomech, Nanchang 330013, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Sch Math & Phys, Suzhou 215028, Peoples R China
[4] East China Jiaotong Univ, Sch Software, Nanchang 330013, Peoples R China
[5] East China Jiaotong Univ, Sch Elect & Automat Engn, Nanchang 330013, Peoples R China
[6] Nanchang Univ, Natl Engn Res Ctr Bioengn Drugs & Technol, Nanchang, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; Neighborhood attention; Depthwise separable convolutions; Coordinate attention; PLUS PLUS; NETWORK;
D O I
10.1016/j.ins.2024.120578
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rise of deep learning, the U -Net network, based on a U-shaped architecture and skip connections, has found widespread application in various medical image segmentation tasks. However, the receptive field of the standard convolution operation is limited, because it is difficult to achieve global and long-distance semantic information interaction. Inspired by the advantages of ConvNext and Neighborhood Attention (NA), we propose CAT-Unet in this study to address the aforementioned challenges. We effectively reduce the number of parameters by utilizing large kernels and depthwise separable convolutions. Meanwhile, we introduce a Coordinate Attention (CA) module, which enables the model to learn more comprehensive and contextual information from surrounding regions. Furthermore, we introduce Skip -NAT (Neighborhood Attention Transformer) as the main algorithmic framework, replacing U-Net's original skipconnection layers, to lessen the impact of shallow features on network efficiency. Experimental results show that CAT-Unet achieves better segmentation results. On the ISIC2018 dataset, the best results for Dice (Dice Coefficient), IoU (Intersection over Union), and HD (Hausdorff Distance) are 90.26%, 83.58%, and 4.259, respectively. For the PH2 dataset, the best Dice, IoU, and HD results are 96.49%, 91.81%, and 3.971, respectively. Finally, on the DSB2018 dataset, the best Dice, IoU, and HD results are 94.58%, 88.78%, and 3.749, respectively.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Cross-Layer Connection SegFormer Attention U-Net for Efficient TRUS Image Segmentation
    Shi, Yongtao
    Du, Wei
    Gao, Chao
    Li, Xinzhi
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (05)
  • [42] AESC-TransUnet:Attention Enhanced Selective Channel Transformer U-Net for Medical Image SegmentationAESC-TransUnet:Attention Enhanced Selective Channel...W. Huang, H. Xiao
    Wenlei Huang
    Hongxiang Xiao
    [J]. Signal, Image and Video Processing, 2025, 19 (9)
  • [43] Swin-HAUnet: A Swin-Hierarchical Attention Unet For Enhanced Medical Image Segmentation
    Chen, Jiarong
    Zhang, Xuyang
    Li, Rongwen
    Zhou, Peng
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XIV, 2025, 15044 : 371 - 385
  • [44] DC-UNet: Rethinking the U-Net Architecture with Dual Channel Efficient CNN for Medical Images Segmentation
    Lou, Ange
    Guan, Shuyue
    Loew, Murray
    [J]. MEDICAL IMAGING 2021: IMAGE PROCESSING, 2021, 11596
  • [45] AFC-Unet: Attention-fused full-scale CNN-transformer unet for medical image segmentation
    Meng, Wenjie
    Liu, Shujun
    Wang, Huajun
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99
  • [46] EU-Net: Automatic U-Net neural architecture search with differential evolutionary algorithm for medical image segmentation
    Yu, Caiyang
    Wang, Yixi
    Tang, Chenwei
    Feng, Wentao
    Lv, Jiancheng
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 167
  • [47] MS-UNet: Swin Transformer U-Net with Multi-scale Nested Decoder for Medical Image Segmentation with Small Training Data
    Chen, Haoyuan
    Han, Yufei
    Li, Yanyi
    Xu, Pin
    Li, Kuan
    Yin, Jianping
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 472 - 483
  • [48] Multiscale transunet plus plus : dense hybrid U-Net with transformer for medical image segmentation
    Wang, Bo
    Wang, Fan
    Dong, Pengwei
    Li, Chongyi
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (06) : 1607 - 1614
  • [49] TransUMobileNet: Integrating multi-channel attention fusion with hybrid CNN-Transformer architecture for medical image segmentation
    Cai, Sijing
    Jiang, Yukun
    Xiao, Yuwei
    Zeng, Jian
    Zhou, Guangming
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 107
  • [50] Multi-scale-ResUNet: an improve u-net with multi-scale attention and hybrid dilation for medical image segmentation
    Tao Jin
    Zhen Wang
    [J]. Multimedia Tools and Applications, 2023, 82 : 28473 - 28492