CSWin-UNet: Transformer UNet with cross-shaped windows for medical image segmentation

被引:8
|
作者
Liu, Xiao [1 ]
Gao, Peng [1 ,3 ]
Yu, Tao [1 ]
Wang, Fei [2 ]
Yuan, Ru-Yue
机构
[1] Qufu Normal Univ, Sch Cyber Sci & Engn, Qufu, Peoples R China
[2] Harbin Inst Technol, Sch Integrated Circuits, Shenzhen, Peoples R China
[3] Yuntian Educ Grp, Hangzhou 253700, Peoples R China
关键词
Medical image segmentation; Deep learning; Attention mechanism; Neural network;
D O I
10.1016/j.inffus.2024.102634
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning, especially convolutional neural networks (CNNs) and Transformer architectures, have become the focus of extensive research in medical image segmentation, achieving impressive results. However, CNNs come with inductive biases that limit their effectiveness in more complex, varied segmentation scenarios. Conversely, while Transformer-based methods excel at capturing global and long-range semantic details, they suffer from high computational demands. In this study, we propose CSWin-UNet, a novel U-shaped segmentation method that incorporates the CSWin self-attention mechanism into the UNet to facilitate horizontal and vertical stripes self-attention. This method significantly enhances both computational efficiency and receptive field interactions. Additionally, our innovative decoder utilizes a content-aware reassembly operator that strategically reassembles features, guided by predicted kernels, for precise image resolution restoration. Our extensive empirical evaluations on diverse datasets, including synapse multi-organ CT, cardiac MRI, and skin lesions demonstrate that CSWin-UNet maintains low model complexity while delivering high segmentation accuracy.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] GCCSwin-UNet: Global Context and Cross-Shaped Windows Vision Transformer Network for Polyp Segmentation
    Zhu, Jianbo
    Ge, Mingfeng
    Chang, Zhimin
    Dong, Wenfei
    PROCESSES, 2023, 11 (04)
  • [2] TransCUNet: UNet cross fused transformer for medical image segmentation
    Jiang, Shen
    Li, Jinjiang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [3] CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows
    Dong, Xiaoyi
    Bao, Jianmin
    Chen, Dongdong
    Zhang, Weiming
    Yu, Nenghai
    Yuan, Lu
    Chen, Dong
    Guo, Baining
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12114 - 12124
  • [4] AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation
    Yan, Xiangyi
    Tang, Hao
    Sun, Shanlin
    Ma, Haoyu
    Kong, Deying
    Xie, Xiaohui
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3270 - 3280
  • [5] SMESwin Unet: Merging CNN and Transformer for Medical Image Segmentation
    Wang, Ziheng
    Min, Xiongkuo
    Shi, Fangyu
    Jin, Ruinian
    Nawrin, Saida S.
    Yu, Ichen
    Nagatomi, Ryoichi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 517 - 526
  • [6] MCAT-UNet: Convolutional and Cross-Shaped Window Attention Enhanced UNet for Efficient High-Resolution Remote Sensing Image Segmentation
    Wang, Tao
    Xu, Chao
    Liu, Bin
    Yang, Guang
    Zhang, Erlei
    Niu, Dangdang
    Zhang, Hongming
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 9745 - 9758
  • [7] Cross-shaped Separated Spatial-Temporal UNet Transformer For Accurate Channel Prediction
    Kang, Hua
    Hu, Qingyong
    Chen, Huangxun
    Huang, Qianyi
    Zhang, Qian
    Cheng, Min
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 2079 - 2088
  • [8] ConvWin-UNet: UNet-like hierarchical vision Transformer combined with convolution for medical image segmentation
    Feng, Xiaomeng
    Wang, Taiping
    Yang, Xiaohang
    Zhang, Minfei
    Guo, Wanpeng
    Wang, Weina
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (01) : 128 - 144
  • [9] Quantity Estimation Method of Warehousing Cargo Based on CSwin-Unet and Pixel Position Weight
    Wu, Jiehao
    Zhang, Guangyuan
    Li, Kefeng
    Wang, Peng
    Li, Zhikang
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 370 - 374
  • [10] A novel full-convolution UNet-transformer for medical image segmentation
    Zhu, Tianyou
    Ding, Derui
    Wang, Feng
    Liang, Wei
    Wang, Bo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89