CSWin-UNet: Transformer UNet with cross-shaped windows for medical image segmentation

被引:8
作者
Liu, Xiao [1 ]
Gao, Peng [1 ,3 ]
Yu, Tao [1 ]
Wang, Fei [2 ]
Yuan, Ru-Yue
机构
[1] Qufu Normal Univ, Sch Cyber Sci & Engn, Qufu, Peoples R China
[2] Harbin Inst Technol, Sch Integrated Circuits, Shenzhen, Peoples R China
[3] Yuntian Educ Grp, Hangzhou 253700, Peoples R China
关键词
Medical image segmentation; Deep learning; Attention mechanism; Neural network;
D O I
10.1016/j.inffus.2024.102634
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning, especially convolutional neural networks (CNNs) and Transformer architectures, have become the focus of extensive research in medical image segmentation, achieving impressive results. However, CNNs come with inductive biases that limit their effectiveness in more complex, varied segmentation scenarios. Conversely, while Transformer-based methods excel at capturing global and long-range semantic details, they suffer from high computational demands. In this study, we propose CSWin-UNet, a novel U-shaped segmentation method that incorporates the CSWin self-attention mechanism into the UNet to facilitate horizontal and vertical stripes self-attention. This method significantly enhances both computational efficiency and receptive field interactions. Additionally, our innovative decoder utilizes a content-aware reassembly operator that strategically reassembles features, guided by predicted kernels, for precise image resolution restoration. Our extensive empirical evaluations on diverse datasets, including synapse multi-organ CT, cardiac MRI, and skin lesions demonstrate that CSWin-UNet maintains low model complexity while delivering high segmentation accuracy.
引用
收藏
页数:12
相关论文
共 55 条
[1]   Liver Tumor Segmentation in CT Scans Using Modified SegNet [J].
Almotairi, Sultan ;
Kareem, Ghada ;
Aouf, Mohamed ;
Almutairi, Badr ;
Salem, Mohammed A-M .
SENSORS, 2020, 20 (05)
[2]   Automated brain tumor segmentation on multi-modal MR image using SegNet [J].
Alqazzaz, Salma ;
Sun, Xianfang ;
Yang, Xin ;
Nokes, Len .
COMPUTATIONAL VISUAL MEDIA, 2019, 5 (02) :209-219
[3]   Deep semantic segmentation of natural and medical images: a review [J].
Asgari Taghanaki, Saeid ;
Abhishek, Kumar ;
Cohen, Joseph Paul ;
Cohen-Adad, Julien ;
Hamarneh, Ghassan .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) :137-178
[4]   On the Texture Bias for Few-Shot CNN Segmentation [J].
Azad, Reza ;
Fayjie, Abdur R. ;
Kauffmann, Claude ;
Ben Ayed, Ismail ;
Pedersoli, Marco ;
Dolz, Jose .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, :2673-2682
[5]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[6]   Deep Learning Techniques for Automatic MRI Cardiac Multi-Structures Segmentation and Diagnosis: Is the Problem Solved? [J].
Bernard, Olivier ;
Lalande, Alain ;
Zotti, Clement ;
Cervenansky, Frederick ;
Yang, Xin ;
Heng, Pheng-Ann ;
Cetin, Irem ;
Lekadir, Karim ;
Camara, Oscar ;
Gonzalez Ballester, Miguel Angel ;
Sanroma, Gerard ;
Napel, Sandy ;
Petersen, Steffen ;
Tziritas, Georgios ;
Grinias, Elias ;
Khened, Mahendra ;
Kollerathu, Varghese Alex ;
Krishnamurthi, Ganapathy ;
Rohe, Marc-Michel ;
Pennec, Xavier ;
Sermesant, Maxime ;
Isensee, Fabian ;
Jaeger, Paul ;
Maier-Hein, Klaus H. ;
Full, Peter M. ;
Wolf, Ivo ;
Engelhardt, Sandy ;
Baumgartner, Christian F. ;
Koch, Lisa M. ;
Wolterink, Jelmer M. ;
Isgum, Ivana ;
Jang, Yeonggul ;
Hong, Yoonmi ;
Patravali, Jay ;
Jain, Shubham ;
Humbert, Olivier ;
Jodoin, Pierre-Marc .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (11) :2514-2525
[7]  
Bi Q, 2024, AAAI CONF ARTIF INTE, P801
[8]  
Bi Q, 2024, AAAI CONF ARTIF INTE, P810
[9]  
Bi Q, 2024, AAAI CONF ARTIF INTE, P819
[10]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9