LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation

Cited by: 124
Authors
Xu, Guoping [1]
Zhang, Xuan [1]
He, Xinwei [2]
Wu, Xinglong [1]
Affiliations
[1] Wuhan Inst Technol, Sch Comp Sci & Engn, Hubei Key Lab Intelligent Robot, Wuhan 430205, Hubei, Peoples R China
[2] Huazhong Agr Univ, Coll Informat, Wuhan 430070, Hubei, Peoples R China
Keywords
Medical Image Segmentation; Transformer; Convolutional Neural Network
DOI
10.1007/978-981-99-8543-2_4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Medical image segmentation plays an essential role in developing computer-assisted diagnosis and treatment systems, yet it still faces numerous challenges. In the past few years, Convolutional Neural Networks (CNNs) have been successfully applied to medical image segmentation. However, because convolution operations are local, CNN-based architectures are limited in learning global context information, which may be crucial to segmentation quality. Vision Transformer (ViT) architectures, by contrast, have a remarkable ability to extract long-range semantic features, but at the cost of high computational complexity. To make medical image segmentation more efficient and accurate, we present a novel lightweight architecture named LeViT-UNet, which integrates multi-stage Transformer blocks into the encoder via LeViT, aiming to explore the effectiveness of fusing local and global features. Our experiments on two challenging segmentation benchmarks indicate that the proposed LeViT-UNet achieves competitive performance compared with various state-of-the-art methods in terms of efficiency and accuracy, suggesting that LeViT can serve as a faster feature encoder for medical image segmentation. LeViT-UNet-384, for instance, achieves Dice similarity coefficients (DSC) of 78.53% and 90.32% on the Synapse and ACDC datasets, respectively, with a segmentation speed of 85 frames per second (FPS). Therefore, the proposed architecture could be beneficial for prospective clinical trials conducted by radiologists. Our source code is publicly available at https://github.com/apple1986/LeViT_UNet.
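For readers unfamiliar with the Dice similarity coefficient (DSC) reported in the abstract, the following is a minimal sketch of the standard definition, DSC = 2|A ∩ B| / (|A| + |B|), computed per label and averaged. It is not taken from the authors' repository; the function names `dice_coefficient` and `mean_dice`, the flat-list representation of label maps, and the convention of scoring 1.0 when a label is absent from both maps are all assumptions made for illustration.

```python
from typing import Sequence


def dice_coefficient(pred: Sequence[int], target: Sequence[int], label: int) -> float:
    """Dice similarity coefficient for one label over two flattened label maps."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    pred_count = sum(1 for p in pred if p == label)
    target_count = sum(1 for t in target if t == label)
    # Pixels that both maps assign to this label.
    overlap = sum(1 for p, t in zip(pred, target) if p == label and t == label)
    denom = pred_count + target_count
    # Assumed convention: a label absent from both maps counts as a perfect match.
    return 1.0 if denom == 0 else 2.0 * overlap / denom


def mean_dice(pred: Sequence[int], target: Sequence[int], labels: Sequence[int]) -> float:
    """Average the per-label Dice score over the given foreground labels."""
    return sum(dice_coefficient(pred, target, lab) for lab in labels) / len(labels)


# Toy flattened label maps (0 = background, 1 and 2 = hypothetical organ labels).
pred = [0, 1, 1, 2, 2, 2, 0, 0]
target = [0, 1, 1, 1, 2, 2, 0, 2]

print(dice_coefficient(pred, target, 1))  # 0.8
print(mean_dice(pred, target, [1, 2]))
```

A percentage such as 78.53% corresponds to this score multiplied by 100 and averaged over a test set; the exact averaging protocol used in the paper is not stated in this record.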
Pages: 42-53
Page count: 12
Related papers
50 in total
  • [31] Light-UNet: An Efficient Segmentation Network for Medical Image
    Zhang, Yue
    Xu, Chao
    Zhang, Zhifan
    Wang, Jianjun
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 302 - 313
  • [32] Semantic Segmentation in Medical Image Based on Hybrid Dlinknet and Unet
    Samudrala, Suresh
    Mohan, C. Krishna
    3rd IEEE 2022 International Conference on Computing, Communication, and Intelligent Systems, ICCCIS 2022, 2022, : 42 - 47
  • [33] Vision Mamba and xLSTM-UNet for medical image segmentation
    Zhong, Xin
    Lu, Gehao
    Li, Hao
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [34] Medical Image Segmentation Using Transformer Networks
    Karimi, Davood
    Dou, Haoran
    Gholipour, Ali
    IEEE ACCESS, 2022, 10 : 29322 - 29332
  • [35] Hybrid Transformer and Convolution for Medical Image Segmentation
    Wang, Fan
    Wang, Bo
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 156 - 159
  • [36] Hybrid Shunted Transformer embedding UNet for remote sensing image semantic segmentation
    Zhou H.
    Xiao X.
    Li H.
    Liu X.
    Liang P.
    Neural Computing and Applications, 2024, 36 (25) : 15705 - 15720
  • [37] ATFormer: Advanced transformer for medical image segmentation
    Chen, Yong
    Lu, Xuesong
    Xie, Qinlan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [38] The Fully Convolutional Transformer for Medical Image Segmentation
    Tragakis, Athanasios
    Kaul, Chaitanya
    Murray-Smith, Roderick
    Husmeier, Dirk
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3649 - 3658
  • [39] Automatic Medical Image Segmentation with Vision Transformer
    Zhang, Jie
    Li, Fan
    Zhang, Xin
    Wang, Huaijun
    Hei, Xinhong
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [40] Coformer: Collaborative Transformer for Medical Image Segmentation
    Gao, Yufei
    Zhang, Shichao
    Zhang, Dandan
    Shi, Yucheng
    Zhao, Guohua
    Shi, Lei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14864 : 240 - 250