Advancements in medical image segmentation: A review of transformer models

被引:6
作者
Kumar, S. S. [1 ]
机构
[1] NICHE, Dept EIE, Kumarakoil 629180, India
关键词
Medical image segmentation; Transformer models; Deep learning; Healthcare; Anatomical structures; NETWORK; NET; DIAGNOSIS; FUSION; LIVER;
D O I
10.1016/j.compeleceng.2025.110099
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Medical image segmentation is crucial for precise diagnosis, treatment planning, and disease monitoring in healthcare. Traditional methods often struggle with the complexity and variability inherent in medical images. However, recent advancements in deep learning, particularly Transformer models, have revolutionized the field. This comprehensive review explores the transformative impact of Transformer models on medical image segmentation. Beginning with an overview of the limitations of traditional approaches, the review introduces foundational Transformer architectures such as the Vision Transformer, Swin Transformer, and Pyramid Vision Transformer. Systematically categorizing Transformer-based segmentation techniques, it delves into their applications across diverse medical imaging tasks, including brain tumor segmentation, polyp detection, cardiac segmentation, and more. Additionally, the review examines the challenges and considerations in benchmarking Transformer models using evaluation metrics and benchmark datasets. By analyzing current research trends and insights, this review provides valuable guidance for researchers and practitioners seeking to harness the power of Transformer models in medical image segmentation.
引用
收藏
页数:51
相关论文
共 235 条
[11]   Swin Unet3D: a three-dimensional medical image segmentation network combining vision transformer and convolution [J].
Cai, Yimin ;
Long, Yuqing ;
Han, Zhenggong ;
Liu, Mingkun ;
Zheng, Yuchen ;
Yang, Wei ;
Chen, Liming .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
[12]   FlowgateUNet: Dental CT image segmentation network based on FlowFormer and gated attention [J].
Cao, Danhua ;
Cai, Biao ;
Liu, Mingzhe .
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (02) :1175-1182
[13]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[14]   Maxillary sinus detection on cone beam computed tomography images using ResNet and Swin Transformer-based UNet [J].
Celebi, Adalet ;
Imak, Andac ;
Uzen, Huseyin ;
Budak, Umit ;
Turkoglu, Muammer ;
Hanbay, Davut ;
Sengur, Abdulkadir .
ORAL SURGERY ORAL MEDICINE ORAL PATHOLOGY ORAL RADIOLOGY, 2024, 138 (01) :149-161
[15]   Deep Learning in Medical Image Analysis [J].
Chan, Heang-Ping ;
Samala, Ravi K. ;
Hadjiiski, Lubomir M. ;
Zhou, Chuan .
DEEP LEARNING IN MEDICAL IMAGE ANALYSIS: CHALLENGES AND APPLICATIONS, 2020, 1213 :3-21
[16]   TransAttUnet: Multi-Level Attention-Guided U-Net With Transformer for Medical Image Segmentation [J].
Chen, Bingzhi ;
Liu, Yishu ;
Zhang, Zheng ;
Lu, Guangming ;
Kong, Adams Wai Kin .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01) :55-68
[17]   PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation [J].
Chen, Danny ;
Yang, Wenzhong ;
Wang, Liejun ;
Tan, Sixiang ;
Lin, Jiangzhaung ;
Bu, Wenxiu .
PLOS ONE, 2022, 17 (01)
[18]   Joint Segmentation and Differential Diagnosis of Thyroid Nodule in Contrast-Enhanced Ultrasound Images [J].
Chen, Fang ;
Han, Haojie ;
Wan, Peng ;
Liao, Hongen ;
Liu, Chunrui ;
Zhang, Daoqiang .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2023, 70 (09) :2722-2732
[19]  
Chen J., 2021, PREPRINT
[20]   Transformer-based multilevel region and edge aggregation network for magnetic resonance image segmentation [J].
Chen, Shaolong ;
Zhong, Lijie ;
Qiu, Changzhen ;
Zhang, Zhiyong ;
Zhang, Xiaodong .
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 152