Medical Image Segmentation Review: The Success of U-Net

被引:146
作者
Azad, Reza [1 ,2 ]
Aghdam, Ehsan Khodapanah
Rauland, Amelie [1 ]
Jia, Yiwei [1 ]
Avval, Atlas Haddadi [3 ]
Bozorgpour, Afshin [2 ]
Karimijafarbigloo, Sanaz [2 ]
Cohen, Joseph Paul [4 ]
Adeli, Ehsan [5 ]
Merhof, Dorit [2 ,6 ]
机构
[1] Rhein Westfal TH Aachen, Fac Elect Engn, Informat Technol, D-52074 Aachen, Germany
[2] Univ Regensburg, Fac Informat & Data Sci, D-93053 Regensburg, Germany
[3] Mashhad Univ Med Sci, Sch Med, Mashhad 9177899191, Razavi Khorasan, Iran
[4] Stanford Univ, Ctr Artificial Intelligence Med & Imaging, Palo Alto, CA 94304 USA
[5] Stanford Univ, Stanford, CA 94305 USA
[6] Fraunhofer Inst Digital Med MEVIS, D-28359 Bremen, Germany
基金
美国国家科学基金会;
关键词
Image segmentation; Biomedical imaging; Taxonomy; Computer architecture; Feature extraction; Transformers; Task analysis; Convolutional neural network; deep learning; medical image segmentation; transformer; U-Net; UNET PLUS PLUS; NETWORK; CANCER; CLASSIFICATION; ARCHITECTURE; ATTENTION; LIVER;
D O I
10.1109/TPAMI.2024.3435571
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic medical image segmentation is a crucial topic in the medical domain and successively a critical counterpart in the computer-aided diagnosis paradigm. U-Net is the most widespread image segmentation architecture due to its flexibility, optimized modular design, and success in all medical image modalities. Over the years, the U-Net model has received tremendous attention from academic and industrial researchers who have extended it to address the scale and complexity created by medical tasks. These extensions are commonly related to enhancing the U-Net's backbone, bottleneck, or skip connections, or including representation learning, or combining it with a Transformer architecture, or even addressing probabilistic prediction of the segmentation map. Having a compendium of different previously proposed U-Net variants makes it easier for machine learning researchers to identify relevant research questions and understand the challenges of the biological tasks that challenge the model. In this work, we discuss the practical aspects of the U-Net model and organize each variant model into a taxonomy. Moreover, to measure the performance of these strategies in a clinical application, we propose fair evaluations of some unique and famous designs on well-known datasets. Furthermore, we provide a comprehensive implementation library with trained models. In addition, for ease of future studies, we created an online list of U-Net papers with their possible official implementation.
引用
收藏
页码:10076 / 10095
页数:20
相关论文
共 135 条
[1]  
Abraham N, 2019, I S BIOMED IMAGING, P683, DOI 10.1109/ISBI.2019.8759329
[2]  
Alakwaa W, 2017, INT J ADV COMPUT SC, V8, P409
[3]   Recurrent residual U-Net for medical image segmentation [J].
Alom, Md Zahangir ;
Yakopcic, Chris ;
Hasan, Mahmudul ;
Taha, Tarek M. ;
Asari, Vijayan K. .
JOURNAL OF MEDICAL IMAGING, 2019, 6 (01)
[4]  
[Anonymous], 2006, Pattern recognition and machine learning
[5]  
Antonelli M., 2022, Nat. Commun., V1, P1
[6]  
Asadi-Aghbolaghi M, 2020, Arxiv, DOI arXiv:2003.05056
[7]   U-Net with Spatial Pyramid Pooling for Drusen Segmentation in Optical Coherence Tomography [J].
Asgari, Rhona ;
Waldstein, Sebastian ;
Schlanitz, Ferdinand ;
Baratsits, Magdalena ;
Schmidt-Erfurth, Ursula ;
Bogunovic, Hrvoje .
OPHTHALMIC MEDICAL IMAGE ANALYSIS, 2019, 11855 :77-85
[8]   Deep semantic segmentation of natural and medical images: a review [J].
Asgari Taghanaki, Saeid ;
Abhishek, Kumar ;
Cohen, Joseph Paul ;
Cohen-Adad, Julien ;
Hamarneh, Ghassan .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) :137-178
[9]  
Azad R, 2023, Arxiv, DOI arXiv:2301.03505
[10]  
Azad R, 2022, Arxiv, DOI arXiv:2203.06217