nnU-Net Revisited: A Call for Rigorous Validation in 3D Medical Image Segmentation

被引:29
作者
Isensee, Fabian [1 ,3 ]
Wald, Tassilo [1 ,3 ,7 ]
Ulrich, Constantin [1 ,5 ,6 ]
Baumgartner, Michael [1 ,3 ,7 ]
Roy, Saikat [1 ]
Maier-Hein, Klaus [1 ,3 ,4 ,5 ,6 ,7 ]
Jaeger, Paul F. [2 ,3 ]
机构
[1] German Canc Res Ctr, Div Med Image Comp, Heidelberg, Germany
[2] DKFZ, Interact Machine Learning Grp IML, Heidelberg, Germany
[3] DKFZ, Helmholtz Imaging, Heidelberg, Germany
[4] Heidelberg Univ Hosp, Dept Radiat Oncol, Pattern Anal & Learning Grp, Heidelberg, Germany
[5] Natl Ctr Tumor Dis NCT Heidelberg, Heidelberg, Germany
[6] Heidelberg Univ, Med Fac Heidelberg, Heidelberg, Germany
[7] Heidelberg Univ, Fac Math & Comp Sci, Heidelberg, Germany
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IX | 2024年 / 15009卷
关键词
Medical Image Segmentation; Validation; Benchmark; TRANSFORMER;
D O I
10.1007/978-3-031-72114-4_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The release of nnU-Net marked a paradigm shift in 3D medical image segmentation, demonstrating that a properly configured U-Net architecture could still achieve state-of-the-art results. Despite this, the pursuit of novel architectures, and the respective claims of superior performance over the U-Net baseline, continued. In this study, we demonstrate that many of these recent claims fail to hold up when scrutinized for common validation shortcomings, such as the use of inadequate baselines, insufficient datasets, and neglected computational resources. By meticulously avoiding these pitfalls, we conduct a thorough and comprehensive benchmarking of current segmentation methods including CNN-based, Transformer-based, and Mamba-based approaches. In contrast to current beliefs, we find that the recipe for state-of-the-art performance is 1) employing CNN-based U-Net models, including ResNet and ConvNeXt variants, 2) using the nnU-Net framework, and 3) scaling models to modern hardware resources. These results indicate an ongoing innovation bias towards novel architectures in the field and underscore the need for more stringent validation standards in the quest for scientific progress.
引用
收藏
页码:488 / 498
页数:11
相关论文
共 40 条
[1]   The Medical Segmentation Decathlon [J].
Antonelli, Michela ;
Reinke, Annika ;
Bakas, Spyridon ;
Farahani, Keyvan ;
Kopp-Schneider, Annette ;
Landman, Bennett A. ;
Litjens, Geert ;
Menze, Bjoern ;
Ronneberger, Olaf ;
Summers, Ronald M. ;
van Ginneken, Bram ;
Bilello, Michel ;
Bilic, Patrick ;
Christ, Patrick F. ;
Do, Richard K. G. ;
Gollub, Marc J. ;
Heckers, Stephan H. ;
Huisman, Henkjan ;
Jarnagin, William R. ;
McHugo, Maureen K. ;
Napel, Sandy ;
Pernicka, Jennifer S. Golia ;
Rhode, Kawal ;
Tobon-Gomez, Catalina ;
Vorontsov, Eugene ;
Meakin, James A. ;
Ourselin, Sebastien ;
Wiesenfarth, Manuel ;
Arbelaez, Pablo ;
Bae, Byeonguk ;
Chen, Sihong ;
Daza, Laura ;
Feng, Jianjiang ;
He, Baochun ;
Isensee, Fabian ;
Ji, Yuanfeng ;
Jia, Fucang ;
Kim, Ildoo ;
Maier-Hein, Klaus ;
Merhof, Dorit ;
Pai, Akshay ;
Park, Beomhee ;
Perslev, Mathias ;
Rezaiifar, Ramin ;
Rippel, Oliver ;
Sarasua, Ignacio ;
Shen, Wei ;
Son, Jaemin ;
Wachinger, Christian ;
Wang, Liansheng .
NATURE COMMUNICATIONS, 2022, 13 (01)
[2]  
Baid U, 2021, Arxiv, DOI arXiv:2107.02314
[3]   Data Descriptor: Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features [J].
Bakas, Spyridon ;
Akbari, Hamed ;
Sotiras, Aristeidis ;
Bilello, Michel ;
Rozycki, Martin ;
Kirby, Justin S. ;
Freymann, John B. ;
Farahani, Keyvan ;
Davatzikos, Christos .
SCIENTIFIC DATA, 2017, 4
[4]   Deep Learning Techniques for Automatic MRI Cardiac Multi-Structures Segmentation and Diagnosis: Is the Problem Solved? [J].
Bernard, Olivier ;
Lalande, Alain ;
Zotti, Clement ;
Cervenansky, Frederick ;
Yang, Xin ;
Heng, Pheng-Ann ;
Cetin, Irem ;
Lekadir, Karim ;
Camara, Oscar ;
Gonzalez Ballester, Miguel Angel ;
Sanroma, Gerard ;
Napel, Sandy ;
Petersen, Steffen ;
Tziritas, Georgios ;
Grinias, Elias ;
Khened, Mahendra ;
Kollerathu, Varghese Alex ;
Krishnamurthi, Ganapathy ;
Rohe, Marc-Michel ;
Pennec, Xavier ;
Sermesant, Maxime ;
Isensee, Fabian ;
Jaeger, Paul ;
Maier-Hein, Klaus H. ;
Full, Peter M. ;
Wolf, Ivo ;
Engelhardt, Sandy ;
Baumgartner, Christian F. ;
Koch, Lisa M. ;
Wolterink, Jelmer M. ;
Isgum, Ivana ;
Jang, Yeonggul ;
Hong, Yoonmi ;
Patravali, Jay ;
Jain, Shubham ;
Humbert, Olivier ;
Jodoin, Pierre-Marc .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (11) :2514-2525
[5]   The Liver Tumor Segmentation Benchmark (LiTS) [J].
Bilic, Patrick ;
Christ, Patrick ;
Li, Hongwei Bran ;
Vorontsov, Eugene ;
Ben-Cohen, Avi ;
Kaissis, Georgios ;
Szeskin, Adi ;
Jacobs, Colin ;
Mamani, Gabriel Efrain Humpire ;
Chartrand, Gabriel ;
Lohoefer, Fabian ;
Holch, Julian Walter ;
Sommer, Wieland ;
Hofmann, Felix ;
Hostettler, Alexandre ;
Lev-Cohain, Naama ;
Drozdzal, Michal ;
Amitai, Michal Marianne ;
Vivanti, Refael ;
Sosna, Jacob ;
Ezhov, Ivan ;
Sekuboyina, Anjany ;
Navarro, Fernando ;
Kofler, Florian ;
Paetzold, Johannes C. ;
Shit, Suprosanna ;
Hu, Xiaobin ;
Lipkova, Jana ;
Rempfler, Markus ;
Piraud, Marie ;
Kirschke, Jan ;
Wiestler, Benedikt ;
Zhang, Zhiheng ;
Huelsemeyer, Christian ;
Beetz, Marcel ;
Ettlinger, Florian ;
Antonelli, Michela ;
Bae, Woong ;
Bellver, Miriam ;
Bi, Lei ;
Chen, Hao ;
Chlebus, Grzegorz ;
Dam, Erik B. ;
Dou, Qi ;
Fu, Chi-Wing ;
Georgescu, Bogdan ;
Giro-I-Nieto, Xavier ;
Gruen, Felix ;
Han, Xu ;
Heng, Pheng-Ann .
MEDICAL IMAGE ANALYSIS, 2023, 84
[6]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[7]  
Cardoso M J., 2022, arXiv, DOI [DOI 10.48550/ARXIV.2211.02701, 10.48550/arXiv.2211.02701]
[8]  
Chen J., 2021, arXiv
[9]   UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation [J].
Gao, Yunhe ;
Zhou, Mu ;
Metaxas, Dimitris N. .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 :61-71
[10]  
github, Swinunetr comment on additional training data