Segment anything model for medical images?

被引:165
作者
Huang, Yuhao [1 ,2 ,3 ]
Yang, Xin [1 ,2 ,3 ]
Liu, Lian [1 ,2 ,3 ]
Zhou, Han [1 ,2 ,3 ]
Chang, Ao [1 ,2 ,3 ]
Zhou, Xinrui [1 ,2 ,3 ]
Chen, Rusi [1 ,2 ,3 ]
Yu, Junxuan [1 ,2 ,3 ]
Chen, Jiongquan [1 ,2 ,3 ]
Chen, Chaoyu [1 ,2 ,3 ]
Liu, Sijing [1 ,2 ,3 ]
Chi, Haozhe [2 ,4 ]
Hu, Xindi [2 ,5 ]
Yue, Kejuan [2 ,6 ]
Li, Lei [2 ,7 ]
Grau, Vicente [2 ,7 ]
Fan, Deng-Ping [2 ,8 ]
Dong, Fajin [2 ,3 ,9 ,10 ]
Ni, Dong [1 ,2 ,3 ]
机构
[1] Shenzhen Univ, Med Sch, Sch Biomed Engn, Natl Reg Key Technol Engn Lab Med Ultrasound, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Med Ultrasound Image Comp MUS Lab, Shenzhen, Peoples R China
[3] Shenzhen Univ, Marshall Lab Biomed Engn, Shenzhen, Peoples R China
[4] Zhejiang Univ, Zhejiang, Peoples R China
[5] Shenzhen RayShape Med Technol Co Ltd, Shenzhen, Peoples R China
[6] Hunan First Normal Univ, Changsha, Peoples R China
[7] Univ Oxford, Dept Engn Sci, Oxford, England
[8] Swiss Fed Inst Technol, Comp Vis Lab CVL, Zurich, Switzerland
[9] Jinan Univ, Clin Med Coll 2, Ultrasound Dept, Guangzhou, Peoples R China
[10] Southern Univ Sci & Technol, Affiliated Hosp 1, Shenzhen Peoples Hosp, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Segment anything model; Medical image segmentation; Medical object perception; ALGORITHMS; VALIDATION; FRAMEWORK;
D O I
10.1016/j.media.2023.103061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Segment Anything Model (SAM) is the first foundation model for general image segmentation. It has achieved impressive results on various natural image segmentation tasks. However, medical image segmenta-tion (MIS) is more challenging because of the complex modalities, fine anatomical structures, uncertain and complex object boundaries, and wide-range object scales. To fully validate SAM's performance on medical data, we collected and sorted 53 open-source datasets and built a large medical segmentation dataset with 18 modalities, 84 objects, 125 object-modality paired targets, 1050K 2D images, and 6033K masks. We comprehensively analyzed different models and strategies on the so-called COSMOS 1050K dataset. Our findings mainly include the following: (1) SAM showed remarkable performance in some specific objects but was unstable, imperfect, or even totally failed in other situations. (2) SAM with the large ViT-H showed better overall performance than that with the small ViT-B. (3) SAM performed better with manual hints, especially box, than the Everything mode. (4) SAM could help human annotation with high labeling quality and less time. (5) SAM was sensitive to the randomness in the center point and tight box prompts, and may suffer from a serious performance drop. (6) SAM performed better than interactive methods with one or a few points, but will be outpaced as the number of points increases. (7) SAM's performance correlated to different factors, including boundary complexity, intensity differences, etc. (8) Finetuning the SAM on specific medical tasks could improve its average DICE performance by 4.39% and 6.68% for ViT-B and ViT-H, respectively. Codes and models are available at: https://github.com/yuhoo0302/Segment-Anything-Model-for-Medical-Images. We hope that this comprehensive report can help researchers explore the potential of SAM applications in MIS, and guide how to appropriately use and develop SAM.
引用
收藏
页数:21
相关论文
共 101 条
[1]   Neural Segmentation of Seeding ROIs (sROIs) for Pre-Surgical Brain Tractography [J].
Avital, Itzik ;
Nelkenbaum, Ilya ;
Tsarfaty, Galia ;
Konen, Eli ;
Kiryati, Nahum ;
Mayer, Arnaldo .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (05) :1655-1667
[2]  
Bakas S, 2019, Arxiv, DOI arXiv:1811.02629
[3]   Data Descriptor: Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features [J].
Bakas, Spyridon ;
Akbari, Hamed ;
Sotiras, Aristeidis ;
Bilello, Michel ;
Rozycki, Martin ;
Kirby, Justin S. ;
Freymann, John B. ;
Farahani, Keyvan ;
Davatzikos, Christos .
SCIENTIFIC DATA, 2017, 4
[4]   WM-DOVA maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians [J].
Bernal, Jorge ;
Javier Sanchez, F. ;
Fernandez-Esparrach, Gloria ;
Gil, Debora ;
Rodriguez, Cristina ;
Vilarino, Fernando .
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2015, 43 :99-111
[5]   Deep Learning Techniques for Automatic MRI Cardiac Multi-Structures Segmentation and Diagnosis: Is the Problem Solved? [J].
Bernard, Olivier ;
Lalande, Alain ;
Zotti, Clement ;
Cervenansky, Frederick ;
Yang, Xin ;
Heng, Pheng-Ann ;
Cetin, Irem ;
Lekadir, Karim ;
Camara, Oscar ;
Gonzalez Ballester, Miguel Angel ;
Sanroma, Gerard ;
Napel, Sandy ;
Petersen, Steffen ;
Tziritas, Georgios ;
Grinias, Elias ;
Khened, Mahendra ;
Kollerathu, Varghese Alex ;
Krishnamurthi, Ganapathy ;
Rohe, Marc-Michel ;
Pennec, Xavier ;
Sermesant, Maxime ;
Isensee, Fabian ;
Jaeger, Paul ;
Maier-Hein, Klaus H. ;
Full, Peter M. ;
Wolf, Ivo ;
Engelhardt, Sandy ;
Baumgartner, Christian F. ;
Koch, Lisa M. ;
Wolterink, Jelmer M. ;
Isgum, Ivana ;
Jang, Yeonggul ;
Hong, Yoonmi ;
Patravali, Jay ;
Jain, Shubham ;
Humbert, Olivier ;
Jodoin, Pierre-Marc .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (11) :2514-2525
[6]   The Liver Tumor Segmentation Benchmark (LiTS) [J].
Bilic, Patrick ;
Christ, Patrick ;
Li, Hongwei Bran ;
Vorontsov, Eugene ;
Ben-Cohen, Avi ;
Kaissis, Georgios ;
Szeskin, Adi ;
Jacobs, Colin ;
Mamani, Gabriel Efrain Humpire ;
Chartrand, Gabriel ;
Lohoefer, Fabian ;
Holch, Julian Walter ;
Sommer, Wieland ;
Hofmann, Felix ;
Hostettler, Alexandre ;
Lev-Cohain, Naama ;
Drozdzal, Michal ;
Amitai, Michal Marianne ;
Vivanti, Refael ;
Sosna, Jacob ;
Ezhov, Ivan ;
Sekuboyina, Anjany ;
Navarro, Fernando ;
Kofler, Florian ;
Paetzold, Johannes C. ;
Shit, Suprosanna ;
Hu, Xiaobin ;
Lipkova, Jana ;
Rempfler, Markus ;
Piraud, Marie ;
Kirschke, Jan ;
Wiestler, Benedikt ;
Zhang, Zhiheng ;
Huelsemeyer, Christian ;
Beetz, Marcel ;
Ettlinger, Florian ;
Antonelli, Michela ;
Bae, Woong ;
Bellver, Miriam ;
Bi, Lei ;
Chen, Hao ;
Chlebus, Grzegorz ;
Dam, Erik B. ;
Dou, Qi ;
Fu, Chi-Wing ;
Georgescu, Bogdan ;
Giro-I-Nieto, Xavier ;
Gruen, Felix ;
Han, Xu ;
Heng, Pheng-Ann .
MEDICAL IMAGE ANALYSIS, 2023, 84
[7]  
Butoi VI, 2023, Arxiv, DOI arXiv:2304.06131
[8]   Multi-Centre, Multi-Vendor and Multi-Disease Cardiac Segmentation: The M&Ms Challenge [J].
Campello, Victor M. ;
Gkontra, Polyxeni ;
Izquierdo, Cristian ;
Martin-Isla, Carlos ;
Sojoudi, Alireza ;
Full, Peter M. ;
Maier-Hein, Klaus ;
Zhang, Yao ;
He, Zhiqiang ;
Ma, Jun ;
Parreno, Mario ;
Albiol, Alberto ;
Kong, Fanwei ;
Shadden, Shawn C. ;
Acero, Jorge Corral ;
Sundaresan, Vaanathi ;
Saber, Mina ;
Elattar, Mustafa ;
Li, Hongwei ;
Menze, Bjoern ;
Khader, Firas ;
Haarburger, Christoph ;
Scannell, Cian M. ;
Veta, Mitko ;
Carscadden, Adam ;
Punithakumar, Kumaradevan ;
Liu, Xiao ;
Tsaftaris, Sotirios A. ;
Huang, Xiaoqiong ;
Yang, Xin ;
Li, Lei ;
Zhuang, Xiahai ;
Vilades, David ;
Descalzo, Martin L. ;
Guala, Andrea ;
La Mura, Lucia ;
Friedrich, Matthias G. ;
Garg, Ria ;
Lebel, Julie ;
Henriques, Filipe ;
Karakas, Mahir ;
Cavus, Ersin ;
Petersen, Steffen E. ;
Escalera, Sergio ;
Segui, Santi ;
Rodriguez-Palomares, Jose F. ;
Lekadir, Karim .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (12) :3543-3554
[9]   An Integrated Micro- and Macroarchitectural Analysis of the Drosophila Brain by Computer-Assisted Serial Section Electron Microscopy [J].
Cardona, Albert ;
Saalfeld, Stephan ;
Preibisch, Stephan ;
Schmid, Benjamin ;
Cheng, Anchi ;
Pulokas, Jim ;
Tomancak, Pavel ;
Hartenstein, Volker .
PLOS BIOLOGY, 2010, 8 (10)
[10]  
Chen C, 2023, Arxiv, DOI arXiv:2309.08842