Robust and data-efficient generalization of self-supervised machine learning for diagnostic imaging

被引:115
作者
Azizi, Shekoofeh [1 ]
Culp, Laura [1 ]
Freyberg, Jan [1 ]
Mustafa, Basil [1 ]
Baur, Sebastien [1 ]
Kornblith, Simon [1 ]
Chen, Ting [1 ]
Tomasev, Nenad [2 ]
Mitrovic, Jovana [2 ]
Strachan, Patricia [1 ]
Mahdavi, S. Sara [1 ]
Wulczyn, Ellery [1 ]
Babenko, Boris [1 ]
Walker, Megan [1 ]
Loh, Aaron [1 ]
Chen, Po-Hsuan Cameron [1 ]
Liu, Yuan [1 ]
Bavishi, Pinal [1 ]
McKinney, Scott Mayer [1 ]
Winkens, Jim [1 ]
Roy, Abhijit Guha [1 ]
Beaver, Zach [1 ]
Ryan, Fiona [3 ]
Krogue, Justin [1 ]
Etemadi, Mozziyar [4 ]
Telang, Umesh [1 ]
Liu, Yun [1 ]
Peng, Lily [1 ]
Corrado, Greg S. [1 ]
Webster, Dale R. [1 ]
Fleet, David [1 ]
Hinton, Geoffrey [1 ]
Houlsby, Neil [1 ]
Karthikesalingam, Alan [1 ]
Norouzi, Mohammad [1 ]
Natarajan, Vivek [1 ]
机构
[1] Google Res, Mountain View, CA USA
[2] DeepMind, London, England
[3] Georgia Inst Technol, Comp Sci, Atlanta, GA USA
[4] Northwestern Univ, Sch Med, Sch Engn, Chicago, IL USA
关键词
DIABETIC-RETINOPATHY; NEURAL-NETWORK; DEEP; CANCER; CLASSIFICATION; EDEMA;
D O I
10.1038/s41551-023-01049-7
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
A representation-learning strategy for machine-learning models applied to medical-imaging tasks improves model robustness and training efficiency and mitigates suboptimal out-of-distribution performance. Machine-learning models for medical tasks can match or surpass the performance of clinical experts. However, in settings differing from those of the training dataset, the performance of a model can deteriorate substantially. Here we report a representation-learning strategy for machine-learning models applied to medical-imaging tasks that mitigates such 'out of distribution' performance problem and that improves model robustness and training efficiency. The strategy, which we named REMEDIS (for 'Robust and Efficient Medical Imaging with Self-supervision'), combines large-scale supervised transfer learning on natural images and intermediate contrastive self-supervised learning on medical images and requires minimal task-specific customization. We show the utility of REMEDIS in a range of diagnostic-imaging tasks covering six imaging domains and 15 test datasets, and by simulating three realistic out-of-distribution scenarios. REMEDIS improved in-distribution diagnostic accuracies up to 11.5% with respect to strong supervised baseline models, and in out-of-distribution settings required only 1-33% of the data for retraining to match the performance of supervised models retrained using all available data. REMEDIS may accelerate the development lifecycle of machine-learning models for medical imaging.
引用
收藏
页码:756 / +
页数:30
相关论文
共 188 条
[1]  
Albuquerque Isabela., 2020, Adversarial target-invariant representation learning for domain generalization
[2]   Towards a Better Understanding of Transfer Learning for Medical Imaging: A Case Study [J].
Alzubaidi, Laith ;
Fadhel, Mohammed A. ;
Al-Shamma, Omran ;
Zhang, Jinglan ;
Santamaria, J. ;
Duan, Ye ;
Oleiwi, Sameer R. .
APPLIED SCIENCES-BASEL, 2020, 10 (13)
[3]   Optimizing the Performance of Breast Cancer Classification by Employing the Same Domain Transfer Learning from Hybrid Deep Convolutional Neural Network Model [J].
Alzubaidi, Laith ;
Al-Shamma, Omran ;
Fadhel, Mohammed A. ;
Farhan, Laith ;
Zhang, Jinglan ;
Duan, Ye .
ELECTRONICS, 2020, 9 (03)
[4]  
Andreassen Anders., 2021, The evolution of out-of-distribution robustness throughout fine-tuning
[5]  
[Anonymous], 2020, Artificial Intelligence in Health Care: Benefits and Challenges of Machine Learning in Drug Development
[6]   Big Self-Supervised Models Advance Medical Image Classification [J].
Azizi, Shekoofeh ;
Mustafa, Basil ;
Ryan, Fiona ;
Beaver, Zachary ;
Freyberg, Jan ;
Deaton, Jonathan ;
Loh, Aaron ;
Karthikesalingam, Alan ;
Kornblith, Simon ;
Chen, Ting ;
Natarajan, Vivek ;
Norouzi, Mohammad .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3458-3468
[7]  
Bachman P, 2019, ADV NEUR IN, V32
[8]  
Baevski A, 2020, Arxiv, DOI arXiv:1911.03912
[9]   Self-Supervised Learning for Cardiac MR Image Segmentation by Anatomical Position Prediction [J].
Bai, Wenjia ;
Chen, Chen ;
Tarroni, Giacomo ;
Duan, Jinming ;
Guitton, Florian ;
Petersen, Steffen E. ;
Guo, Yike ;
Matthews, Paul M. ;
Rueckert, Daniel .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 :541-549
[10]  
Bakalo R, 2019, I S BIOMED IMAGING, P1905, DOI [10.1109/ISBI.2019.8759458, 10.1109/isbi.2019.8759458]