Metadata-enhanced contrastive learning from retinal optical coherence tomography images

被引:2
作者
Holland, Robbie [1 ]
Leingang, Oliver [2 ]
Bogunovic, Hrvoje [2 ,3 ]
Riedl, Sophie [2 ]
Fritsche, Lars [4 ]
Prevost, Toby [5 ]
Scholl, Hendrik P. N. [6 ,7 ]
Schmidt-Erfurth, Ursula [2 ]
Sivaprasad, Sobha [8 ,9 ]
Lotery, Andrew J. [10 ]
Rueckert, Daniel [1 ]
Menten, Martin J. [1 ,11 ]
机构
[1] Imperial Coll London, BioMedIA, London, England
[2] Med Univ Vienna, Lab Ophthalm Image Anal, Vienna, Austria
[3] Med Univ Vienna, Christian Doppler Lab Artificial Intelligence Reti, Vienna, Austria
[4] Univ Michigan, Dept Biostat, Ann Arbor, MI USA
[5] Kings Coll London, Nightingale Saunders Clin Trials & Epidemiol Unit, London, England
[6] Inst Mol & Clin Ophthalmol Basel, Basel, Basel Stadt, Switzerland
[7] Univ Basel, Dept Ophthalmol, Basel, Basel Stadt, Switzerland
[8] UCL, Inst Ophthalmol, London, England
[9] Moorfields Eye Hosp, Moorfields Natl Inst Hlth & Care, Biomed Res Ctr, London, England
[10] Univ Southampton, Fac Med, Clin & Expt Sci, Southampton, Hants, England
[11] Tech Univ Munich, Inst AI & Informat Med, Munich, Bavaria, Germany
基金
英国惠康基金;
关键词
Self-supervised learning; Contrastive learning; Retinal OCT; Medical metadata; Longitudinal data; MACULAR DEGENERATION; PREDICTION;
D O I
10.1016/j.media.2024.103296
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has potential to automate screening, monitoring and grading of disease in medical images. Pretraining with contrastive learning enables models to extract robust and generalisable features from natural image datasets, facilitating label-efficient downstream image analysis. However, the direct application of conventional contrastive methods to medical datasets introduces two domain-specific issues. Firstly, several image transformations which have been shown to be crucial for effective contrastive learning do not translate from the natural image to the medical image domain. Secondly, the assumption made by conventional methods, that any two images are dissimilar, is systematically misleading in medical datasets depicting the same anatomy and disease. This is exacerbated in longitudinal image datasets that repeatedly image the same patient cohort to monitor their disease progression over time. In this paper we tackle these issues by extending conventional contrastive frameworks with a novel metadata-enhanced strategy. Our approach employs widely available patient metadata to approximate the true set of inter-image contrastive relationships. To this end we employ records for patient identity, eye position (i.e. left or right) and time series information. In experiments using two large longitudinal datasets containing 170,427 retinal optical coherence tomography (OCT) images of 7912 patients with age-related macular degeneration (AMD), we evaluate the utility of using metadata to incorporate the temporal dynamics of disease progression into pretraining. Our metadata-enhanced approach outperforms both standard contrastive methods and a retinal image foundation model in five out of six image- level downstream tasks related to AMD. We find benefits in both a low-data and high-data regime across tasks ranging from AMD stage and type classification to prediction of visual acuity. Due to its modularity, our method can be quickly and cost-effectively tested to establish the potential benefits of including available metadata in contrastive pretraining.
引用
收藏
页数:10
相关论文
共 50 条
[1]  
Abramoff Michael D, 2010, IEEE Rev Biomed Eng, V3, P169, DOI 10.1109/RBME.2010.2084567
[2]   Big Self-Supervised Models Advance Medical Image Classification [J].
Azizi, Shekoofeh ;
Mustafa, Basil ;
Ryan, Fiona ;
Beaver, Zachary ;
Freyberg, Jan ;
Deaton, Jonathan ;
Loh, Aaron ;
Karthikesalingam, Alan ;
Kornblith, Simon ;
Chen, Ting ;
Natarajan, Vivek ;
Norouzi, Mohammad .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3458-3468
[3]   Self-Supervised Learning for Cardiac MR Image Segmentation by Anatomical Position Prediction [J].
Bai, Wenjia ;
Chen, Chen ;
Tarroni, Giacomo ;
Duan, Jinming ;
Guitton, Florian ;
Petersen, Steffen E. ;
Guo, Yike ;
Matthews, Paul M. ;
Rueckert, Daniel .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 :541-549
[4]   A computerized method of visual acuity testing: Adaptation of the early treatment of diabetic retinopathy study testing protocol [J].
Beck, RW ;
Moke, PS ;
Turpin, AH ;
Ferris, FL ;
Sangiovanni, JP ;
Johnson, CA ;
Birch, EE ;
Chandler, DL ;
Cox, TA ;
Blair, RC ;
Kraker, RT .
AMERICAN JOURNAL OF OPHTHALMOLOGY, 2003, 135 (02) :194-205
[5]   Machine Learning of the Progression of Intermediate Age-Related Macular Degeneration Based on OCT Imaging [J].
Bogunovic, Hrvoje ;
Montuoro, Alessio ;
Baratsits, Magdalena ;
Karantonis, Maria G. ;
Waldstein, Sebastian M. ;
Schlanitz, Ferdinand ;
Schmidt-Erfurth, Ursula .
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2017, 58 (06) :BIO141-BIO150
[6]  
Chaitanya K., 2020, Advances in neural information processing systems, V33, P12546
[7]   Self-supervised learning for medical image analysis using image context restoration [J].
Chen, Liang ;
Bentley, Paul ;
Mori, Kensaku ;
Misawa, Kazunari ;
Fujiwara, Michitaka ;
Rueckert, Daniel .
MEDICAL IMAGE ANALYSIS, 2019, 58
[8]  
Chen T, 2020, PR MACH LEARN RES, V119
[9]  
Chen Ting, 2020, Advances in neural information processing systems, DOI 10.48550/arXiv.2006.10029
[10]   Exploring Simple Siamese Representation Learning [J].
Chen, Xinlei ;
He, Kaiming .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15745-15753