Contrastive Learning with Continuous Proxy Meta-data for 3D MRI Classification

被引:51
作者
Dufumier, Benoit [1 ,2 ]
Gori, Pietro [2 ]
Victor, Julie [1 ]
Grigis, Antoine [1 ]
Wessa, Michele [3 ]
Brambilla, Paolo [4 ]
Favre, Pauline [1 ]
Polosan, Mircea [5 ]
McDonald, Colm [6 ]
Piguet, Camille Marie [7 ]
Phillips, Mary [8 ]
Eyler, Lisa [9 ]
Duchesnay, Edouard [1 ]
机构
[1] Univ Paris Saclay, CEA Saclay, NeuroSpin, Gif Sur Yvette, France
[2] IPParis, Telecom Paris, LTCI, Paris, France
[3] Johannes Gutenberg Univ Mainz, Dept Neuropsychol, Mainz, Germany
[4] Univ Milan, Fdn IRCCS, Dept Neurosci, Milan, Italy
[5] Univ Grenoble Alpes, CHU Grenoble Alpe, Inserm U1216, Grenoble, France
[6] Ctr Neuroimaging & Cognit Genom NICOG, Galway, Ireland
[7] Univ Geneva, Dept Neurosci, Geneva, Switzerland
[8] Univ Pittsburgh, Dept Psychiat, Western Psychiat Inst, Pittsburgh, PA USA
[9] Univ Calif San Diego, Dept Psychiat, San Diego, CA USA
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II | 2021年 / 12902卷
关键词
D O I
10.1007/978-3-030-87196-3_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional supervised learning with deep neural networks requires a tremendous amount of labelled data to converge to a good solution. For 3D medical images, it is often impractical to build a large homogeneous annotated dataset for a specific pathology. Self-supervised methods offer a new way to learn a representation of the images in an unsupervised manner with a neural network. In particular, contrastive learning has shown great promises by (almost) matching the performance of fully-supervised CNN on vision tasks. Nonetheless, this method does not take advantage of available meta-data, such as participant's age, viewed as prior knowledge. Here, we propose to leverage continuous proxy metadata, in the contrastive learning framework, by introducing a new loss called y-Aware InfoNCE loss. Specifically, we improve the positive sampling during pre-training by adding more positive examples with similar proxy meta-data with the anchor, assuming they share similar discriminative semantic features. With our method, a 3D CNN model pre-trained on 10(4) multi-site healthy brain MRI scans can extract relevant features for three classification tasks: schizophrenia, bipolar diagnosis and Alzheimer's detection. When fine-tuned, it also outperforms 3D CNN trained from scratch on these tasks, as well as state-of-the-art self-supervised methods. Our code is made publicly available here.
引用
收藏
页码:58 / 68
页数:11
相关论文
共 29 条
[1]  
Caron M, 2020, ADV NEUR IN, V33
[2]  
Chaitanya K., 2020, Contrastive learning of global and local features for medical image segmentation with limited annotations
[3]  
Chapelle Olivier, 2001, VICINAL RISK MINIMIZ
[4]   Self-supervised learning for medical image analysis using image context restoration [J].
Chen, Liang ;
Bentley, Paul ;
Mori, Kensaku ;
Misawa, Kazunari ;
Fujiwara, Michitaka ;
Rueckert, Daniel .
MEDICAL IMAGE ANALYSIS, 2019, 58
[5]  
Chen T, 2020, PR MACH LEARN RES, V119
[6]  
Chuang C.Y., 2020, NEURIPS
[7]  
Devries T., 2017, Dataset augmentation in feature space
[8]  
Ding X., 2021, ICLR
[9]   Unsupervised Visual Representation Learning by Context Prediction [J].
Doersch, Carl ;
Gupta, Abhinav ;
Efros, Alexei A. .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1422-1430
[10]  
Grill Jean-Bastien, 2020, ADV NEURAL INF PROCE, V33, P21271