Contrastive Learning with Continuous Proxy Meta-data for 3D MRI Classification

被引：51

作者：

Dufumier, Benoit ^{[1
,2
]}

Gori, Pietro ^{[2
]}

Victor, Julie ^{[1
]}

Grigis, Antoine ^{[1
]}

Wessa, Michele ^{[3
]}

Brambilla, Paolo ^{[4
]}

Favre, Pauline ^{[1
]}

Polosan, Mircea ^{[5
]}

McDonald, Colm ^{[6
]}

Piguet, Camille Marie ^{[7
]}

Phillips, Mary ^{[8
]}

Eyler, Lisa ^{[9
]}

Duchesnay, Edouard ^{[1
]}

机构：

[1] Univ Paris Saclay, CEA Saclay, NeuroSpin, Gif Sur Yvette, France

[2] IPParis, Telecom Paris, LTCI, Paris, France

[3] Johannes Gutenberg Univ Mainz, Dept Neuropsychol, Mainz, Germany

[4] Univ Milan, Fdn IRCCS, Dept Neurosci, Milan, Italy

[5] Univ Grenoble Alpes, CHU Grenoble Alpe, Inserm U1216, Grenoble, France

[6] Ctr Neuroimaging & Cognit Genom NICOG, Galway, Ireland

[7] Univ Geneva, Dept Neurosci, Geneva, Switzerland

[8] Univ Pittsburgh, Dept Psychiat, Western Psychiat Inst, Pittsburgh, PA USA

[9] Univ Calif San Diego, Dept Psychiat, San Diego, CA USA

来源：

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II | 2021年 / 12902卷

关键词：

D O I：

10.1007/978-3-030-87196-3_6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Traditional supervised learning with deep neural networks requires a tremendous amount of labelled data to converge to a good solution. For 3D medical images, it is often impractical to build a large homogeneous annotated dataset for a specific pathology. Self-supervised methods offer a new way to learn a representation of the images in an unsupervised manner with a neural network. In particular, contrastive learning has shown great promises by (almost) matching the performance of fully-supervised CNN on vision tasks. Nonetheless, this method does not take advantage of available meta-data, such as participant's age, viewed as prior knowledge. Here, we propose to leverage continuous proxy metadata, in the contrastive learning framework, by introducing a new loss called y-Aware InfoNCE loss. Specifically, we improve the positive sampling during pre-training by adding more positive examples with similar proxy meta-data with the anchor, assuming they share similar discriminative semantic features. With our method, a 3D CNN model pre-trained on 10(4) multi-site healthy brain MRI scans can extract relevant features for three classification tasks: schizophrenia, bipolar diagnosis and Alzheimer's detection. When fine-tuned, it also outperforms 3D CNN trained from scratch on these tasks, as well as state-of-the-art self-supervised methods. Our code is made publicly available here.

引用

页码：58 / 68

页数：11

共 29 条

[1]

Caron M, 2020, ADV NEUR IN, V33

[2]

Chaitanya K., 2020, Contrastive learning of global and local features for medical image segmentation with limited annotations

[3]

Chapelle Olivier, 2001, VICINAL RISK MINIMIZ

[4] Self-supervised learning for medical image analysis using image context restoration [J].

Chen, Liang ;

Bentley, Paul ;

Mori, Kensaku ;

Misawa, Kazunari ;

Fujiwara, Michitaka ;

Rueckert, Daniel .

MEDICAL IMAGE ANALYSIS, 2019, 58

[5]

Chen T, 2020, PR MACH LEARN RES, V119

[6]

Chuang C.Y., 2020, NEURIPS

[7]

Devries T., 2017, Dataset augmentation in feature space

[8]

Ding X., 2021, ICLR

[9] Unsupervised Visual Representation Learning by Context Prediction [J].

Doersch, Carl ;

Gupta, Abhinav ;

Efros, Alexei A. .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1422-1430

[10]

Grill Jean-Bastien, 2020, ADV NEURAL INF PROCE, V33, P21271

← 1 2 3 →