Self-supervised multimodal reconstruction of retinal images over paired datasets

Cited by: 22
Authors
Hervella, Alvaro S. [1, 2]
Rouco, Jose [1, 2]
Novo, Jorge [1, 2]
Ortega, Marcos [1, 2]
Affiliations
[1] Univ A Coruna, CITIC Res Ctr Informat & Commun Technol, La Coruna, Spain
[2] Univ A Coruna, Dept Comp Sci, La Coruna, Spain
Keywords
Self-supervised learning; Eye fundus; Deep learning; Multimodal; Retinography; Angiography; Convolutional neural networks
DOI
10.1016/j.eswa.2020.113674
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Data scarcity represents an important constraint for the training of deep neural networks in medical imaging. Medical image labeling, especially if pixel-level annotations are required, is an expensive task that needs expert intervention and usually results in a reduced number of annotated samples. In contrast, extensive amounts of unlabeled data are produced in the daily clinical practice, including paired multi-modal images from patients that were subjected to multiple imaging tests. This work proposes a novel self-supervised multimodal reconstruction task that takes advantage of this unlabeled multimodal data for learning about the domain without human supervision. Paired multimodal data is a rich source of clinical information that can be naturally exploited by trying to estimate one image modality from others. This multimodal reconstruction requires the recognition of domain-specific patterns that can be used to complement the training of image analysis tasks in the same domain for which annotated data is scarce. In this work, a set of experiments is performed using a multimodal setting of retinography and fluorescein angiography pairs that offer complementary information about the eye fundus. The evaluations performed on different public datasets, which include pathological and healthy data samples, demonstrate that a network trained for self-supervised multimodal reconstruction of angiography from retinography achieves unsupervised recognition of important retinal structures. These results indicate that the proposed self-supervised task provides relevant cues for image analysis tasks in the same domain. (c) 2020 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
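The core idea in the abstract, using the paired modality itself as the training target so that no human annotation is needed, can be illustrated with a toy sketch. This is not the authors' network or data: the "images" are short lists of pixel intensities, the relation between the two modalities (`0.8 * (1 - v) + 0.1`) is a made-up assumption, and the per-pixel linear model stands in for a deep network. All function names are hypothetical.

```python
import random

def make_pair(rng, n_pixels=16):
    """Toy stand-in for one paired sample (retinography, angiography)."""
    retino = [rng.random() for _ in range(n_pixels)]
    # Assumed (hypothetical) relation between modalities: inverted, rescaled intensity.
    angio = [0.8 * (1.0 - v) + 0.1 for v in retino]
    return retino, angio

def predict(params, retino):
    """Per-pixel linear model standing in for the reconstruction network."""
    w, b = params
    return [w * v + b for v in retino]

def reconstruction_loss(pred, target):
    """Mean squared error between predicted and real angiography."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(target)

def train(pairs, epochs=200, lr=0.3):
    """Stochastic gradient descent; the only supervision is the paired image."""
    w, b = 0.0, 0.0
    for _ in range(epochs):
        for retino, angio in pairs:
            pred = predict((w, b), retino)
            n = len(retino)
            grad_w = sum(2 * (p - t) * v for p, t, v in zip(pred, angio, retino)) / n
            grad_b = sum(2 * (p - t) for p, t in zip(pred, angio)) / n
            w -= lr * grad_w
            b -= lr * grad_b
    return w, b

rng = random.Random(0)
pairs = [make_pair(rng) for _ in range(8)]
params = train(pairs)
```

In the setting the abstract describes, the representation learned this way is what would later support image analysis tasks in the same domain where labels are scarce; the toy model above only shows how the training signal arises from the pairing.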
Pages: 14