A transfer learning-based multimodal neural network combining metadata and multiple medical images for glaucoma type diagnosis

Cited by: 5
Authors
Li, Yi [1]
Han, Yujie [1]
Li, Zihan [2]
Zhong, Yi [3]
Guo, Zhifen [1]
Affiliations
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Liaoning, Peoples R China
[2] Northeastern Univ, Coll Software, Shenyang, Liaoning, Peoples R China
[3] Northeastern Univ, Coll Met, Shenyang, Liaoning, Peoples R China
Keywords
OPTICAL COHERENCE TOMOGRAPHY; OPEN-ANGLE GLAUCOMA; CUP SEGMENTATION; PREVALENCE; CLASSIFICATION; DISC
DOI
10.1038/s41598-022-27045-6
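Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Discipline classification code
07; 0710; 09
Abstract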
Glaucoma is an acquired optic neuropathy that can lead to irreversible vision loss. Deep learning (DL), especially convolutional neural networks (CNNs), has achieved considerable success in medical image recognition, owing largely to the availability of large-scale annotated datasets. However, obtaining fully annotated datasets comparable to ImageNet remains a challenge in the medical field. Meanwhile, single-modal approaches remain both unreliable and inaccurate because of the diversity of glaucoma types and the complexity of their symptoms. In this paper, a new multimodal dataset for glaucoma is constructed and a new multimodal neural network for glaucoma diagnosis and classification (GMNNnet) is proposed to address both issues. Specifically, the dataset includes labels for the five most important types of glaucoma, electronic medical records, and four kinds of high-resolution medical images. GMNNnet consists of three branches. Branch 1, consisting of convolutional, cyclic, and transposition layers, processes patient metadata; branch 2 uses U-Net to extract features from glaucoma segmentation based on domain knowledge; and branch 3 uses ResFormer to process glaucoma medical images directly. The outputs of branches 1 and 2 are fused and then passed to a CatBoost classifier. We introduce a gradient-weighted class activation mapping (Grad-CAM) method to increase the interpretability of the model, and a transfer learning method for the case of insufficient training data, i.e., fine-tuning CNN models pre-trained on natural image datasets for medical image tasks. The results show that GMNNnet represents the high-dimensional information of glaucoma more effectively and achieves excellent performance on multimodal data.
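The gradient-weighted class activation mapping named in the abstract can be illustrated with a minimal sketch. It follows the standard published Grad-CAM formulation, not the authors' code: each feature map is weighted by the spatial average of the class-score gradient with respect to that map, the weighted maps are summed, and negative values are clipped. The function name `grad_cam` and the toy inputs are assumptions made for illustration; in practice the activations and gradients would come from a convolutional layer of a trained network.

```python
def grad_cam(activations, gradients):
    """Compute a Grad-CAM heat map from one convolutional layer.

    activations: list of K feature maps, each an H x W nested list.
    gradients:   list of K maps of d(class score)/d(activation), same shape.
    Returns an H x W nested list (un-normalized heat map).
    Hypothetical helper for illustration; not taken from the paper.
    """
    height = len(activations[0])
    width = len(activations[0][0])

    # Channel weight = global average of the gradient over all positions.
    weights = [
        sum(sum(row) for row in grad_map) / (height * width)
        for grad_map in gradients
    ]

    # Weighted sum of the feature maps, followed by ReLU.
    return [
        [
            max(0.0, sum(w * act[i][j] for w, act in zip(weights, activations)))
            for j in range(width)
        ]
        for i in range(height)
    ]


# Toy example: two 2 x 2 feature maps with constant gradients.
toy_activations = [[[1, 2], [3, 4]], [[1, 1], [1, 1]]]
toy_gradients = [[[1, 1], [1, 1]], [[-2, -2], [-2, -2]]]
heat_map = grad_cam(toy_activations, toy_gradients)
```

The resulting map is usually upsampled to the input resolution and overlaid on the image; that step, and any normalization, is omitted here.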
Pages: 13