A transfer learning-based multimodal neural network combining metadata and multiple medical images for glaucoma type diagnosis

被引:5
作者
Li, Yi [1 ]
Han, Yujie [1 ]
Li, Zihan [2 ]
Zhong, Yi [3 ]
Guo, Zhifen [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Liaoning, Peoples R China
[2] Northeastern Univ, Coll Software, Shenyang, Liaoning, Peoples R China
[3] Northeastern Univ, Coll Met, Shenyang, Liaoning, Peoples R China
关键词
OPTICAL COHERENCE TOMOGRAPHY; OPEN-ANGLE GLAUCOMA; CUP SEGMENTATION; PREVALENCE; CLASSIFICATION; DISC;
D O I
10.1038/s41598-022-27045-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Glaucoma is an acquired optic neuropathy, which can lead to irreversible vision loss. Deep learning(DL), especially convolutional neural networks(CNN), has achieved considerable success in the field of medical image recognition due to the availability of large-scale annotated datasets and CNNs. However, obtaining fully annotated datasets like ImageNet in the medical field is still a challenge. Meanwhile, single-modal approaches remain both unreliable and inaccurate due to the diversity of glaucoma disease types and the complexity of symptoms. In this paper, a new multimodal dataset for glaucoma is constructed and a new multimodal neural network for glaucoma diagnosis and classification (GMNNnet) is proposed aiming to address both of these issues. Specifically, the dataset includes the five most important types of glaucoma labels, electronic medical records and four kinds of high-resolution medical images. The structure of GMNNnet consists of three branches. Branch 1 consisting of convolutional, cyclic and transposition layers processes patient metadata, branch 2 uses Unet to extract features from glaucoma segmentation based on domain knowledge, and branch 3 uses ResFormer to directly process glaucoma medical images.Branch one and branch two are mixed together and then processed by the Catboost classifier. We introduce a gradient-weighted class activation mapping (Grad-GAM) method to increase the interpretability of the model and a transfer learning method for the case of insufficient training data,i.e.,fine-tuning CNN models pre-trained from natural image dataset to medical image tasks. The results show that GMNNnet can better present the high-dimensional information of glaucoma and achieves excellent performance under multimodal data.
引用
收藏
页数:13
相关论文
共 31 条
[11]  
Hervella AS, 2020, INT CONF ACOUST SPEE, P961, DOI [10.1109/ICASSP40776.2020.9053551, 10.1109/icassp40776.2020.9053551]
[12]   Multimodal Retinal Vessel Segmentation From Spectral-Domain Optical Coherence Tomography and Fundus Photography [J].
Hu, Zhihong ;
Niemeijer, Meindert ;
Abramoff, Michael D. ;
Garvin, Mona K. .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2012, 31 (10) :1900-1911
[13]  
Huazhu Fu, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P132, DOI 10.1007/978-3-319-46723-8_16
[14]   JointRCNN: A Region-Based Convolutional Neural Network for Optic Disc and Cup Segmentation [J].
Jiang, Yuming ;
Duan, Lixin ;
Cheng, Jun ;
Gu, Zaiwang ;
Xia, Hu ;
Fu, Huazhu ;
Li, Changsheng ;
Liu, Jiang .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2020, 67 (02) :335-343
[15]   Construction of Retinal Vessel Segmentation Models Based on Convolutional Neural Network [J].
Jin, Qiangguo ;
Chen, Qi ;
Meng, Zhaopeng ;
Wang, Bing ;
Su, Ran .
NEURAL PROCESSING LETTERS, 2020, 52 (02) :1005-1022
[16]   Global variations and time trends in the prevalence of primary open angle glaucoma (POAG): a systematic review and meta-analysis [J].
Kapetanakis, Venediktos V. ;
Chan, Michelle P. Y. ;
Foster, Paul J. ;
Cook, Derek G. ;
Owen, Christopher G. ;
Rudnicka, Alicja R. .
BRITISH JOURNAL OF OPHTHALMOLOGY, 2016, 100 (01) :86-93
[17]   Diagnostic Classification of Macular Ganglion Cell and Retinal Nerve Fiber Layer Analysis [J].
Kim, Ko Eun ;
Jeoung, Jin Wook ;
Park, Ki Ho ;
Kim, Dong Myung ;
Kim, Seok Hwan .
OPHTHALMOLOGY, 2015, 122 (03) :502-510
[18]   Retinal thickness measurements from optical coherence tomography using a Markov boundary model [J].
Koozekanani, D ;
Boyer, K ;
Roberts, C .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2001, 20 (09) :900-916
[19]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[20]   A Large-Scale Database and a CNN Model for Attention-Based Glaucoma Detection [J].
Li, Liu ;
Xu, Mai ;
Liu, Hanruo ;
Li, Yang ;
Wang, Xiaofei ;
Jiang, Lai ;
Wang, Zulin ;
Fan, Xiang ;
Wang, Ningli .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (02) :413-424