Attention-Guided 3D-CNN Framework for Glaucoma Detection and Structural-Functional Association Using Volumetric Images

被引:36
作者
George, Yasmeen [1 ]
Antony, Bhavna J. [1 ]
Ishikawa, Hiroshi [2 ]
Wollstein, Gadi [2 ]
Schuman, Joel S. [2 ]
Garnavi, Rahil [1 ]
机构
[1] IBM Res, Melbourne, Vic 3006, Australia
[2] NYU Langone Hlth, Dept Ophthalmol, New York, NY 10017 USA
基金
美国国家卫生研究院;
关键词
Three-dimensional displays; Visualization; Retina; Computational modeling; Heating systems; Training; Biomedical optical imaging; 3D convolutional neural networks; optical coherence tomography; gradient-weighted class activation maps; glaucoma detection; visual field estimation; attention guided deep learning; NEURAL-NETWORK; SEGMENTATION; IMPACT;
D O I
10.1109/JBHI.2020.3001019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The direct analysis of 3D Optical Coherence Tomography (OCT) volumes enables deep learning models (DL) to learn spatial structural information and discover new bio-markers that are relevant to glaucoma. Downsampling 3D input volumes is the state-of-art solution to accommodate for the limited number of training volumes as well as the available computing resources. However, this limits the network's ability to learn from small retinal structures in OCT volumes. In this paper, our goal is to improve the performance by providing guidance to DL model during training in order to learn from finer ocular structures in 3D OCT volumes. Therefore, we propose an end-to-end attention guided 3D DL model for glaucoma detection and estimating visual function from retinal structures. The model consists of three pathways with the same network architecture but different inputs. One input is the original 3D-OCT cube and the other two are computed during training guided by the 3D gradient class activation heatmaps. Each pathway outputs the class-label and the whole model is trained concurrently to minimize the sum of losses from three pathways. The final output is obtained by fusing the predictions of the three pathways. Also, to explore the robustness and generalizability of the proposed model, we apply the model on a classification task for glaucoma detection as well as a regression task to estimate visual field index (VFI) (a value between 0 and 100). A 5-fold cross-validation with a total of 3782 and 10,370 OCT scans is used to train and evaluate the classification and regression models, respectively. The glaucoma detection model achieved an area under the curve (AUC) of 93.8% compared with 86.8% for a baseline model without the attention-guided component. The model also outperformed six different feature based machine learning approaches that use scanner computed measurements for training. Further, we also assessed the contribution of different retinal layers that are relevant to glaucoma. The VFI estimation model achieved a Pearson correlation and median absolute error of 0.75 and 3.6%, respectively, for a test set of size 3100 cubes.
引用
收藏
页码:3421 / 3430
页数:10
相关论文
共 50 条
  • [1] A deep learning model for the detection of both advanced and early glaucoma using fundus photography
    Ahn, Jin Mo
    Kim, Sangsoo
    Ahn, Kwang-Sung
    Cho, Sung-Hoon
    Lee, Kwan Bok
    Kim, Ungsoo Samuel
    [J]. PLOS ONE, 2018, 13 (11):
  • [2] Glaucoma Diagnosis with Machine Learning Based on Optical Coherence Tomography and Color Fundus Images
    An, Guangzhou
    Omodaka, Kazuko
    Hashimoto, Kazuki
    Tsuda, Satoru
    Shiga, Yukihiro
    Takada, Naoko
    Kikawa, Tsutomu
    Yokota, Hideo
    Akiba, Masahiro
    Nakazawa, Toru
    [J]. JOURNAL OF HEALTHCARE ENGINEERING, 2019, 2019
  • [3] Using Deep Learning and Transfer Learning to Accurately Diagnose Early-Onset Glaucoma From Macular Optical Coherence Tomography Images
    Asaoka, Ryo
    Murata, Hiroshi
    Hirasawa, Kazunori
    Fujino, Yuri
    Matsuura, Masato
    Miki, Atsuya
    Kanamoto, Takashi
    Ikeda, Yoko
    Mori, Kazuhiko
    Iwase, Aiko
    Shoji, Nobuyuki
    Inoue, Kenji
    Yamagami, Junkichi
    Araie, Makoto
    [J]. AMERICAN JOURNAL OF OPHTHALMOLOGY, 2019, 198 : 136 - 145
  • [4] Artificial Intelligence, Machine Learning, Deep Learning, and Cognitive Computing: What Do These Terms Mean and How Will They Impact Health Care?
    Bini, Stefano A.
    [J]. JOURNAL OF ARTHROPLASTY, 2018, 33 (08) : 2358 - 2361
  • [5] Breiman L., 2001, Mach. Learn., V45, P5
  • [6] Broadway David C, 2012, Community Eye Health, V25, P66
  • [7] Results of the European Glaucoma Prevention Study
    Centofanti, M
    Zeyen, T
    Miglior, S
    [J]. OPHTHALMOLOGY, 2005, 112 (03) : 366 - 375
  • [8] Dasgupta A, 2017, I S BIOMED IMAGING, P248, DOI 10.1109/ISBI.2017.7950512
  • [9] Glaucoma: the retina and beyond
    Davis, Benjamin Michael
    Crawley, Laura
    Pahlitzsch, Milena
    Javaid, Fatimah
    Cordeiro, Maria Francesca
    [J]. ACTA NEUROPATHOLOGICA, 2016, 132 (06) : 807 - 826
  • [10] Global causes of blindness and distance vision impairment 1990-2020: a systematic review and meta-analysis
    Flaxman, Seth R.
    Bourne, Rupert R. A.
    Resnikoff, Serge
    Ackland, Peter
    Braithwaite, Tasanee
    Cicinelli, Maria V.
    Das, Aditi
    Jonas, Jost B.
    Keeffe, Jill
    Kempen, John H.
    Leasher, Janet
    Limburg, Hans
    Naidoo, Kovin
    Pesudovs, Konrad
    Silvester, Alex
    Stevens, Gretchen A.
    Tahhan, Nina
    Wong, Tien Y.
    Taylor, Hugh R.
    [J]. LANCET GLOBAL HEALTH, 2017, 5 (12): : E1221 - E1234