Improved Automated Detection of Diabetic Retinopathy on a Publicly Available Dataset Through Integration of Deep Learning

被引:684
作者
Abramoff, Michael David [1 ,2 ,3 ]
Lou, Yiyue [4 ]
Erginay, Ali [5 ]
Clarida, Warren [3 ]
Amelon, Ryan [3 ]
Folk, James C. [1 ,3 ]
Niemeijer, Meindert [3 ]
机构
[1] Univ Iowa, Hosp & Clin, Dept Ophthalmol & Visual Sci, Iowa City, IA USA
[2] Iowa City Vet Affairs Med Ctr, Iowa City, IA USA
[3] IDx LLC, Iowa City, IA USA
[4] Univ Iowa, Dept Stat, Coll Publ Hlth, Iowa City, IA USA
[5] Hop Lariboisiere, AP HP, Serv Ophtalmol, Paris, France
关键词
diabetic retinopathy; detection; deep learning; algorithm; diabetes; MACULAR EDEMA; SCREENING EXAMINATIONS; IMAGE-ANALYSIS; TELEMEDICINE; PHOTOGRAPHY;
D O I
10.1167/iovs.16-19964
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
PURPOSE. To compare performance of a deep-learning enhanced algorithm for automated detection of diabetic retinopathy (DR), to the previously published performance of that algorithm, the Iowa Detection Program (IDP)-without deep learning components-on the same publicly available set of fundus images and previously reported consensus reference standard set, by three US Board certified retinal specialists. METHODS. We used the previously reported consensus reference standard of referable DR (rDR), defined as International Clinical Classification of Diabetic Retinopathy moderate, severe nonproliferative (NPDR), proliferative DR, and/or macular edema (ME). Neither Messidor-2 images, nor the three retinal specialists setting the Messidor-2 reference standard were used for training IDx-DR version X2.1. Sensitivity, specificity, negative predictive value, area under the curve (AUC), and their confidence intervals (CIs) were calculated. RESULTS. Sensitivity was 96.8% (95% CI: 93.3%-98.8%), specificity was 87.0% (95% CI: 84.2%-89.4%), with 6/874 false negatives, resulting in a negative predictive value of 99.0% (95% CI: 97.8%-99.6%). No cases of severe NPDR, PDR, or ME were missed. The AUC was 0.980 (95% CI: 0.968-0.992). Sensitivity was not statistically different from published IDP sensitivity, which had a CI of 94.4% to 99.3%, but specificity was significantly better than the published IDP specificity CI of 55.7% to 63.0%. CONCLUSIONS. A deep-learning enhanced algorithm for the automated detection of DR, achieves significantly better performance than a previously reported, otherwise essentially identical, algorithm that does not employ deep learning. Deep learning enhanced algorithms have the potential to improve the efficiency of DR screening, and thereby to prevent visual loss and blindness from this devastating disease.
引用
收藏
页码:5200 / 5206
页数:7
相关论文
共 40 条
  • [1] Web-based screening for diabetic retinopathy in a primary care population: The EyeCheck project
    Abramoff, MD
    Suttorp-Schulten, MSA
    [J]. TELEMEDICINE JOURNAL AND E-HEALTH, 2005, 11 (06): : 668 - 674
  • [2] Automated Analysis of Retinal Images for Detection of Referable Diabetic Retinopathy
    Abramoff, Michael D.
    Folk, James C.
    Han, Dennis P.
    Walker, Jonathan D.
    Williams, David F.
    Russell, Stephen R.
    Massin, Pascale
    Cochener, Beatrice
    Gain, Philippe
    Tang, Li
    Lamard, Mathieu
    Moga, Daniela C.
    Quellec, Gwenole
    Niemeijer, Meindert
    [J]. JAMA OPHTHALMOLOGY, 2013, 131 (03) : 351 - 357
  • [3] Automated Early Detection of Diabetic Retinopathy
    Abramoff, Michael D.
    Reinhardt, Joseph M.
    Russell, Stephen R.
    Folk, James C.
    Mahajan, Vinit B.
    Niemeijer, Meindert
    Quellec, Gwenole
    [J]. OPHTHALMOLOGY, 2010, 117 (06) : 1147 - 1154
  • [4] DIAGNOSTIC-TESTS-3 - RECEIVER OPERATING CHARACTERISTIC PLOTS .7.
    ALTMAN, DG
    BLAND, JM
    [J]. BRITISH MEDICAL JOURNAL, 1994, 309 (6948) : 188 - 188
  • [5] [Anonymous], 1991, OPHTHALMOLOGY, V98, P823
  • [6] [Anonymous], 1978, Ophthalmology, V85, P82
  • [7] [Anonymous], 2008, NAT DIAB FACT SHEET
  • [8] [Anonymous], P 1 INT WORKSH COMP
  • [9] [Anonymous], 2015, DIAB RET DET
  • [10] [Anonymous], 2012, Diabetes Care, DOI DOI 10.2337/DC12-S004