Expert-validated estimation of diagnostic uncertainty for deep neural networks in diabetic retinopathy detection

被引:52
作者
Ayhan, Murat Seckin [1 ]
Kuhlewein, Laura [1 ,2 ]
Aliyeva, Gulnar [2 ]
Inhoffen, Werner [2 ]
Ziemssen, Focke [2 ]
Berens, Philipp [1 ,3 ,4 ]
机构
[1] Univ Tubingen, Inst Ophthalm Res, Tubingen, Germany
[2] Univ Tubingen, Univ Eye Clin, Tubingen, Germany
[3] Univ Tubingen, Bernstein Ctr Computat Neurosci, Tubingen, Germany
[4] Univ Tubingen, Inst Bioinformat & Med Informat, Tubingen, Germany
关键词
Deep neural networks; Diabetic retinopathy; Uncertainty; Calibration; CLASSIFICATION; MEDICINE; CANCER;
D O I
10.1016/j.media.2020.101724
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning-based systems can achieve a diagnostic performance comparable to physicians in a variety of medical use cases including the diagnosis of diabetic retinopathy. To be useful in clinical practice, it is necessary to have well calibrated measures of the uncertainty with which these systems report their decisions. However, deep neural networks (DNNs) are being often overconfident in their predictions, and are not amenable to a straightforward probabilistic treatment. Here, we describe an intuitive framework based on test-time data augmentation for quantifying the diagnostic uncertainty of a state-of-the-art DNN for diagnosing diabetic retinopathy. We show that the derived measure of uncertainty is well-calibrated and that experienced physicians likewise find cases with uncertain diagnosis difficult to evaluate. This paves the way for an integrated treatment of uncertainty in DNN-based diagnostic systems. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 79 条
[1]  
Abadi M., 2015, TENSORFLOW LARGE SCA
[2]   Pivotal trial of an autonomous AI-based diagnostic system for detection of diabetic retinopathy in primary care offices [J].
Abramoff, Michael D. ;
Lavin, Philip T. ;
Birch, Michele ;
Shah, Nilay ;
Folk, James C. .
NPJ DIGITAL MEDICINE, 2018, 1
[3]  
[Anonymous], 2017, ERASERELU SIMPLE WAY
[4]  
[Anonymous], FDA NEWS
[5]   Deep Learning Predicts OCT Measures of Diabetic Macular Thickening From Color Fundus Photographs [J].
Arcadu, Filippo ;
Benmansour, Fethallah ;
Maunz, Andreas ;
Michon, John ;
Haskova, Zdenka ;
McClintock, Dana ;
Adamis, Anthony P. ;
Willis, Jeffrey R. ;
Prunotto, Marco .
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2019, 60 (04) :852-857
[6]   End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography [J].
Ardila, Diego ;
Kiraly, Atilla P. ;
Bharadwaj, Sujeeth ;
Choi, Bokyung ;
Reicher, Joshua J. ;
Peng, Lily ;
Tse, Daniel ;
Etemadi, Mozziyar ;
Ye, Wenxing ;
Corrado, Greg ;
Naidich, David P. ;
Shetty, Shravya .
NATURE MEDICINE, 2019, 25 (06) :954-+
[7]  
Ayhan M. S., 2018, Medical imaging with deep learning, P1
[8]   Defining and Measuring Diagnostic Uncertainty in Medicine: A Systematic Review [J].
Bhise, Viraj ;
Rajan, Suja S. ;
Sittig, Dean F. ;
Morgan, Robert O. ;
Chaudhary, Pooja ;
Singh, Hardeep .
JOURNAL OF GENERAL INTERNAL MEDICINE, 2018, 33 (01) :103-115
[9]  
Bishop CM., 2006, Pattern Recognition and Machine Learning
[10]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46