Non-uniform Label Smoothing for Diabetic Retinopathy Grading from Retinal Fundus Images with Deep Neural Networks

被引:12
作者
Galdran, Adrian [1 ,2 ]
Chelbi, Jihed [3 ]
Kobi, Riadh [3 ]
Dolz, Jose [1 ]
Lombaert, Herve [1 ]
ben Ayed, Ismail [1 ]
Chakor, Hadi [3 ]
机构
[1] Cole Technol Super Montreal, Montreal, PQ, Canada
[2] Univ Bournemouth, Poole, Dorset, England
[3] Diag INC, Brossard, PQ, Canada
关键词
diabetic retinopathy grading; retinal image analysis; label smoothing; deep learning;
D O I
10.1167/tvst.9.2.34
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
Purpose: Introducing a new technique to improve deep learning (DL) models designed for automatic grading of diabetic retinopathy (DR) from retinal fundus images by enhancing predictions' consistency. Methods: A convolutional neural network (CNN) was optimized in three different manners to predict DR grade from eye fundus images. The optimization criteria were (1) the standard cross-entropy (CE) loss; (2) CE supplemented with label smoothing (LS), a regularization approach widely employed in computer vision tasks; and (3) our proposed non-uniform label smoothing (N-ULS), a modification of LS that models the underlying structure of expert annotations. Results: Performance was measured in terms of quadratic-weighted kappa score (quad-kappa) and average area under the receiver operating curve (AUROC), as well as with suitable metrics for analyzing diagnostic consistency, like weighted precision, recall, and F1 score, or Matthews correlation coefficient. While LS generally harmed the performance of the CNN, N-ULS statistically significantly improved performance with respect to CE in terms quad-kappa score (73.17 vs. 77.69, P < 0.025), without any performance decrease in average AUROC. N-ULS achieved this while simultaneously increasing performance for all other analyzed metrics. Conclusions: For extending standard modeling approaches from DR detection to the more complex task of DR grading, it is essential to consider the underlying structure of expert annotations. The approach introduced in this article can be easily implemented in conjunction with deep neural networks to increase their consistency without sacrificing per-class performance. Translational Relevance: A straightforward modification of current standard training practices of CNNs can substantially improve consistency in DR grading, better modeling expert annotations and human variability.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 26 条
[11]   Cost of a Community-Based Diabetic Retinopathy Screening Program [J].
Byrne, Margaret M. ;
Parker, Dorothy F. ;
Tannenbaum, Stacey L. ;
Ocasio, Manuel A. ;
Lam, Byron L. ;
Zimmer-Galler, Ingrid ;
Lee, David J. .
DIABETES CARE, 2014, 37 (11) :E236-E237
[12]   A Weakly-Supervised Framework for Interpretable Diabetic Retinopathy Detection on Retinal Images [J].
Costa, Pedro ;
Galdran, Adrian ;
Smailagic, Asim ;
Campilho, Aurelio .
IEEE ACCESS, 2018, 6 :18747-18758
[13]   MULTIPLE COMPARISONS AMONG MEANS [J].
DUNN, OJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) :52-&
[14]   Advances in Retinal Imaging and Applications in Diabetic Retinopathy Screening: A Review [J].
Fenner, Beau J. ;
Wong, Raymond L. M. ;
Lam, Wai-Ching ;
Tan, Gavin S. W. ;
Cheung, Gemmy C. M. .
OPHTHALMOLOGY AND THERAPY, 2018, 7 (02) :333-346
[15]   Automated Identification of Diabetic Retinopathy Using Deep Learning [J].
Gargeya, Rishab ;
Leng, Theodore .
OPHTHALMOLOGY, 2017, 124 (07) :962-969
[16]   Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs [J].
Gulshan, Varun ;
Peng, Lily ;
Coram, Marc ;
Stumpe, Martin C. ;
Wu, Derek ;
Narayanaswamy, Arunachalam ;
Venugopalan, Subhashini ;
Widner, Kasumi ;
Madams, Tom ;
Cuadros, Jorge ;
Kim, Ramasamy ;
Raman, Rajiv ;
Nelson, Philip C. ;
Mega, Jessica L. ;
Webster, R. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2016, 316 (22) :2402-2410
[17]   A simple generalisation of the area under the ROC curve for multiple class classification problems [J].
Hand, DJ ;
Till, RJ .
MACHINE LEARNING, 2001, 45 (02) :171-186
[18]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[19]   Grader Variability and the Importance of Reference Standards for Evaluating Machine Learning Models for Diabetic Retinopathy [J].
Krause, Jonathan ;
Gulshan, Varun ;
Rahimy, Ehsan ;
Karth, Peter ;
Widner, Kasumi ;
Corrado, Greg S. ;
Peng, Lily ;
Webster, Dale R. .
OPHTHALMOLOGY, 2018, 125 (08) :1264-1272
[20]  
Kukacka J, 2018, ARXIV171010686