Non-uniform Label Smoothing for Diabetic Retinopathy Grading from Retinal Fundus Images with Deep Neural Networks

被引：12

作者：

Galdran, Adrian ^{[1
,2
]}

Chelbi, Jihed ^{[3
]}

Kobi, Riadh ^{[3
]}

Dolz, Jose ^{[1
]}

Lombaert, Herve ^{[1
]}

ben Ayed, Ismail ^{[1
]}

Chakor, Hadi ^{[3
]}

机构：

[1] Cole Technol Super Montreal, Montreal, PQ, Canada

[2] Univ Bournemouth, Poole, Dorset, England

[3] Diag INC, Brossard, PQ, Canada

来源：

TRANSLATIONAL VISION SCIENCE & TECHNOLOGY | 2020年 / 9卷 / 02期

关键词：

diabetic retinopathy grading; retinal image analysis; label smoothing; deep learning;

D O I：

10.1167/tvst.9.2.34

中图分类号：

R77 [眼科学];

学科分类号：

100212 ;

摘要：

Purpose: Introducing a new technique to improve deep learning (DL) models designed for automatic grading of diabetic retinopathy (DR) from retinal fundus images by enhancing predictions' consistency. Methods: A convolutional neural network (CNN) was optimized in three different manners to predict DR grade from eye fundus images. The optimization criteria were (1) the standard cross-entropy (CE) loss; (2) CE supplemented with label smoothing (LS), a regularization approach widely employed in computer vision tasks; and (3) our proposed non-uniform label smoothing (N-ULS), a modification of LS that models the underlying structure of expert annotations. Results: Performance was measured in terms of quadratic-weighted kappa score (quad-kappa) and average area under the receiver operating curve (AUROC), as well as with suitable metrics for analyzing diagnostic consistency, like weighted precision, recall, and F1 score, or Matthews correlation coefficient. While LS generally harmed the performance of the CNN, N-ULS statistically significantly improved performance with respect to CE in terms quad-kappa score (73.17 vs. 77.69, P < 0.025), without any performance decrease in average AUROC. N-ULS achieved this while simultaneously increasing performance for all other analyzed metrics. Conclusions: For extending standard modeling approaches from DR detection to the more complex task of DR grading, it is essential to consider the underlying structure of expert annotations. The approach introduced in this article can be easily implemented in conjunction with deep neural networks to increase their consistency without sacrificing per-class performance. Translational Relevance: A straightforward modification of current standard training practices of CNNs can substantially improve consistency in DR grading, better modeling expert annotations and human variability.

引用

页码：1 / 8

页数：8

共 26 条

[11] Cost of a Community-Based Diabetic Retinopathy Screening Program [J].