CABNet: Category Attention Block for Imbalanced Diabetic Retinopathy Grading

被引:193
作者
He, Along [1 ]
Li, Tao [1 ]
Li, Ning [1 ]
Wang, Kai [1 ]
Fu, Huazhu [2 ]
机构
[1] Nankai Univ, Coll Comp Sci, Tianjin 300350, Peoples R China
[2] Incept Inst Artificial Intelligence IIAI, Abu Dhabi, U Arab Emirates
关键词
Lesions; Task analysis; Feature extraction; Diabetes; Machine learning; Image segmentation; Training; Diabetic retinopathy grading; attention mechanism; category attention block (CAB); global attention block (GAB); NEURAL-NETWORK;
D O I
10.1109/TMI.2020.3023463
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Diabetic Retinopathy (DR) grading is challenging due to the presence of intra-class variations, small lesions and imbalanced data distributions. The key for solving fine-grained DR grading is to find more discriminative features corresponding to subtle visual differences, such as microaneurysms, hemorrhages and soft exudates. However, small lesions are quite difficult to identify using traditional convolutional neural networks (CNNs), and an imbalanced DR data distribution will cause the model to pay too much attention to DR grades with more samples, greatly affecting the final grading performance. In this article, we focus on developing an attention module to address these issues. Specifically, for imbalanced DR data distributions, we propose a novel Category Attention Block (CAB), which explores more discriminative region-wise features for each DR grade and treats each category equally. In order to capture more detailed small lesion information, we also propose the Global Attention Block (GAB), which can exploit detailed and class-agnostic global attention feature maps for fundus images. By aggregating the attention blocks with a backbone network, the CABNet is constructed for DR grading. The attention blocks can be applied to a wide range of backbone networks and trained efficiently in an end-to-end manner. Comprehensive experiments are conducted on three publicly available datasets, showing that CABNet produces significant performance improvements for existing state-of-the-art deep architectures with few additional parameters and achieves the state-of-the-art results for DR grading. Code and models will be available at https://github.com/he2016012996/CABnet.
引用
收藏
页码:143 / 153
页数:11
相关论文
共 44 条
[1]   Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks [J].
Cao, Chunshui ;
Liu, Xianming ;
Yang, Yi ;
Yu, Yinan ;
Wang, Jiang ;
Wang, Zilei ;
Huang, Yongzhen ;
Wang, Liang ;
Huang, Chang ;
Xu, Wei ;
Ramanan, Deva ;
Huang, Thomas S. .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2956-2964
[2]   GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond [J].
Cao, Yue ;
Xu, Jiarui ;
Lin, Stephen ;
Wei, Fangyun ;
Hu, Han .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :1971-1980
[3]   Attention to Scale: Scale-aware Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Yang, Yi ;
Wang, Jiang ;
Xu, Wei ;
Yuille, Alan L. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3640-3649
[4]  
Chen Y., 2017, IEEE INT CONF SENS, P1
[5]   IDF Diabetes Atlas: Global estimates of diabetes prevalence for 2017 and projections for 2045 [J].
Cho, N. H. ;
Shaw, J. E. ;
Karuranga, S. ;
Huang, Y. ;
Fernandes, J. D. da Rocha ;
Ohlrogge, A. W. ;
Malanda, B. .
DIABETES RESEARCH AND CLINICAL PRACTICE, 2018, 138 :271-281
[6]   Attention-based Dropout Layer for Weakly Supervised Object Localization [J].
Choe, Junsuk ;
Shim, Hyunjung .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2214-2223
[7]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[8]  
Dai L, 2017, COMPUTER SCIENCE AND TECHNOLOGY (CST2016), P525
[9]   A deep learning interpretable classifier for diabetic retinopathy disease grading [J].
de la Torre, Jordi ;
Valls, Aida ;
Puig, Domenec .
NEUROCOMPUTING, 2020, 396 :465-476
[10]   FEEDBACK ON A PUBLICLY DISTRIBUTED IMAGE DATABASE: THE MESSIDOR DATABASE [J].
Decenciere, Etienne ;
Zhang, Xiwei ;
Cazuguel, Guy ;
Lay, Bruno ;
Cochener, Beatrice ;
Trone, Caroline ;
Gain, Philippe ;
Ordonez-Varela, John-Richard ;
Massin, Pascale ;
Erginay, Ali ;
Charton, Beatrice ;
Klein, Jean-Claude .
IMAGE ANALYSIS & STEREOLOGY, 2014, 33 (03) :231-234