CABNet: Category Attention Block for Imbalanced Diabetic Retinopathy Grading

Cited by: 193

Authors:

He, Along [1]

Li, Tao [1]

Li, Ning [1]

Wang, Kai [1]

Fu, Huazhu [2]

Affiliations:

[1] Nankai Univ, Coll Comp Sci, Tianjin 300350, Peoples R China

[2] Incept Inst Artificial Intelligence IIAI, Abu Dhabi, U Arab Emirates

Source:

IEEE TRANSACTIONS ON MEDICAL IMAGING | 2021 / Vol. 40 / No. 1

Keywords:

Lesions; Task analysis; Feature extraction; Diabetes; Machine learning; Image segmentation; Training; Diabetic retinopathy grading; attention mechanism; category attention block (CAB); global attention block (GAB); NEURAL-NETWORK;

DOI:

10.1109/TMI.2020.3023463

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification codes:

081203 ; 0835 ;

Abstract:

Diabetic Retinopathy (DR) grading is challenging due to the presence of intra-class variations, small lesions and imbalanced data distributions. The key for solving fine-grained DR grading is to find more discriminative features corresponding to subtle visual differences, such as microaneurysms, hemorrhages and soft exudates. However, small lesions are quite difficult to identify using traditional convolutional neural networks (CNNs), and an imbalanced DR data distribution will cause the model to pay too much attention to DR grades with more samples, greatly affecting the final grading performance. In this article, we focus on developing an attention module to address these issues. Specifically, for imbalanced DR data distributions, we propose a novel Category Attention Block (CAB), which explores more discriminative region-wise features for each DR grade and treats each category equally. In order to capture more detailed small lesion information, we also propose the Global Attention Block (GAB), which can exploit detailed and class-agnostic global attention feature maps for fundus images. By aggregating the attention blocks with a backbone network, the CABNet is constructed for DR grading. The attention blocks can be applied to a wide range of backbone networks and trained efficiently in an end-to-end manner. Comprehensive experiments are conducted on three publicly available datasets, showing that CABNet produces significant performance improvements for existing state-of-the-art deep architectures with few additional parameters and achieves the state-of-the-art results for DR grading. Code and models will be available at https://github.com/he2016012996/CABnet.
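The abstract states that CAB explores region-wise features for each DR grade and treats each category equally, but it gives no implementation details. The following is only a minimal sketch of that idea under stated assumptions, not the authors' implementation: it assumes each grade owns a fixed group of `k` feature channels and that the per-grade maps are combined with equal weight. The function name `category_attention`, the channel layout, and the parameter `k` are hypothetical.

```python
def category_attention(feature_maps, num_classes, k):
    """Collapse num_classes * k channel maps into one spatial attention map.

    feature_maps: list of channels, each an H x W list of lists of floats.
    Assumption (hypothetical layout): channels [c*k, (c+1)*k) belong to class c.
    """
    if len(feature_maps) != num_classes * k:
        raise ValueError("expected num_classes * k channels")
    height, width = len(feature_maps[0]), len(feature_maps[0][0])
    attention = [[0.0] * width for _ in range(height)]
    for c in range(num_classes):
        group = feature_maps[c * k:(c + 1) * k]
        for i in range(height):
            for j in range(width):
                # Average the k channels assigned to this class.
                class_mean = sum(channel[i][j] for channel in group) / k
                # Every class contributes with the same 1/num_classes weight,
                # independent of how many training samples the class has.
                attention[i][j] += class_mean / num_classes
    return attention


# Toy input: 2 classes, k = 2 channels per class, 1 x 2 spatial maps.
toy_maps = [
    [[1.0, 0.0]], [[3.0, 0.0]],  # channels assumed to belong to class 0
    [[0.0, 2.0]], [[0.0, 4.0]],  # channels assumed to belong to class 1
]
toy_attention = category_attention(toy_maps, num_classes=2, k=2)
```

In this sketch each grade receives the same number of channels and the same weight in the final map, which is one simple way to keep a grade with few samples from being crowded out by the majority grades.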


Pages: 143-153

Page count: 11
