MediDRNet: Tackling category imbalance in diabetic retinopathy classification with dual-branch learning and prototypical contrastive learning

被引:1
作者
Teng, Siying [1 ]
Wang, Bo [2 ]
Yang, Feiyang [3 ]
Yi, Xingcheng [4 ]
Zhang, Xinmin [5 ]
Sun, Yabin [1 ]
机构
[1] First Hosp Jilin Univ, Dept Ophthalmol, Changchun 130021, Jilin, Peoples R China
[2] Univ Minho, P-4710057 Braga, Braga District, Portugal
[3] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Jilin, Peoples R China
[4] First Hosp Jilin Univ, Lab Canc Precis Med, Changchun 130013, Jilin, Peoples R China
[5] Jilin Univ, Sch Pharmaceut Sci, Dept Regenerat Med, Changchun 130021, Jilin, Peoples R China
关键词
Diabetic retinopathy; Imbalanced medical image classification; Prototypical supervised contrastive learning; Dual-branch network; Convolutional block attention module;
D O I
10.1016/j.cmpb.2024.108230
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective: The classification of diabetic retinopathy (DR) aims to utilize the implicit information in images for early diagnosis, to prevent and mitigate the further worsening of the condition. However, existing methods are often limited by the need to operate within large, annotated datasets to show significant advantages. Additionally, the number of samples for different categories within the dataset needs to be evenly distributed, because the characteristic of sample imbalance distribution can lead to an excessive focus on high -frequency disease categories, while neglecting the less common but equally important disease categories. Therefore, there is an urgent need to develop a new classification method that can effectively alleviate the issue of sample distribution imbalance, thereby enhancing the accuracy of diabetic retinopathy classification. Methods: In this work, we propose MediDRNet, a dual -branch network model based on prototypical contrastive learning. This model adopts prototype contrastive learning, creating prototypes for different levels of lesions, ensuring they represent the core features of each lesion level. It classifies by comparing the similarity between data points and their category prototypes. Our dual -branch network structure effectively resolves the issue of category imbalance and improves classification accuracy by emphasizing subtle differences in retinal lesions. Moreover, our approach combines a dual -branch network with specific lesion -level prototypes for core feature representation and incorporates the convolutional block attention module for enhanced lesion feature identification. Results: Our experiments using both the Kaggle and UWF classification datasets have demonstrated that MediDRNet exhibits exceptional performance compared to other advanced models in the industry, especially on the UWF DR classification dataset where it achieved state-of-the-art performance across all metrics. On the Kaggle DR classification dataset, it achieved the highest average classification accuracy (0.6327) and Macro -F1 score (0.6361). Particularly in the classification tasks for minority categories of diabetic retinopathy on the Kaggle dataset (Grades 1, 2, 3, and 4), the model reached high classification accuracies of 58.08%, 55.32%, 69.73%, and 90.21%, respectively. In the ablation study, the MediDRNet model proved to be more effective in feature extraction from diabetic retinal fundus images compared to other feature extraction methods. Conclusions: This study employed prototype contrastive learning and bidirectional branch learning strategies, successfully constructing a grading system for diabetic retinopathy lesions within imbalanced diabetic retinopathy datasets. Through a dual -branch network, the feature learning branch effectively facilitated a smooth transition of features from the grading network to the classification learning branch, accurately identifying minority sample categories. This method not only effectively resolved the issue of sample imbalance but also provided strong support for the precise grading and early diagnosis of diabetic retinopathy in clinical applications, showcasing exceptional performance in handling complex diabetic retinopathy datasets. Moreover, this research significantly improved the efficiency of prevention and management of disease
引用
收藏
页数:10
相关论文
共 50 条
[41]   Research Progress on Deep Learning in Field of Diabetic Retinopathy Classification [J].
Sun, Shilei ;
Li, Ming ;
Liu, Jing ;
Ma, Jingang ;
Chen, Tianzhen .
Computer Engineering and Applications, 2024, 60 (08) :16-30
[42]   Deep CNNs for Diabetic Retinopathy Classification: A Transfer Learning Perspective [J].
Baskar, Ruthran ;
Sabu, Emmanuel ;
Mazo, Claudia .
IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024,
[43]   Dual-branch interactive cross-frequency attention network for deep feature learning [J].
Li, Qiufu ;
Shen, Linlin .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 254
[44]   Lesion-Based Contrastive Learning for Diabetic Retinopathy Grading from Fundus Images [J].
Huang, Yijin ;
Lin, Li ;
Cheng, Pujin ;
Lyu, Junyan ;
Tang, Xiaoying .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 :113-123
[45]   An Interpretable Ensemble Deep Learning Model for Diabetic Retinopathy Disease Classification [J].
Jiang, Hongyang ;
Yang, Kang ;
Gao, Mengdi ;
Zhang, Dongdong ;
Ma, He ;
Qian, Wei .
2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, :2045-2048
[46]   Evolutionary Intelligence and Deep Learning Enabled Diabetic Retinopathy Classification Model [J].
Alqaralleh, Bassam A. Y. ;
Aldhaban, Fahad ;
Abukaraki, Anas ;
AlQaralleh, Esam A. .
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01) :86-100
[47]   Advancing diabetic retinopathy classification using ensemble deep learning approaches [J].
Biswas, Ankur ;
Banik, Rita .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 106
[48]   Deep Learning for the Detection and Classification of Diabetic Retinopathy with an Improved Activation Function [J].
Bhimavarapu, Usharani ;
Battineni, Gopi .
HEALTHCARE, 2023, 11 (01)
[49]   A Novel Transformer Model With Multiple Instance Learning for Diabetic Retinopathy Classification [J].
Yang, Yaoming ;
Cai, Zhili ;
Qiu, Shuxia ;
Xu, Peng .
IEEE ACCESS, 2024, 12 :6768-6776
[50]   A Classification Method for Diabetic Retinopathy Based on Self-supervised Learning [J].
Long, Fei ;
Xiong, Haoren ;
Sang, Jun .
ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT I, ICIC 2024, 2024, 14881 :347-357