Cross-Scale Fuzzy Holistic Attention Network for Diabetic Retinopathy Grading From Fundus Images

Cited by: 2

Authors:

Lin, Zhijie ^{[1,2]}

He, Zhaoshui ^{[1,2]}

Wang, Xu ^{[3]}

Su, Wenqing ^{[1,4]}

Tan, Ji ^{[1,4]}

Deng, Yamei ^{[5]}

Xie, Shengli ^{[1,4]}

Affiliations:

[1] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China

[2] Guangdong Prov Key Lab Intelligent Syst & Optimiza, Guangzhou 510006, Peoples R China

[3] Guangdong Mech & Elect Polytech, Sch Elect & Commun, Guangzhou 510550, Peoples R China

[4] Minist Educ, Key Lab IoT Intelligent Informat Proc & Syst Integ, Guangzhou 510006, Peoples R China

[5] Guangzhou Med Univ, Affiliated Hosp 3, Dept Radiol, Guangzhou 510150, Peoples R China

Source:

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2025 / Vol. 9 / No. 03

Funding:

National Natural Science Foundation of China;

Keywords:

Lesions; Feature extraction; Retina; Deep learning; Uncertainty; Solid modeling; Medical diagnostic imaging; Support vector machines; Visual impairment; Interference; Fuzzy deep learning; diabetic retinopathy grading; attention network; fundus image; computer-aided diagnosis (CAD);

DOI:

10.1109/TETCI.2025.3543361

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Diabetic Retinopathy (DR) is one of the leading causes of visual impairment and blindness in diabetic patients worldwide. Accurate Computer-Aided Diagnosis (CAD) systems can aid in the early diagnosis and treatment of DR patients to reduce the risk of vision loss, but DR grading remains challenging for two reasons: 1) the relatively low contrast and ambiguous boundaries between pathological lesions and normal retinal regions, and 2) the considerable diversity in lesion size and appearance. In this paper, a Cross-Scale Fuzzy Holistic Attention Network (CSFHANet) is proposed for DR grading using fundus images. It consists of two main components: Fuzzy-Enhanced Holistic Attention (FEHA) and Fuzzy Learning-based Cross-Scale Fusion (FLCSF). FEHA is developed to adaptively recalibrate the importance of feature elements by assigning fuzzy weights across both channel and spatial domains, which can enhance the model's ability to learn the features of lesion regions while reducing the interference from irrelevant information in normal retinal regions. Then, the FLCSF module is designed to eliminate the uncertainty in fused multi-scale features derived from different branches by utilizing fuzzy membership functions, producing a more comprehensive and refined feature representation from complex DR lesions. Extensive experiments on the Messidor-2 and DDR datasets demonstrate that the proposed CSFHANet exhibits superior performance compared to state-of-the-art methods.
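
The abstract mentions assigning fuzzy weights to feature elements through fuzzy membership functions but gives no formulas. The following is only an illustrative sketch of that general idea, not the authors' FEHA or FLCSF modules. It assumes a Gaussian membership function, channel-wise global average pooling, and fixed `center` and `sigma` values; all of these, and every function name, are hypothetical choices made for the example.

```python
import math

def gaussian_membership(x, center, sigma):
    """Degree in (0, 1] to which x belongs to a Gaussian-shaped fuzzy set."""
    return math.exp(-((x - center) ** 2) / (2.0 * sigma ** 2))

def fuzzy_channel_weights(feature_maps, center, sigma):
    """Return one fuzzy weight per channel, computed from the channel mean.

    feature_maps: list of channels, each a 2-D list (rows of floats).
    center, sigma: hypothetical membership parameters; a real network
    would typically learn them during training.
    """
    weights = []
    for channel in feature_maps:
        values = [v for row in channel for v in row]
        pooled = sum(values) / len(values)  # global average pooling
        weights.append(gaussian_membership(pooled, center, sigma))
    return weights

def recalibrate(feature_maps, weights):
    """Scale every element of each channel by that channel's fuzzy weight."""
    return [[[v * w for v in row] for row in channel]
            for channel, w in zip(feature_maps, weights)]

# Toy input: two 2x2 channels (assumed values, for illustration only).
features = [
    [[1.0, 1.0], [1.0, 1.0]],  # channel mean equals the center
    [[3.0, 3.0], [3.0, 3.0]],  # channel mean is two sigmas from the center
]
weights = fuzzy_channel_weights(features, center=1.0, sigma=1.0)
scaled = recalibrate(features, weights)
```

In this toy setup the first channel keeps its values because its mean sits exactly at the assumed center, while the second channel is attenuated because its mean lies further away. The paper applies fuzzy weights across both channel and spatial domains; this sketch covers only a channel-wise case.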


Pages: 2164-2178

Number of pages: 15

References: 68 in total

[1] A Hybrid Convolutional Neural Network Model for Automatic Diabetic Retinopathy Classification From Fundus Images [J].

Ali, Ghulam ;

Dastgir, Aqsa ;

Iqbal, Muhammad Waseem ;

Anwar, Muhammad ;

Faheem, Muhammad .

IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2023, 11 :341-350

[2] EDR-Net: Lightweight Deep Neural Network Architecture for Detecting Referable Diabetic Retinopathy [J].

Aujih, Ahmad Bukhari ;

Shapiai, Mohd Ibrahim ;

Meriaudeau, Fabrice ;

Tang, Tong Boon .

IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2022, 16 (03) :467-478

[3] Bayesian-neural-network-based strain estimation approach for optical coherence elastography [J].

Bai, Yulei ;

Zhang, Kangyang ;

Mo, Rui ;

Ni, Zihao ;

He, Zhaoshui ;

Xie, Shengli ;

Dong, Bo .

OPTICA, 2024, 11 (09) :1334-1345

[4] Features extraction using encoded local binary pattern for detection and grading diabetic retinopathy [J].

Berbar, Mohamed A.

HEALTH INFORMATION SCIENCE AND SYSTEMS, 2022, 10 (01)

[5] An interpretable dual attention network for diabetic retinopathy grading: IDANet [J].

Bhati, Amit ;

Gour, Neha ;

Khanna, Pritee ;

Ojha, Aparajita ;

Werghi, Naoufel .

ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 149

[6] A deep learning interpretable classifier for diabetic retinopathy disease grading [J].

de la Torre, Jordi ;

Valls, Aida ;

Puig, Domenec .

NEUROCOMPUTING, 2020, 396 :465-476

[7] FEEDBACK ON A PUBLICLY DISTRIBUTED IMAGE DATABASE: THE MESSIDOR DATABASE [J].

Decenciere, Etienne ;

Zhang, Xiwei ;

Cazuguel, Guy ;

Lay, Bruno ;

Cochener, Beatrice ;

Trone, Caroline ;

Gain, Philippe ;

Ordonez-Varela, John-Richard ;

Massin, Pascale ;

Erginay, Ali ;

Charton, Beatrice ;

Klein, Jean-Claude .

IMAGE ANALYSIS & STEREOLOGY, 2014, 33 (03) :231-234

[8] FTransCNN: Fusing Transformer and a CNN based on fuzzy logic for uncertain medical image segmentation [J].

Ding, Weiping ;

Wang, Haipeng ;

Huang, Jiashuang ;

Ju, Hengrong ;

Geng, Yu ;

Lin, Chin-Teng ;

Pedrycz, Witold .

INFORMATION FUSION, 2023, 99

[9] Multimodal Infant Brain Segmentation by Fuzzy-Informed Deep Learning [J].

Ding, Weiping ;

Abdel-Basset, Mohamed ;

Hawash, Hossam ;

Pedrycz, Witold .

IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (04) :1088-1101

[10] Lip Image Segmentation Based on a Fuzzy Convolutional Neural Network [J].

Guan, Cheng ;

Wang, Shilin ;

Liew, Alan Wee-Chung .

IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (07) :1242-1251
