Diffusion Probabilistic Learning With Gate-Fusion Transformer and Edge-Frequency Attention for Retinal Vessel Segmentation

被引:1
作者
Li, Yang [1 ]
Xu, Lingfu [1 ]
Jin, Yizhu [1 ]
Kuang, Xihe [2 ]
Zhang, Yue [2 ]
Cui, Weigang [2 ,3 ]
Zhang, Teng [2 ]
机构
[1] Beihang Univ, Dept Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Univ Hong Kong, Fac Med, Dept Orthopaed & Traumatol, Hong Kong, Peoples R China
[3] Beihang Univ, Sch Engn Med, Beijing 100191, Peoples R China
基金
北京市自然科学基金; 国家杰出青年科学基金; 中国国家自然科学基金;
关键词
Image segmentation; Retinal vessels; Transformers; Image edge detection; Probabilistic logic; Noise measurement; Logic gates; Diffusion probabilistic model; edge-frequency attention; fundus image; transformer; vessel segmentation; ENCODER NETWORK; IMAGES;
D O I
10.1109/TIM.2024.3420264
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Retinal vessel topology provides unique biological information for the diagnosis of fundus diseases. However, most existing deep learning-based vessel segmentation methods mainly focus on global fundus structure, which may suffer from generalization errors and blurring caused by lesions and image noise. Besides, vessel edge details and feature channel information are generally ignored or not considered simultaneously, and this insufficiency commonly leads to suboptimal segmentation performance. To tackle these issues, we propose a novel diffusion probabilistic learning with gate-fusion transformer and edge-frequency attention (DPL-GFT-EFA) for retinal vessel segmentation. Specifically, the DPL leverages the image denoising as a proxy task to pretrain the segmentation model, which enhances the anti-interference ability by learning noise-related information. Then, the gate-fusion transformer (GFT) block fuses high-level representations from condition and diffusion encoders (DEs) with a gate mechanism, highlighting the mutual features between fundus patterns and noisy images. Finally, the edge-frequency attention (EFA) block is introduced to further consolidate the vessel edge details and discriminative channel features. We conduct the experiments on five public retinal image datasets, and achieve the accuracies of 97.05%, 97.70%, 97.71%, 97.16%, and 97.26% on DRIVE, STARE, CHASE_DB1, HRF, and IOSTAR datasets, respectively. These results demonstrate that the proposed method outperforms state-of-the-art models and achieve promising segmentation performance even in complex images containing fundus lesions and noise. Our source code is available at https://github.com/YangLibuaa/DPL-GTF-EFA.
引用
收藏
页数:13
相关论文
共 55 条
[1]  
Amit T, 2022, Arxiv, DOI arXiv:2112.00390
[2]   Prompt Deep Light-Weight Vessel Segmentation Network (PLVS-Net) [J].
Arsalan, Muhammad ;
Khan, Tariq M. ;
Naqvi, Syed Saud ;
Nawaz, Mehmood ;
Razzak, Imran .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) :1363-1371
[3]   A new supervised retinal vessel segmentation method based on robust hybrid features [J].
Aslani, Shahab ;
Sarnel, Haldun .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2016, 30 :1-12
[4]   Performance analysis of descriptive statistical features in retinal vessel segmentation via fuzzy logic, ANN, SVM, and classifier fusion [J].
Barkana, Buket D. ;
Saricicek, Inci ;
Yildirim, Burak .
KNOWLEDGE-BASED SYSTEMS, 2017, 118 :165-176
[5]  
Chen J., 2021, arXiv, DOI [DOI 10.48550/ARXIV.2102.04306, 10.48550/arXiv.2102.04306]
[6]  
Chen Liangjun, 2020, Med Image Comput Comput Assist Interv, V12267, P646, DOI 10.1007/978-3-030-59728-3_63
[7]   Deep Multiview Module Adaption Transfer Network for Subject-Specific EEG Recognition [J].
Cui, Weigang ;
Xiang, Yansong ;
Wang, Yifan ;
Yu, Tao ;
Liao, Xiao-Feng ;
Hu, Bin ;
Li, Yang .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) :2917-2930
[8]   Personalized Functional Connectivity Based Spatio-Temporal Aggregated Attention Network for MCI Identification [J].
Cui, Weigang ;
Ma, Yulan ;
Ren, Jianxun ;
Liu, Jingyu ;
Ma, Guolin ;
Liu, Hesheng ;
Li, Yang .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 :2257-2267
[9]   An Ensemble Classification-Based Approach Applied to Retinal Blood Vessel Segmentation [J].
Fraz, Muhammad Moazam ;
Remagnino, Paolo ;
Hoppe, Andreas ;
Uyyanonvara, Bunyarit ;
Rudnicka, Alicja R. ;
Owen, Christopher G. ;
Barman, Sarah A. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2012, 59 (09) :2538-2548
[10]   CE-Net: Context Encoder Network for 2D Medical Image Segmentation [J].
Gu, Zaiwang ;
Cheng, Jun ;
Fu, Huazhu ;
Zhou, Kang ;
Hao, Huaying ;
Zhao, Yitian ;
Zhang, Tianyang ;
Gao, Shenghua ;
Liu, Jiang .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (10) :2281-2292