UGS-M3F: unified gated swin transformer with multi-feature fully fusion for retinal blood vessel segmentation

被引:1
作者
Bakkouri, Ibtissam [1 ]
Bakkouri, Siham [2 ]
机构
[1] Sultan Moulay Slimane Univ, Lab LS2ME, Beni Mellal, Morocco
[2] Sultan Moulay Slimane Univ, TIAD Lab, Beni Mellal, Morocco
关键词
Retinal blood vessels; Fully fusion; Multi-feature; Swin transformer; Gated transformer; Multi-context feature; U-NET; CLASSIFICATION; IMAGES;
D O I
10.1186/s12880-025-01616-1
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Automated segmentation of retinal blood vessels in fundus images plays a key role in providing ophthalmologists with critical insights for the non-invasive diagnosis of common eye diseases. Early and precise detection of these conditions is essential for preserving vision, making vessel segmentation crucial for identifying vascular diseases that pose a threat to vision. However, accurately segmenting blood vessels in fundus images is challenging due to factors such as significant variability in vessel scale and appearance, occlusions, complex backgrounds, variations in image quality, and the intricate branching patterns of retinal vessels. To overcome these challenges, the Unified Gated Swin Transformer with Multi-Feature Full Fusion (UGS-M3F) model has been developed as a powerful deep learning framework tailored for retinal vessel segmentation. UGS-M3F leverages its Unified Multi-Context Feature Fusion (UM2F) and Gated Boundary-Aware Swin Transformer (GBS-T) modules to capture contextual information across different levels. The UM2F module enhances the extraction of detailed vessel features, while the GBS-T module emphasizes small vessel detection and ensures extensive coverage of large vessels. Extensive experimental results on publicly available datasets, including FIVES, DRIVE, STARE, and CHAS_DB1, show that UGS-M3F significantly outperforms existing state-of-the-art methods. Specifically, UGS-M3F achieves a Dice Coefficient (DC) improvement of 2.12% on FIVES, 1.94% on DRIVE, 2.52% on STARE, and 2.14% on CHAS_DB1 compared to the best-performing baseline. This improvement in segmentation accuracy has the potential to revolutionize diagnostic techniques, allowing for more precise disease identification and management across a range of ocular conditions.
引用
收藏
页数:20
相关论文
共 59 条
[1]   Multi-Layer Preprocessing and U-Net with Residual Attention Block for Retinal Blood Vessel Segmentation [J].
Alsayat, Ahmed ;
Elmezain, Mahmoud ;
Alanazi, Saad ;
Alruily, Meshrif ;
Mostafa, Ayman Mohamed ;
Said, Wael .
DIAGNOSTICS, 2023, 13 (21)
[2]   Width Attention based Convolutional Neural Network for Retinal Vessel Segmentation [J].
Alvarado-Carrillo, Dora E. ;
Dalmau-Cedeno, Oscar S. .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 209
[3]   Advancements in Deep Learning for B-Mode Ultrasound Segmentation: A Comprehensive Review [J].
Ansari, Mohammed Yusuf ;
Mangalote, Iffa Afsa Changaai ;
Meher, Pramod Kumar ;
Aboumarzouk, Omar ;
Al-Ansari, Abdulla ;
Halabi, Osama ;
Dakua, Sarada Prasad .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (03) :2126-2149
[4]   Unveiling the future of breast cancer assessment: a critical review on generative adversarial networks in elastography ultrasound [J].
Ansari, Mohammed Yusuf ;
Qaraqe, Marwa ;
Righetti, Raffaella ;
Serpedin, Erchin ;
Qaraqe, Khalid .
FRONTIERS IN ONCOLOGY, 2023, 13
[5]   Dense-PSP-UNet: A neural network for fast inference liver ultrasound segmentation [J].
Ansari, Mohammed Yusuf ;
Yang, Yin ;
Meher, Pramod Kumar ;
Dakua, Sarada Prasad .
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
[6]   A lightweight neural network with multiscale feature enhancement for liver CT segmentation [J].
Ansari, MohammedYusuf ;
Yang, Yin ;
Balakrishnan, Shidin ;
Abinahed, Julien ;
Al-Ansari, Abdulla ;
Warfa, Mohamed ;
Almokdad, Omran ;
Barah, Ali ;
Omer, Ahmed ;
Singh, AjayVikram ;
Meher, Pramod Kumar ;
Bhadra, Jolly ;
Halabi, Osama ;
Azampour, Mohammad Farid ;
Navab, Nassir ;
Wendler, Thomas ;
Dakua, Sarada Prasad .
SCIENTIFIC REPORTS, 2022, 12 (01)
[7]   2MGAS-Net: multi-level multi-scale gated attentional squeezed network for polyp segmentation [J].
Bakkouri, Ibtissam ;
Bakkouri, Siham .
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (6-7) :5377-5386
[8]   Multi-scale CNN based on region proposals for efficient breast abnormality recognition [J].
Bakkouri, Ibtissam ;
Afdel, Karim .
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (10) :12939-12960
[9]   An adaptive CU size decision algorithm based on gradient boosting machines for 3D-HEVC inter-coding [J].
Bakkouri, Siham ;
Elyousfi, Abderrahmane .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (21) :32539-32557
[10]   Early Termination of CU Partition Based on Boosting Neural Network for 3D-HEVC Inter-Coding [J].
Bakkouri, Siham ;
Elyousfi, Abderrahmane .
IEEE ACCESS, 2022, 10 :13870-13883