Effectiveness of encoder-decoder deep learning approach for colorectal polyp segmentation in colonoscopy images

被引:1
作者
Hamza, Ameer [1 ]
Bilal, Muhammad [2 ,3 ]
Ramzan, Muhammad [4 ]
Malik, Nadia [4 ,5 ]
机构
[1] Univ Sargodha, Fac Comp & IT, Dept Comp Sci, Sargodha 40100, Pakistan
[2] Univ Florida, Dept Pharmaceut Outcomes & Policy, Gainesville, FL 32610 USA
[3] Natl Univ Comp & Emerging Sci, Dept Software Engn, Islamabad 44000, Pakistan
[4] Univ Sargodha, Fac Comp & Informat Technol, Dept Software Engn, Sargodha 40100, Pakistan
[5] COMSATS Univ Islamabad, Dept Management Sci, Islamabad 45550, Pakistan
关键词
Medical Image Segmentation; Semantic Segmentation; Polyp Segmentation; Deep Learning; Kvasir-SEG; CVC-ClinicDB; MISS RATE; NETWORK;
D O I
10.1007/s10489-024-06167-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Colorectal cancer is considered one of the deadliest diseases, contributing to an alarming increase in annual deaths worldwide, with colorectal polyps recognized as precursors to this malignancy. Early and accurate detection of these polyps is crucial for reducing the mortality rate of colorectal cancer. However, the manual detection of polyps is a time-consuming process and requires the expertise of trained medical professionals. Moreover, it often misses polyps due to their varied size, color, and texture. Computer-aided diagnosis systems offer potential improvements, but they often struggle with precision in complex visual environments. This study presents an enhanced deep learning approach using encoder-decoder architecture for colorectal polyp segmentation to capture and utilize complex feature representations. Our approach introduces an enhanced dual attention mechanism, combining spatial and channel-wise attention to focus precisely on critical features. Channel-wise attention, implemented via an optimized Squeeze-and-Excitation (S&E) block, allows the network to capture comprehensive contextual information and interrelationships among different channels, ensuring a more refined feature selection process. The experimental results showed that the proposed model achieved a mean Intersection over Union (IoU) of 0.9054 and 0.9277, a dice coefficient of 0.9006 and 0.9128, a precision of 0.8985 and 0.9517, a recall of 0.9190 and 0.9094, and an accuracy of 0.9806 and 0.9907 on the Kvasir-SEG and CVC-ClinicDB datasets, respectively. Moreover, the proposed model outperforms the existing state-of-the-art resulting in improved patient outcomes with the potential to enhance the early detection of colorectal polyps.
引用
收藏
页数:24
相关论文
共 50 条
[41]   Benchmark of Deep Encoder-Decoder Architectures for Head and Neck Tumor Segmentation in Magnetic Resonance Images: Contribution to the HNTSMRG Challenge [J].
Wodzinski, Marek .
HEAD AND NECK TUMOR SEGMENTATION FOR MR-GUIDED APPLICATIONS, HNTS-MRG 2024, 2025, 15273 :204-213
[42]   Deep encoder-decoder networks for belt longitudinal tear detection [J].
You, Lei ;
Luo, Minghua ;
Zhu, Xinglin ;
Zhou, Bin .
MEASUREMENT & CONTROL, 2025, 58 (05) :643-655
[43]   Image Segmentation using Encoder-Decoder Architecture and Region Consistency Activation [J].
Naik, Dinesh ;
Jaidhar, C. D. .
2016 11TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2016, :724-729
[44]   Encoder-Decoder Structure Fusing Depth Information for Outdoor Semantic Segmentation [J].
Chen, Songnan ;
Tang, Mengxia ;
Dong, Ruifang ;
Kan, Jiangming .
APPLIED SCIENCES-BASEL, 2023, 13 (17)
[45]   Ensemble of Instance Segmentation Models for Polyp Segmentation in Colonoscopy Images [J].
Kang, Jaeyong ;
Gwak, Jeonghwan .
IEEE ACCESS, 2019, 7 :26440-26447
[46]   Optimized encoder-decoder cascaded deep convolutional network for leaf disease image segmentation [J].
Femi, David ;
Mukunthan, Manapakkam Anandan .
NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2025, 36 (03) :480-506
[47]   An approach of polyp segmentation from colonoscopy images using Dilated-U-Net-Seg - A deep learning network [J].
Karthikha, R. ;
Jamal, D. Najumnissa ;
Rafiammal, S. Syed .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 93
[48]   Deep Learning-Based Short Text Summarization: An Integrated BERT and Transformer Encoder-Decoder Approach [J].
Ghanem, Fahd A. ;
Padma, M. C. ;
Abdulwahab, Hudhaifa M. ;
Alkhatib, Ramez .
COMPUTATION, 2025, 13 (04)
[49]   Lightweight Deep Learning Model for Real-Time Colorectal Polyp Segmentation [J].
Jeong, Seung-Min ;
Lee, Seung-Gun ;
Seok, Chae-Lin ;
Lee, Eui-Chul ;
Lee, Jun-Young .
ELECTRONICS, 2023, 12 (09)
[50]   Automatic colon polyp detection using Convolutional Encoder-Decoder model [J].
Bardhi, Ornela ;
Sierra-Sosa, Daniel ;
Garcia-Zapirain, Begonya ;
Elmaghraby, Adel .
2017 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2017, :445-448