Effectiveness of encoder-decoder deep learning approach for colorectal polyp segmentation in colonoscopy images

Cited by: 1
Authors
Hamza, Ameer [1]
Bilal, Muhammad [2,3]
Ramzan, Muhammad [4]
Malik, Nadia [4,5]
Affiliations
[1] Univ Sargodha, Fac Comp & IT, Dept Comp Sci, Sargodha 40100, Pakistan
[2] Univ Florida, Dept Pharmaceut Outcomes & Policy, Gainesville, FL 32610 USA
[3] Natl Univ Comp & Emerging Sci, Dept Software Engn, Islamabad 44000, Pakistan
[4] Univ Sargodha, Fac Comp & Informat Technol, Dept Software Engn, Sargodha 40100, Pakistan
[5] COMSATS Univ Islamabad, Dept Management Sci, Islamabad 45550, Pakistan
Keywords
Medical Image Segmentation; Semantic Segmentation; Polyp Segmentation; Deep Learning; Kvasir-SEG; CVC-ClinicDB; MISS RATE; NETWORK
DOI
10.1007/s10489-024-06167-6
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Colorectal cancer is one of the deadliest diseases and contributes to a rising number of annual deaths worldwide; colorectal polyps are recognized precursors of this malignancy. Early and accurate detection of these polyps is crucial for reducing colorectal cancer mortality. However, manual detection is time-consuming, requires trained medical professionals, and often misses polyps because of their varied size, color, and texture. Computer-aided diagnosis systems offer potential improvements, but they often struggle with precision in complex visual environments. This study presents an enhanced deep learning approach that uses an encoder-decoder architecture for colorectal polyp segmentation to capture and utilize complex feature representations. The approach introduces an enhanced dual attention mechanism that combines spatial and channel-wise attention to focus precisely on critical features. Channel-wise attention, implemented via an optimized Squeeze-and-Excitation (S&E) block, allows the network to capture comprehensive contextual information and interrelationships among channels, yielding a more refined feature selection process. In experiments, the proposed model achieved a mean Intersection over Union (IoU) of 0.9054 and 0.9277, a Dice coefficient of 0.9006 and 0.9128, a precision of 0.8985 and 0.9517, a recall of 0.9190 and 0.9094, and an accuracy of 0.9806 and 0.9907 on the Kvasir-SEG and CVC-ClinicDB datasets, respectively. The proposed model also outperforms existing state-of-the-art methods, indicating its potential to enhance the early detection of colorectal polyps and thereby improve patient outcomes.
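The five metrics reported in the abstract can all be derived from the pixel-wise confusion counts of a predicted mask against a ground-truth mask. The following is a minimal sketch of the standard definitions for the foreground (polyp) class only; the function names and the toy masks are illustrative assumptions, and the abstract does not state how the authors average their scores across images or classes, so this is not necessarily their exact evaluation protocol.

```python
def confusion_counts(pred, truth):
    """Count TP, FP, FN, TN over two equally sized binary masks (lists of rows)."""
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1          # polyp pixel correctly predicted
            elif p and not t:
                fp += 1          # background predicted as polyp
            elif not p and t:
                fn += 1          # polyp pixel missed
            else:
                tn += 1          # background correctly predicted
    return tp, fp, fn, tn


def segmentation_metrics(pred, truth):
    """Return foreground IoU, Dice, precision, recall, and pixel accuracy."""
    tp, fp, fn, tn = confusion_counts(pred, truth)

    def ratio(num, den):
        # Guard against empty masks, where a denominator can be zero.
        return num / den if den else 0.0

    return {
        "iou": ratio(tp, tp + fp + fn),
        "dice": ratio(2 * tp, 2 * tp + fp + fn),
        "precision": ratio(tp, tp + fp),
        "recall": ratio(tp, tp + fn),
        "accuracy": ratio(tp + tn, tp + fp + fn + tn),
    }


# Hypothetical 3x4 masks (1 = polyp, 0 = background), for illustration only.
truth = [[0, 1, 1, 0],
         [0, 1, 1, 0],
         [0, 0, 0, 0]]
pred = [[0, 1, 1, 1],
        [0, 1, 0, 0],
        [0, 0, 0, 0]]

metrics = segmentation_metrics(pred, truth)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

On a single binary mask, Dice and IoU are tied by Dice = 2·IoU / (1 + IoU), so Dice is never smaller than IoU. A mean IoU that exceeds the Dice coefficient, as in the figures above, therefore suggests that the two scores were averaged differently (for example, IoU averaged over both the polyp and background classes).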
Pages: 24