SR-AttNet: An Interpretable Stretch-Relax Attention based Deep Neural Network for Polyp Segmentation in Colonoscopy Images

被引:12
作者
Alam, Md. Jahin [1 ]
Fattah, Shaikh Anowarul [1 ,2 ]
机构
[1] Bangladesh Univ Engn & Technol BUET, Dept Elect & Elect Engn, Dhaka 1205, Bangladesh
[2] Bangladesh Univ Engn & Technol BUET, Dept Elect & Elect Engn, Dhaka, Bangladesh
关键词
Polyp segmentation; Attention; Colonoscopy; Deep learning; CNN; Interpretable; U-NET ARCHITECTURE; DIAGNOSIS; COVID-19; MODEL;
D O I
10.1016/j.compbiomed.2023.106945
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Colorectal polyp is a common structural gastrointestinal (GI) anomaly, which can in certain cases turn malignant. Colonoscopic image inspection is, thereby, an important step for isolating the polyps as well as removing them if necessary. However, the process is around 30-60 min long and inspecting each image for polyps can prove to be a tedious task. Hence, an automatic computerized process for efficient and accurate polyp isolation can be a useful tool.Methods: In this study, a deep learning network is introduced for colorectal polyp segmentation. The network is based on an encoder-decoder architecture, however, having both un-dilated and dilated filtering in order to extract both near and far local information as well as perceive image depth. Four-fold skip-connections exist between each spatial encoder-decoder due to both type of filtering and a 'Feature-to-Mask' pipeline processes the decoded dilated and un-dilated features for final prediction. The proposed network implements a 'Stretch- Relax' based attention system, SR-Attention, to generate high variance spatial features in order to obtain useful attention masks for cognitive feature selection. From this 'Stretch-Relax' attention based operation, the network is termed as 'SR-AttNet'.Results: Training and optimization is performed on four different datasets, and inference has been done on five (Kvasir-SEG, CVC-ClinicDB, CVC-Colon, ETIS-Larib, EndoCV2020); all of which output higher Dice-score compared to state-of-the-art and existing networks. The efficacy and interpretability of SR-Attention is also demonstrated based on quantitative variance.Conclusion: In consequence, the proposed SR-AttNet can be considered for an automated and general approach for polyp segmentation during colonoscopy.
引用
收藏
页数:16
相关论文
共 55 条
[51]   Focus U-Net: A novel dual attention-gated CNN for polyp segmentation during colonoscopy [J].
Yeung, Michael ;
Sala, Evis ;
Schonlieb, Carola-Bibiane ;
Rundo, Leonardo .
COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 137
[52]   Automated polyp segmentation in colonoscopy images via deep network with lesion-aware feature selection and refinement [J].
Yue, Guanghui ;
Han, Wanwan ;
Li, Siying ;
Zhou, Tianwei ;
Lv, Jun ;
Wang, Tianfu .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
[53]   Road Extraction by Deep Residual U-Net [J].
Zhang, Zhengxin ;
Liu, Qingjie ;
Wang, Yunhong .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (05) :749-753
[54]   Pyramid Scene Parsing Network [J].
Zhao, Hengshuang ;
Shi, Jianping ;
Qi, Xiaojuan ;
Wang, Xiaogang ;
Jia, Jiaya .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6230-6239
[55]  
Zhou Zongwei, 2018, Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018), V11045, P3, DOI [10.1007/978-3-030-00889-5_1, 10.1007/978-3-030-00689-1_1]