Application of Multilayer Information Fusion and Optimization Network Combined With Attention Mechanism in Polyp Segmentation

Times cited: 0
Authors
Chu, Jinghui [1]
Wang, Yongpeng [1]
Tian, Qi [2]
Lu, Wei [1]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Tianjin Childrens Hosp, Tianjin 300204, Peoples R China
Keywords
Feature extraction; Decoding; Transformers; Attention mechanisms; Semantics; Colonoscopy; Convolution; Accuracy; Optimization; Noise; Colorectal cancer (CRC); contextual feature process; multiscale attention mechanism; polyp boundaries refinement; polyp segmentation
DOI
10.1109/TIM.2025.3527621
Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract
Colorectal cancer (CRC) is a multifaceted disease, but it can be effectively prevented through colonoscopy for the detection of polyps. In clinical practice, automatic polyp segmentation techniques for colonoscopy images can significantly enhance the efficiency and accuracy of polyp detection and help clinicians localize polyps precisely. However, existing segmentation methods have several obvious limitations: 1) inadequate utilization of the multilevel features extracted by feature encoders; 2) ineffective aggregation of high- and low-level features; and 3) unclear delineation of polyp boundaries. To address these challenges while enhancing the clarity of polyp boundaries in segmentation, we propose a novel multilayer information fusion and optimization network (MIFONet) consisting of the following components: 1) a contextual and fine feature processing (CFFP) module, employed to effectively extract both local and global contextual information; 2) a hierarchical feature integration module (HFIM), added to facilitate efficient aggregation of the processed high- and low-level features and to strengthen the association between contextual features; 3) a multiscale contextual attention (MSCA) module, used to deeply integrate the aggregated high-level features with low-level features; and 4) a novel refinement module composed of an adaptive channel attention pyramid (ACAP) part and a skip-reverse attention (SRA) part, which captures fine-grained information and refines the feature representation. We conducted extensive experiments and a comparative analysis of our proposed model against 19 popular or state-of-the-art (SOTA) methods on five well-known polyp benchmark datasets. To further validate the model's generalization performance, we also designed three cross-dataset experiments. Experimental results demonstrate that MIFONet consistently achieves excellent segmentation performance across most datasets. In particular, it achieves 94.6% mean Dice on the CVC-ClinicDB dataset, outperforming the compared SOTA methods.
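For readers unfamiliar with the mean Dice metric quoted in the abstract, the following is a minimal sketch of how it is commonly computed for binary segmentation masks. It is not the authors' evaluation code: the function names `dice` and `mean_dice`, the nested-list mask representation, and the convention that two empty masks score 1.0 are all assumptions made for illustration.

```python
from typing import Sequence

# A binary mask as rows of 0/1 pixel values (illustrative representation).
Mask = Sequence[Sequence[int]]


def dice(pred: Mask, target: Mask) -> float:
    """Dice coefficient 2|A ∩ B| / (|A| + |B|) for two binary masks."""
    intersection = 0
    total = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            p_on = 1 if p else 0
            t_on = 1 if t else 0
            intersection += p_on * t_on
            total += p_on + t_on
    if total == 0:
        # Assumption: two empty masks are treated as a perfect match.
        return 1.0
    return 2.0 * intersection / total


def mean_dice(preds: Sequence[Mask], targets: Sequence[Mask]) -> float:
    """Average the per-image Dice scores over a dataset."""
    scores = [dice(p, t) for p, t in zip(preds, targets)]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    prediction = [[1, 1], [0, 0]]
    ground_truth = [[1, 0], [0, 0]]
    print(dice(prediction, ground_truth))
    print(mean_dice([prediction, ground_truth], [ground_truth, ground_truth]))
```

Under this per-image averaging convention, a reported mean Dice of 94.6% would correspond to `mean_dice` returning roughly 0.946 over the test images. Some evaluation toolkits instead threshold the prediction at several levels or pool pixels across the dataset, so the exact protocol should be checked against the paper.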
Pages: 15