Application of Multilayer Information Fusion and Optimization Network Combined With Attention Mechanism in Polyp Segmentation

被引:0
作者
Chu, Jinghui [1 ]
Wang, Yongpeng [1 ]
Tian, Qi [2 ]
Lu, Wei [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Tianjin Childrens Hosp, Tianjin 300204, Peoples R China
关键词
Feature extraction; Decoding; Transformers; Attention mechanisms; Semantics; Colonoscopy; Convolution; Accuracy; Optimization; Noise; Colorectal cancer (CRC); contextual feature process; multiscale attention mechanism; polyp boundaries refinement; polyp segmentation;
D O I
10.1109/TIM.2025.3527621
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Colorectal cancer (CRC) is a multifaceted disease, but it can be effectively prevented through colonoscopy for the detection of polyps. In clinical practice, the development of automatic polyp segmentation techniques for colonoscopy images can significantly enhance the efficiency and accuracy of polyp detection and help clinicians to precisely localize the polyps. However, existing segmentation methods have several obvious limitations: 1) inadequate utilization of multilevel features extracted by feature encoders; 2) ineffective aggregation of high- and low-level features; and 3) unclear delineation of polyp boundaries. To address these challenges while enhancing the clarity of polyp boundaries in segmentation, we propose a novel multilayer information fusion and optimization network (MIFONet) consisting of the following components: 1) contextual and fine feature processing (CFFP) module, employed to effectively extract both local and global contextual information; 2) hierarchical feature integration module (HFIM), added to facilitate efficient aggregation of processed high- and low-level features and strengthen the association between contextual features; 3) multiscale contextual attention (MSCA) module, used to deeply integrate aggregated high-level features with low-level features; and 4) a novel refinement module composed of an adaptive channel attention pyramid (ACAP) part and a skip-reverse attention (SRA) part, with the ability to capture fine-grained information and refining feature representation. We conducted extensive experiments and comparative analysis of our proposed model with 19 popular or state-of-the-art (SOTA) methods on five renowned polyp benchmark datasets. To further validate the model's generalization performance, we also designed three cross-dataset experiments. Experimental results demonstrate that MIFONet consistently achieves excellent segmentation performance across most datasets. In particular, we achieve 94.6% mean Dice on the CVC-ClinicDB dataset, which obtains superior performance compared with SOTA methods.
引用
收藏
页数:15
相关论文
共 65 条
[31]   Multi-level feature fusion network combining attention mechanisms for polyp segmentation [J].
Liu, Junzhuo ;
Chen, Qiaosong ;
Zhang, Ye ;
Wang, Zhixiang ;
Deng, Xin ;
Wang, Jin .
INFORMATION FUSION, 2024, 104
[32]   Attention combined pyramid vision transformer for polyp segmentation [J].
Liu, Xiaogang ;
Song, Shuang .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
[33]  
Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965
[34]   CaraNet: Context Axial Reverse Attention Network for Segmentation of Small Medical Objects [J].
Lou, Ange ;
Guan, Shuyue ;
Ko, Hanseok ;
Loew, Murray .
MEDICAL IMAGING 2022: IMAGE PROCESSING, 2022, 12032
[35]   CFPNET: CHANNEL-WISE FEATURE PYRAMID FOR REAL-TIME SEMANTIC SEGMENTATION [J].
Lou, Ange ;
Loew, Murray .
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, :1894-1898
[36]   How to Evaluate Foreground Maps? [J].
Margolin, Ran ;
Zelnik-Manor, Lihi ;
Tal, Ayellet .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :248-255
[37]  
Nguyen T.-C., 2021, MED IM COMP COMP ASS
[38]   ColonFormer: An Efficient Transformer Based Method for Colon Polyp Segmentation [J].
Nguyen Thanh Duc ;
Nguyen Thi Oanh ;
Nguyen Thi Thuy ;
Tran Minh Triet ;
Dinh Viet Sang .
IEEE ACCESS, 2022, 10 :80575-80586
[39]  
Perazzi F, 2012, PROC CVPR IEEE, P733, DOI 10.1109/CVPR.2012.6247743
[40]   Medical Image Segmentation via Cascaded Attention Decoding [J].
Rahman, Md Mostafijur ;
Marculescu, Radu .
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, :6211-6220