A lighter hybrid feature fusion framework for polyp segmentation

被引:1
作者
He, Xue [1 ]
Luo, Yonggang [2 ]
Liu, Min [3 ]
Li, Lin [1 ]
机构
[1] Nanjing Med Univ, Dept Anesthesia Surg, Affiliated Huaian 1 Peoples Hosp, Huaian 223300, Peoples R China
[2] Nanjing Med Univ, Dept Cardiothorac Surg, Affiliated Huaian 1 Peoples Hosp, Huaian 223300, Peoples R China
[3] Nanjing Med Univ, Dept Lab Med, Affiliated Huaian 1 Peoples Hosp, Huaian 223300, Peoples R China
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Polyp segmentation; Deep learning; Transformer; Generalization; NETWORK; CANCER; RISK;
D O I
10.1038/s41598-024-72763-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Colonoscopy is widely recognized as the most effective method for the detection of colon polyps, which is crucial for early screening of colorectal cancer. Polyp identification and segmentation in colonoscopy images require specialized medical knowledge and are often labor-intensive and expensive. Deep learning provides an intelligent and efficient approach for polyp segmentation. However, the variability in polyp size and the heterogeneity of polyp boundaries and interiors pose challenges for accurate segmentation. Currently, Transformer-based methods have become a mainstream trend for polyp segmentation. However, these methods tend to overlook local details due to the inherent characteristics of Transformer, leading to inferior results. Moreover, the computational burden brought by self-attention mechanisms hinders the practical application of these models. To address these issues, we propose a novel CNN-Transformer hybrid model for polyp segmentation (CTHP). CTHP combines the strengths of CNN, which excels at modeling local information, and Transformer, which excels at modeling global semantics, to enhance segmentation accuracy. We transform the self-attention computation over the entire feature map into the width and height directions, significantly improving computational efficiency. Additionally, we design a new information propagation module and introduce additional positional bias coefficients during the attention computation process, which reduces the dispersal of information introduced by deep and mixed feature fusion in the Transformer. Extensive experimental results demonstrate that our proposed model achieves state-of-the-art performance on multiple benchmark datasets for polyp segmentation. Furthermore, cross-domain generalization experiments show that our model exhibits excellent generalization performance.
引用
收藏
页数:13
相关论文
共 46 条
[1]  
Ba JL, 2016, arXiv
[2]   WM-DOVA maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians [J].
Bernal, Jorge ;
Javier Sanchez, F. ;
Fernandez-Esparrach, Gloria ;
Gil, Debora ;
Rodriguez, Cristina ;
Vilarino, Fernando .
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2015, 43 :99-111
[3]   Fully Convolutional Neural Networks for Polyp Segmentation in Colonoscopy [J].
Brandao, Patrick ;
Mazomenos, Evangelos ;
Ciuti, Gastone ;
Calio, Renato ;
Bianchi, Federico ;
Menciassi, Arianna ;
Dario, Paolo ;
Koulaouzidis, Anastasios ;
Arezzo, Alberto ;
Stoyanov, Danail .
MEDICAL IMAGING 2017: COMPUTER-AIDED DIAGNOSIS, 2017, 10134
[4]  
Chen J., 2021, arXiv
[5]  
Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[6]   Improving deep learning-based polyp detection using feature extraction and data augmentation [J].
Chou, Yung-Chien ;
Chen, Chao-Chun .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (11) :16817-16837
[7]  
Dawwas MF, 2014, NEW ENGL J MED, V370, P2539, DOI [10.1056/NEJMc1405329, 10.1056/NEJMoa1309086]
[8]  
Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26
[9]   Sigmoid-weighted linear units for neural network function approximation in reinforcement learning [J].
Elfwing, Stefan ;
Uchibe, Eiji ;
Doya, Kenji .
NEURAL NETWORKS, 2018, 107 :3-11
[10]   Improved polyp detection from colonoscopy images using finetuned YOLO-v5 [J].
Ghose, Priyanka ;
Ghose, Arpan ;
Sadhukhan, Deboleena ;
Pal, Saurabh ;
Mitra, Madhuchanda .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) :42929-42954