EPSegNet: Lightweight Semantic Recalibration and Assembly for Efficient Polyp Segmentation

Citations: 0

Authors:

Wu, Huisi [1]

Zhao, Zebin [1]

Affiliations:

[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2025

Funding:

National Natural Science Foundation of China;

Keywords:

Feature extraction; Semantics; Accuracy; Decoding; Shape; Data mining; Semantic segmentation; Colonoscopy; Context modeling; Computational modeling; Encoder-decoder; lightweight; polyp segmentation; semantic assembly; semantic recalibration; NETWORK; DIAGNOSIS; CNN;

DOI:

10.1109/TNNLS.2025.3527557

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Colorectal cancer (CRC) is among the most common malignancies, and the detection and removal of polyps at an early stage is of great importance to prevent it. However, current state-of-the-art high-accuracy methods for polyp segmentation have a large number of parameters and a stringent requirement for computational cost, while lightweight and fast models significantly sacrifice accuracy. Currently, medical semantic segmentation algorithms are mostly based on encoder-decoder architecture. Pixelwise spatial information has been proven to be very important to the quality of features extracted by encoders. However, almost all existing approaches capturing it suffer from high computational complexity. Furthermore, the capacity of the traditional decoder is constrained by its limited receptive fields. To comprehensively address the above problems, we propose a novel efficient polyp segmentation network (EPSegNet) to simultaneously fulfill the requirements of accuracy, size, and speed. First, we propose a lightweight feature extraction and recalibration module (LFERM), which can efficiently extract dense multiscale features. Specifically, in LFERM, we propose a spatial information recalibration (SIR) block for efficiently refining spatial information. Based on LFERMs, we develop an encoder. Moreover, we propose a novel lightweight semantic assembly decoder (LSAD) that assembles both channelwise and pixelwise semantics from a global context view. Finally, we combine the encoder and LSAD to form the proposed EPSegNet. Experiments on Kvasir-SEG, CVC-ClinicDB, and CVC-ColonDB datasets demonstrate that the proposed EPSegNet achieves the best balance between accuracy and size among state-of-the-art models and obtains a fast speed for polyp segmentation.
Without any pretraining or postprocessing, our method achieves 79.37% intersection over union (IoU) and 86.74% Dice on the Kvasir-SEG dataset with only 0.34 million parameters and a speed of 128 frames/s (FPS) at the input size of 3 × 384 × 384 on a single NVIDIA GeForce RTX 2080 Ti card. Codes will be released upon publication.
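The abstract reports accuracy as IoU and Dice. As a reading aid, the sketch below shows how these two overlap metrics are commonly computed for a single pair of binary masks. It is not the authors' evaluation code: the function name `iou_and_dice`, the nested-list mask format, and the convention of returning 1.0 for two empty masks are assumptions made for illustration, and the paper's averaging protocol across images is not stated in this record.

```python
def iou_and_dice(pred, target):
    """Return (IoU, Dice) for two binary masks given as H x W nested lists of 0/1.

    Hypothetical helper for illustration; not taken from the EPSegNet paper.
    """
    intersection = 0
    pred_area = 0
    target_area = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            intersection += 1 if (p and t) else 0
            pred_area += 1 if p else 0
            target_area += 1 if t else 0

    union = pred_area + target_area - intersection
    if union == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0, 1.0

    iou = intersection / union
    dice = 2 * intersection / (pred_area + target_area)
    return iou, dice


# Toy example: 3 predicted pixels, 3 ground-truth pixels, 2 shared.
pred_mask = [[1, 1, 0],
             [0, 1, 0]]
true_mask = [[1, 0, 0],
             [0, 1, 1]]
iou, dice = iou_and_dice(pred_mask, true_mask)
print(f"IoU={iou:.4f} Dice={dice:.4f}")
```

For a single image the two metrics are tied by Dice = 2·IoU / (1 + IoU); this identity does not generally hold for values averaged over a dataset.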


Pages: 13

