EPSegNet: Lightweight Semantic Recalibration and Assembly for Efficient Polyp Segmentation

被引:0
作者
Wu, Huisi [1 ]
Zhao, Zebin [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Semantics; Accuracy; Decoding; Shape; Data mining; Semantic segmentation; Colonoscopy; Context modeling; Computational modeling; Encoder-decoder; lightweight; polyp segmentation; semantic assembly; semantic recalibration; NETWORK; DIAGNOSIS; CNN;
D O I
10.1109/TNNLS.2025.3527557
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Colorectal cancer (CRC) is among the most common malignancies and the detection and removal of polyps at the early stage is of great importance to prevent it. However, current state-of-the-art high-accuracy methods for polyps segmentation have a large number of parameters and a stringent requirement for computational cost, while lightweight and fast models significantly sacrifice accuracy. Currently, medical semantic segmentation algorithms are mostly based on encoder-decoder architecture. Pixelwise spatial information has been proven to be very important to the quality of features extracted by encoders. However, almost all existing approaches capturing it suffer from high computational complexity. Furthermore, the capacity of the traditional decoder is limited by its limited receptive fields. To comprehensively address the above problems, we propose a novel efficient polyp segmentation network (EPSegNet) to simultaneously fulfill the requirements of accuracy, size, and speed. First, we propose a lightweight feature extraction and recalibration module (LFERM), which can efficiently extract dense multiscale features. Specifically, in LFERM, we propose a spatial information recalibration (SIR) block for efficiently refining spatial information. Based on LFERMs, we develop an encoder. Moreover, we propose a novel lightweight semantic assembly decoder (LSAD) that assembles both channelwise and pixelwise semantics from a global context view. Finally, we combine the encoder and LSAD to form the proposed EPSegNet. Experiments on Kvasir-SEG, CVC-ClinicDB, and CVC-ColonDB datasets demonstrate that the proposed EPSegNet achieves the best balance between accuracy and size among state-of-the-art models and obtains a fast speed for polyp segmentation. Without any pretraining and postprocessing, our method achieves 79.37% intersection over union (IoU) and 86.74% Dice on the Kvasie-SEG dataset with only 0.34 million parameters and a speed of 128 frames/s (FPS) at the input size of 3 x 384 x 384 on a single NVIDIA GEFORCE RTX 2080Ti card. Codes will be released upon publication.
引用
收藏
页数:13
相关论文
共 64 条
  • [1] WM-DOVA maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians
    Bernal, Jorge
    Javier Sanchez, F.
    Fernandez-Esparrach, Gloria
    Gil, Debora
    Rodriguez, Cristina
    Vilarino, Fernando
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2015, 43 : 99 - 111
  • [2] Bo D., 2023, Polyp-pvt: Polyp segmentation with pyramidvision transformers
  • [3] Fully Convolutional Neural Networks for Polyp Segmentation in Colonoscopy
    Brandao, Patrick
    Mazomenos, Evangelos
    Ciuti, Gastone
    Calio, Renato
    Bianchi, Federico
    Menciassi, Arianna
    Dario, Paolo
    Koulaouzidis, Anastasios
    Arezzo, Alberto
    Stoyanov, Danail
    [J]. MEDICAL IMAGING 2017: COMPUTER-AIDED DIAGNOSIS, 2017, 10134
  • [4] Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, DOI 10.48550/ARXIV.1706.05587]
  • [5] Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26
  • [6] Linear Deconfounded Score Method: Scoring DAGs With Dense Unobserved Confounding
    Bellot, Alexis
    van der Schaar, Mihaela
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4948 - 4962
  • [7] CPFNet: Context Pyramid Fusion Network for Medical Image Segmentation
    Feng, Shuanglang
    Zhao, Heming
    Shi, Fei
    Cheng, Xuena
    Wang, Meng
    Ma, Yuhui
    Xiang, Dehui
    Zhu, Weifang
    Chen, Xinjian
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (10) : 3008 - 3018
  • [8] Scene Segmentation With Dual Relation-Aware Attention Network
    Fu, Jun
    Liu, Jing
    Jiang, Jie
    Li, Yong
    Bao, Yongjun
    Lu, Hanqing
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2547 - 2560
  • [9] Dual Attention Network for Scene Segmentation
    Fu, Jun
    Liu, Jing
    Tian, Haijie
    Li, Yong
    Bao, Yongjun
    Fang, Zhiwei
    Lu, Hanqing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
  • [10] Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, DOI 10.48550/ARXIV.1704.04861]