DilatedSegNet: A Deep Dilated Segmentation Network for Polyp Segmentation

Citations: 9
Authors
Tomar, Nikhil Kumar [1]
Jha, Debesh [1]
Bagci, Ulas [1]
Affiliations
[1] Northwestern Univ, Dept Radiol, Machine & Hybrid Intelligence Lab, Chicago, IL, USA
Source
MULTIMEDIA MODELING, MMM 2023, PT I | 2023 / Vol. 13833
Keywords
Deep learning; Polyp segmentation; Colonoscopy; Residual network; Generalization; Real-time segmentation;
DOI
10.1007/978-3-031-27077-2_26
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Colorectal cancer (CRC) is the second leading cause of cancer-related death worldwide. Excision of polyps during colonoscopy helps reduce CRC mortality and morbidity. Powered by deep learning, computer-aided diagnosis (CAD) systems can detect regions in the colon that physicians overlook during colonoscopy. Insufficient accuracy and a lack of real-time speed are the main obstacles to successful clinical integration of such systems. While the literature focuses on improving accuracy, speed is often ignored. To address this need, we develop a novel real-time deep learning-based architecture, DilatedSegNet, to perform polyp segmentation on the fly. DilatedSegNet is an encoder-decoder network that uses a pre-trained ResNet50 as the encoder, from which we extract four levels of feature maps. Each of these feature maps is passed through a dilated convolution pooling (DCP) block. The outputs from the DCP blocks are concatenated and passed through a series of four decoder blocks that predict the segmentation mask. The proposed method achieves a real-time operation speed of 33.68 frames per second with an average Dice coefficient (DSC) of 0.90 and a mean intersection over union (mIoU) of 0.83. We also provide heatmaps alongside the qualitative results that explain the predicted polyp location, which increases the trustworthiness of the method. The results on the publicly available Kvasir-SEG and BKAI-IGH datasets suggest that DilatedSegNet can give real-time feedback while retaining a high DSC, indicating high potential for using such models in real clinical settings in the near future. The source code is available on GitHub: https://github.com/nikhilroxtomar/DilatedSegNet.
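
The abstract reports an average DSC and an mIoU. As a minimal sketch of how these two overlap scores are commonly computed for binary segmentation masks, the pure-Python example below may help. It is not taken from the paper or its repository: the function names `dice_and_iou` and `mean_scores`, the smoothing term `eps`, and the choice to average per image are all assumptions made for illustration.

```python
def dice_and_iou(pred, target, eps=1e-7):
    """Return (DSC, IoU) for two binary masks given as nested lists of 0/1.

    eps is a hypothetical smoothing term (an assumption, not from the paper)
    that avoids division by zero when both masks are empty.
    """
    # Flatten the 2-D masks into 1-D pixel lists.
    p = [v for row in pred for v in row]
    t = [v for row in target for v in row]
    if len(p) != len(t):
        raise ValueError("masks must contain the same number of pixels")

    inter = sum(1 for a, b in zip(p, t) if a and b)  # |P ∩ T|
    p_sum = sum(1 for a in p if a)                   # |P|
    t_sum = sum(1 for b in t if b)                   # |T|
    union = p_sum + t_sum - inter                    # |P ∪ T|

    dice = (2 * inter + eps) / (p_sum + t_sum + eps)
    iou = (inter + eps) / (union + eps)
    return dice, iou


def mean_scores(pairs):
    """Average DSC and IoU over (pred, target) pairs, one pair per image.

    Per-image averaging is an assumption; the paper's exact evaluation
    protocol is not stated in the abstract.
    """
    scores = [dice_and_iou(p, t) for p, t in pairs]
    n = len(scores)
    mean_dice = sum(d for d, _ in scores) / n
    mean_iou = sum(i for _, i in scores) / n
    return mean_dice, mean_iou


if __name__ == "__main__":
    # Toy 2x3 masks: 2 overlapping pixels, 3 predicted, 3 in the ground truth.
    pred = [[1, 1, 0],
            [0, 1, 0]]
    target = [[1, 0, 0],
              [0, 1, 1]]
    print(dice_and_iou(pred, target))
    print(mean_scores([(pred, target), (target, target)]))
```

For a single image the two scores are linked by DSC = 2·IoU / (1 + IoU), so DSC is never lower than IoU; the identity does not hold exactly once the scores are averaged over a dataset.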
Pages: 334-344
Page count: 11