PolySegNet: improving polyp segmentation through swin transformer and vision transformer fusion

被引:1
作者
Lijin, P. [1 ]
Ullah, Mohib [2 ]
Vats, Anuja [2 ]
Cheikh, Faouzi Alaya [3 ]
Kumar, G. Santhosh [1 ]
Nair, Madhu S. [1 ]
机构
[1] Cochin Univ Sci & Technol, Dept Comp Sci, Artificial Intelligence & Comp Vis Lab, Kochi 682022, Kerala, India
[2] Norwegian Univ Sci & Technol, Teknol Vegen 22, N-2815 Gjovik, Norway
[3] Norwegian Univ Sci & Technol, Norwegian Colour & Visual Comp Lab, Teknol Vegen 22, N-2815 Gjovik, Norway
关键词
Swin transformer; Vision transformer; Convolutional neural network; Colorectal cancer; Segmentation;
D O I
10.1007/s13534-024-00415-x
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Colorectal cancer ranks as the second most prevalent cancer worldwide, with a high mortality rate. Colonoscopy stands as the preferred procedure for diagnosing colorectal cancer. Detecting polyps at an early stage is critical for effective prevention and diagnosis. However, challenges in colonoscopic procedures often lead medical practitioners to seek support from alternative techniques for timely polyp identification. Polyp segmentation emerges as a promising approach to identify polyps in colonoscopy images. In this paper, we propose an advanced method, PolySegNet, that leverages both Vision Transformer and Swin Transformer, coupled with a Convolutional Neural Network (CNN) decoder. The fusion of these models facilitates a comprehensive analysis of various modules in our proposed architecture.To assess the performance of PolySegNet, we evaluate it on three colonoscopy datasets, a combined dataset, and their augmented versions. The experimental results demonstrate that PolySegNet achieves competitive results in terms of polyp segmentation accuracy and efficacy, achieving a mean Dice score of 0.92 and a mean Intersection over Union (IoU) of 0.86. These metrics highlight the superior performance of PolySegNet in accurately delineating polyp boundaries compared to existing methods. PolySegNet has shown great promise in accurately and efficiently segmenting polyps in medical images. The proposed method could be the foundation for a new class of transformer-based segmentation models in medical image analysis.
引用
收藏
页码:1421 / 1431
页数:11
相关论文
共 35 条
  • [1] Attention-based generative adversarial network with internal damage segmentation using thermography
    Ali, Rahmat
    Cha, Young-Jin
    [J]. AUTOMATION IN CONSTRUCTION, 2022, 141
  • [2] Towards automatic polyp detection with a polyp appearance model
    Bernal, J.
    Sanchez, J.
    Vilarino, F.
    [J]. PATTERN RECOGNITION, 2012, 45 (09) : 3166 - 3182
  • [3] WM-DOVA maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians
    Bernal, Jorge
    Javier Sanchez, F.
    Fernandez-Esparrach, Gloria
    Gil, Debora
    Rodriguez, Cristina
    Vilarino, Fernando
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2015, 43 : 99 - 111
  • [4] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [5] Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
  • [6] SDDNet: Real-Time Crack Segmentation
    Choi, Wooram
    Cha, Young-Jin
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (09) : 8016 - 8025
  • [7] Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26
  • [8] Dosovitskiy A., ARXIV
  • [9] UNETR: Transformers for 3D Medical Image Segmentation
    Hatamizadeh, Ali
    Tang, Yucheng
    Nath, Vishwesh
    Yang, Dong
    Myronenko, Andriy
    Landman, Bennett
    Roth, Holger R.
    Xu, Daguang
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1748 - 1758
  • [10] HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation
    Heidari, Moein
    Kazerouni, Amirhossein
    Soltany, Milad
    Azad, Reza
    Aghdam, Ehsan Khodapanah
    Cohen-Adad, Julien
    Merhof, Dorit
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 6191 - 6201