PolySegNet: improving polyp segmentation through swin transformer and vision transformer fusion

Cited by: 1

Authors:

Lijin, P. [1]

Ullah, Mohib [2]

Vats, Anuja [2]

Cheikh, Faouzi Alaya [3]

Kumar, G. Santhosh [1]

Nair, Madhu S. [1]

Affiliations:

[1] Cochin Univ Sci & Technol, Dept Comp Sci, Artificial Intelligence & Comp Vis Lab, Kochi 682022, Kerala, India

[2] Norwegian Univ Sci & Technol, Teknol Vegen 22, N-2815 Gjovik, Norway

[3] Norwegian Univ Sci & Technol, Norwegian Colour & Visual Comp Lab, Teknol Vegen 22, N-2815 Gjovik, Norway

Source:

BIOMEDICAL ENGINEERING LETTERS | 2024 / Vol. 14 / Issue 06

Keywords:

Swin transformer; Vision transformer; Convolutional neural network; Colorectal cancer; Segmentation;

DOI:

10.1007/s13534-024-00415-x

Chinese Library Classification (CLC) number:

R318 [Biomedical Engineering];

Discipline classification code:

0831;

Abstract:

Colorectal cancer ranks as the second most prevalent cancer worldwide, with a high mortality rate. Colonoscopy stands as the preferred procedure for diagnosing colorectal cancer. Detecting polyps at an early stage is critical for effective prevention and diagnosis. However, challenges in colonoscopic procedures often lead medical practitioners to seek support from alternative techniques for timely polyp identification. Polyp segmentation emerges as a promising approach to identify polyps in colonoscopy images. In this paper, we propose an advanced method, PolySegNet, that leverages both Vision Transformer and Swin Transformer, coupled with a Convolutional Neural Network (CNN) decoder. The fusion of these models facilitates a comprehensive analysis of various modules in our proposed architecture. To assess the performance of PolySegNet, we evaluate it on three colonoscopy datasets, a combined dataset, and their augmented versions. The experimental results demonstrate that PolySegNet achieves competitive results in terms of polyp segmentation accuracy and efficacy, achieving a mean Dice score of 0.92 and a mean Intersection over Union (IoU) of 0.86. These metrics highlight the superior performance of PolySegNet in accurately delineating polyp boundaries compared to existing methods. PolySegNet has shown great promise in accurately and efficiently segmenting polyps in medical images. The proposed method could be the foundation for a new class of transformer-based segmentation models in medical image analysis.
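The abstract reports results as a mean Dice score and a mean IoU. As a reading aid, the sketch below shows how these two overlap metrics are commonly computed for one pair of binary masks. It is not taken from the paper: the function name `dice_and_iou`, the nested-list mask representation, and the convention of returning 1.0 when both masks are empty are all assumptions made for illustration.

```python
from typing import Sequence, Tuple

# A binary mask as rows of 0/1 values (hypothetical representation).
Mask = Sequence[Sequence[int]]


def dice_and_iou(pred: Mask, target: Mask) -> Tuple[float, float]:
    """Return (Dice, IoU) for two binary masks of the same shape."""
    intersection = 0  # pixels marked as polyp in both masks
    pred_area = 0     # pixels marked as polyp in the prediction
    target_area = 0   # pixels marked as polyp in the ground truth
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            p, t = bool(p), bool(t)
            intersection += int(p and t)
            pred_area += int(p)
            target_area += int(t)

    union = pred_area + target_area - intersection
    if union == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0, 1.0

    dice = 2 * intersection / (pred_area + target_area)
    iou = intersection / union
    return dice, iou


# Toy example: 2 overlapping pixels, 3 predicted, 3 in the ground truth.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
dice, iou = dice_and_iou(pred, target)
print(f"Dice = {dice:.3f}, IoU = {iou:.3f}")
```

For a single pair of masks the two metrics are tied by Dice = 2·IoU / (1 + IoU). That identity does not hold exactly for averages over a dataset, so a reported mean Dice and mean IoU need not satisfy it precisely.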


Pages: 1421-1431

Number of pages: 11

