SwinT-Unet: Ultrasound Image Segmentation Based on Two-Channel Self-Attention Mechanism

被引：0

作者：

Song, Yan-Tao ^{[1
,2
]}

Lu, Yun-Li ^{[1
]}

机构：

[1] Institute of Big Data Science & Industry, Shanxi University, Shanxi, Taiyuan

[2] School of Computer & Information Technology, Shanxi University, Shanxi, Taiyuan

来源：

Tien Tzu Hsueh Pao/Acta Electronica Sinica | 2024年 / 52卷 / 11期

关键词：

image segmentation; medical image; Swin-Transformer; ultrasound image; Unet;

D O I：

10.12263/DZXB.20230904

中图分类号：

学科分类号：

摘要：

Ultrasound image segmentation plays a key role in disease diagnosis and treatment, but accurately seg⁃ menting the regions of interest is still a challenging task due to the low contrast, noise interference, and variability in shape, size, and location of the lesions in ultrasound images. To address this problem, we propose a dual-channel self-attention mechanism U-shaped network (SwinT-Unet), which utilizes Swin-Transformer and Unet encoder to simultaneously extract features. To effectively fuse the different-level features extracted by Swin-Transformer and Unet encoder, we also propose a gated dual-layer feature fusion module (GDFF), which achieves the effective fusion of global and local features through the gating mechanism, thereby improving the accuracy and robustness of the segmentation results. We conduct experiments on two different ultrasound image datasets, and the results show that our proposed model outperforms the existing convolution⁃ al neural network and Transformer-based models in terms of segmentation accuracy and robustness. Our paper provides a new method for ultrasound image segmentation, and offers more accurate and reliable support for clinical medical diagnosis and treatment. © 2024 Chinese Institute of Electronics. All rights reserved.

引用

页码：3835 / 3846

页数：11

共 26 条

[1] HUANG X, HU Y B., Problems and countermeasure of clinical application of domestic medical equipment in Chi⁃ na, Chinese Medical Equipment Journal, 39, 9, pp. 75-78, (2018)
[2] ZHANG S J, PENG Z, LI H., SAU-Net: Medical image segmentation method based on U-Net and self-attention, Acta Electronica Sinica, 50, 10, pp. 2433-2442, (2022)
[3] RONNEBERGER O, FISCHER P, BROX T., U-Net: Convo⁃ lutional networks for biomedical image segmentation, Lecture Notes in Computer Science, pp. 234-241, (2015)
[4] ALOM M Z, HASAN M, YAKOPCIC C, Et al., Recurrent re⁃ sidual convolutional neural network based on U-Net (R2U-Net) for medical image segmentation
[5] MILLETARI F, NAVAB N, AHMADI S A., V-net: Fully convolutional neural networks for volumetric medical im⁃ age segmentation, 2016 Fourth International Confer⁃ ence on 3D Vision (3DV), pp. 565-571, (2016)
[6] VASWANI A, SHAZEER N, PARMAR N, Et al., Atten⁃ tion is all you need, Proceedings of the 31st Internation⁃ al Conference on Neural Information Processing Systems, pp. 6000-6010, (2017)
[7] RAJAMANI K T, RANI P, SIEBERT H, Et al., Attention-aug⁃ mented U-Net (AA-U-Net) for semantic segmentation, Signal, Image and Video Processing, 17, 4, pp. 981-989, (2023)
[8] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, Et al., An image is worth 16x16 words: Transformers for image recognition at scale
[9] CHEN J N, LU Y Y, YU Q H, Et al., TransUNet: Trans⁃ formers make strong encoders for medical image segmenta⁃ tion
[10] VALANARASU J M J, OZA P, HACIHALILOGLU I, Et al., Medical transformer: Gated axial-attention for medical image segmentation, Lecture Notes in Computer Sci⁃ ence, pp. 36-46, (2021)

← 1 2 3 →