Multiscale transunet + + : dense hybrid U-Net with transformer for medical image segmentation

被引：0

作者：

Bo Wang

·Fan Wang

Pengwei Dong

·Chongyi Li

机构：

[1] Ningxia University,School of Physics and Electronic

[2] Nanyang Technological University,Electrical Engineering

来源：

Signal, Image and Video Processing | 2022年 / 16卷

关键词：

Medical image segmentation; CNN; Transformer; Weighted loss function;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Automatic medical image segmentation as assistance to doctors is important for diagnosis and treatment of various diseases. TransUNet that integrates the advantages of transformer and CNN has achieved success in medical image segmentation tasks. However, TransUNet simply combines feature maps between encoder and decoder via skip connections at the same resolution, which leads to be an unnecessarily restrictive fusion design. Moreover, the positional encoding and input tokens in standard transformer blocks of TransUNet have a fixed scale, which are not suitable for dense prediction. To alleviate the above problems, in this paper, we propose a novel architecture named multiscale TransUNet + + (MS-TransUNet + +), which employs a multiscale and flexible feature fusion scheme between encoder and decoder at different levels. The novel skip connections densely bridge the extracted feature representations with different resolutions, and the hybrid CNN-Transformer encoder with long-range dependencies directly passes the high-level features to each stage of decoder. Besides, in order to obtain more effective feature representations, an efficient multi-scale visual transformer is introduced for feature encoder. More importantly, we employ a weighted loss function composed of focal, multiscale structure similarity and Jaccard index to penalize the training error of medical image segmentation, jointly realizing pixel-level, patch-level and map-level optimization. Extensive experimental results demonstrate that our proposed multiscale TransUNet + + can achieve competitive performance for prostate MR and liver CT image segmentation.

引用

页码：1607 / 1614

页数：7

共 42 条

[1]

Gu R(2021)CA-Net: comprehensive attention convolutional neural networks for explainable medical image segmentation IEEE Trans. Med. Imaging 40 699-711

[2]

Wang G(2017)3-D active contour segmentation based on sparse linear combination of training shapes (SCoTS) IEEE Trans. Med. Imaging 36 2239-2249

[3]

Song T(2018)Multi-atlas segmentation of MR tumor brain images using low-rank based image recovery IEEE Trans. Med. Imaging 37 2224-2235

[4]

Farhangi MM(2020)‘Squeeze & excite’ guided few shot segmentation of volumetric images Med. Image Anal. 59 1-12

[5]

Frigui H(2021)Inter-slice context residual learning for 3D medical image segmentation IEEE Trans. Med. Imaging 40 661-672

[6]

Seow A(2017)A survey on deep learning in medical image analysis Med. Image Anal. 42 60-88

[7]

Tang Z(2019)UNet++: redesigning skip connections to exploit multiscale features in image segmentation IEEE Trans. Med. Imaging 39 1856-1867

[8]

Ahmad S(2018)H-denseunet: hybrid densely connected unet for liver and tumor segmentation from ct volumes IEEE Trans. Med. Imaging 37 2663-2674

[9]

Yap PT(2019)Deeply supervised 3D fully convolutional networks with group dilated convolution for automatic MRI prostate segmentation Med. Phys. 46 1707-1718

[10]

Roy AG(2020)A multiple-channel and atrous convolution network for ultrasound image segmentation Med. Phys. 47 6270-6285

← 1 2 3 4 5 →