Improving 3-D Medical Image Segmentation at Boundary Regions Using Local Self-Attention and Global Volume Mixing

被引：1

作者：

Abdul Kareem D.N. ^{[1
]}

Fiaz M. ^{[1
]}

Novershtern N. ^{[2
]}

Hanna J. ^{[2
]}

Cholakkal H. ^{[1
]}

机构：

[1] Mohamed Bin Zayed University of Artificial Intelligence, Abu Dhabi

[2] Weizmann Institute of Science, Rehovot

来源：

IEEE Transactions on Artificial Intelligence | 2024年 / 5卷 / 06期

关键词：

Attention; medical image segmentation; MLP-mixer; transfer learning;

D O I：

10.1109/TAI.2023.3346833

中图分类号：

学科分类号：

摘要：

Volumetric medical image segmentation is a fundamental problem in medical image analysis where the objective is to accurately classify a given 3-D volumetric medical image with voxel-level precision. In this work, we propose a novel hierarchical encoder-decoder-based framework that strives to explicitly capture the local and global dependencies for volumetric 3-D medical image segmentation. The proposed framework exploits local volume-based self-attention to encode the local dependencies at high resolution and introduces a novel volumetric multi-layer perceptron (MLP)-mixer to capture the global dependencies at low-resolution feature representations, respectively. The proposed volumetric MLP-mixer learns better associations among volumetric feature representations. These explicit local and global feature representations contribute to better learning of the shape-boundary characteristics of the organs. Extensive experiments on three different datasets reveal that the proposed method achieves favorable performance compared to state-of-the-art approaches. On the challenging Synapse Multiorgan dataset, the proposed method achieves an absolute 3.82% gain over the state-of-the-art approaches in terms of HD95 evaluation metrics while a similar improvement pattern is exhibited in Medical Segmentation Decathlon (MSD) Liver and Pancreas tumor datasets. We also provide a detailed comparison between recent architectural design choices in the 2-D computer vision literature by adapting them for the problem of 3-D medical image segmentation. Finally, our experiments on the ZebraFish 3-D cell membrane dataset having limited training data demonstrate the superior transfer learning capabilities of the proposed vMixer model on the challenging 3-D cell instance segmentation task, where accurate boundary prediction plays a vital role in distinguishing individual cell instances. © 2020 IEEE.

引用

页码：3233 / 3244

页数：11

共 46 条

[1] Isensee F., Et al., nnU-Net: Self-adapting framework for U-Net-based medical image segmentation, (2018)
[2] Xu X., Zhou F., Liu B., Fu D., Bai X., Efficient multiple organ localization in CT image using 3D region proposal network, IEEE Trans. Med. Imag., 38, 8, pp. 1885-1898, (2019)
[3] Myronenko A., 3D MRI brain tumor segmentation using autoencoder regularization, Proc. Int. MICCAI Brainlesion Workshop, pp. 311-320, (2019)
[4] Chen W., Liu B., Peng S., Sun J., Qiao X., S3D-UNet: Separable 3D U-Net for brain tumor segmentation, Proc. Int. MICCAI Brainlesion Workshop, pp. 358-368, (2019)
[5] Ronneberger O., Fischer P., Brox T., U-Net: Convolutional networks for biomedical image segmentation, Proc. Int. Conf. Med. Image Comput. Comput.-Assisted Intervention, pp. 234-241, (2015)
[6] Milletari F., Navab N., Ahmadi S.-A., V-Net: Fully convolutional neural networks for volumetric medical image segmentation, Proc. 4th Int. Conf. 3D Vision (3DV), pp. 565-571, (2016)
[7] Nuechterlein N., Mehta S., 3D-ESPNet with pyramidal refinement for volumetric brain tumor image segmentation, Proc. Int. MICCAI Brainlesion Workshop, pp. 245-253, (2019)
[8] Cao H., Et al., Swin-Unet: Unet-like pure transformer for medical image segmentation, (2021)
[9] Huang X., Deng Z., Li D., Yuan X., MISSFormer: An effective medical image segmentation transformer, (2021)
[10] Zhou H.-Y., Guo J., Zhang Y., Yu L., Wang L., Yu Y., nn-Former: Interleaved transformer for volumetric segmentation, (2021)

← 1 2 3 4 5 →