Improving 3-D Medical Image Segmentation at Boundary Regions Using Local Self-Attention and Global Volume Mixing

被引:1
作者
Abdul Kareem D.N. [1 ]
Fiaz M. [1 ]
Novershtern N. [2 ]
Hanna J. [2 ]
Cholakkal H. [1 ]
机构
[1] Mohamed Bin Zayed University of Artificial Intelligence, Abu Dhabi
[2] Weizmann Institute of Science, Rehovot
来源
IEEE Transactions on Artificial Intelligence | 2024年 / 5卷 / 06期
关键词
Attention; medical image segmentation; MLP-mixer; transfer learning;
D O I
10.1109/TAI.2023.3346833
中图分类号
学科分类号
摘要
Volumetric medical image segmentation is a fundamental problem in medical image analysis where the objective is to accurately classify a given 3-D volumetric medical image with voxel-level precision. In this work, we propose a novel hierarchical encoder-decoder-based framework that strives to explicitly capture the local and global dependencies for volumetric 3-D medical image segmentation. The proposed framework exploits local volume-based self-attention to encode the local dependencies at high resolution and introduces a novel volumetric multi-layer perceptron (MLP)-mixer to capture the global dependencies at low-resolution feature representations, respectively. The proposed volumetric MLP-mixer learns better associations among volumetric feature representations. These explicit local and global feature representations contribute to better learning of the shape-boundary characteristics of the organs. Extensive experiments on three different datasets reveal that the proposed method achieves favorable performance compared to state-of-the-art approaches. On the challenging Synapse Multiorgan dataset, the proposed method achieves an absolute 3.82% gain over the state-of-the-art approaches in terms of HD95 evaluation metrics while a similar improvement pattern is exhibited in Medical Segmentation Decathlon (MSD) Liver and Pancreas tumor datasets. We also provide a detailed comparison between recent architectural design choices in the 2-D computer vision literature by adapting them for the problem of 3-D medical image segmentation. Finally, our experiments on the ZebraFish 3-D cell membrane dataset having limited training data demonstrate the superior transfer learning capabilities of the proposed vMixer model on the challenging 3-D cell instance segmentation task, where accurate boundary prediction plays a vital role in distinguishing individual cell instances. © 2020 IEEE.
引用
收藏
页码:3233 / 3244
页数:11
相关论文
共 46 条
  • [1] Isensee F., Et al., nnU-Net: Self-adapting framework for U-Net-based medical image segmentation, (2018)
  • [2] Xu X., Zhou F., Liu B., Fu D., Bai X., Efficient multiple organ localization in CT image using 3D region proposal network, IEEE Trans. Med. Imag., 38, 8, pp. 1885-1898, (2019)
  • [3] Myronenko A., 3D MRI brain tumor segmentation using autoencoder regularization, Proc. Int. MICCAI Brainlesion Workshop, pp. 311-320, (2019)
  • [4] Chen W., Liu B., Peng S., Sun J., Qiao X., S3D-UNet: Separable 3D U-Net for brain tumor segmentation, Proc. Int. MICCAI Brainlesion Workshop, pp. 358-368, (2019)
  • [5] Ronneberger O., Fischer P., Brox T., U-Net: Convolutional networks for biomedical image segmentation, Proc. Int. Conf. Med. Image Comput. Comput.-Assisted Intervention, pp. 234-241, (2015)
  • [6] Milletari F., Navab N., Ahmadi S.-A., V-Net: Fully convolutional neural networks for volumetric medical image segmentation, Proc. 4th Int. Conf. 3D Vision (3DV), pp. 565-571, (2016)
  • [7] Nuechterlein N., Mehta S., 3D-ESPNet with pyramidal refinement for volumetric brain tumor image segmentation, Proc. Int. MICCAI Brainlesion Workshop, pp. 245-253, (2019)
  • [8] Cao H., Et al., Swin-Unet: Unet-like pure transformer for medical image segmentation, (2021)
  • [9] Huang X., Deng Z., Li D., Yuan X., MISSFormer: An effective medical image segmentation transformer, (2021)
  • [10] Zhou H.-Y., Guo J., Zhang Y., Yu L., Wang L., Yu Y., nn-Former: Interleaved transformer for volumetric segmentation, (2021)