FNOSEG3D: RESOLUTION-ROBUST 3D IMAGE SEGMENTATION WITH FOURIER NEURAL OPERATOR

被引:1
作者
Wong, Ken C. L. [1 ]
Wang, Hongzhi [1 ]
Syeda-Mahmood, Tanveer [1 ]
机构
[1] IBM Res, Almaden Res Ctr, San Jose, CA 95120 USA
来源
2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI | 2023年
关键词
Image segmentation; deep learning; neural operator; Fourier transform; zero-shot super-resolution; TRANSFORMER;
D O I
10.1109/ISBI53787.2023.10230586
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the computational complexity of 3D medical image segmentation, training with downsampled images is a common remedy for out-of-memory errors in deep learning. Nevertheless, as standard spatial convolution is sensitive to variations in image resolution, the accuracy of a convolutional neural network trained with downsampled images can be suboptimal when applied on the original resolution. To address this limitation, we introduce FNOSeg3D, a 3D segmentation model robust to training image resolution based on the Fourier neural operator (FNO). The FNO is a deep learning framework for learning mappings between functions in partial differential equations, which has the appealing properties of zero-shot super-resolution and global receptive field. We improve the FNO by reducing its parameter requirement and enhancing its learning capability through residual connections and deep supervision, and these result in our FNOSeg3D model which is parameter efficient and resolution robust. When tested on the BraTS'19 dataset, it achieved superior robustness to training image resolution than other tested models with less than 1% of their model parameters.
引用
收藏
页数:5
相关论文
共 15 条
  • [1] Ba JL, 2016, arXiv
  • [2] Bakas S, 2019, Arxiv, DOI [arXiv:1811.02629, 10.48550/arXiv.1811.02629, DOI 10.48550/ARXIV.1811.02629]
  • [3] UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation
    Gao, Yunhe
    Zhou, Mu
    Metaxas, Dimitris N.
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 61 - 71
  • [4] UNETR: Transformers for 3D Medical Image Segmentation
    Hatamizadeh, Ali
    Tang, Yucheng
    Nath, Vishwesh
    Yang, Dong
    Myronenko, Andriy
    Landman, Bennett
    Roth, Holger R.
    Xu, Daguang
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1748 - 1758
  • [5] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [6] Deep Learning Techniques for Medical Image Segmentation: Achievements and Challenges
    Hesamian, Mohammad Hesam
    Jia, Wenjing
    He, Xiangjian
    Kennedy, Paul
    [J]. JOURNAL OF DIGITAL IMAGING, 2019, 32 (04) : 582 - 596
  • [7] Kingma DP, 2014, ADV NEUR IN, V27
  • [8] Klambauer G., 2017, Advances in neural information processing systems, P971
  • [9] Lee CY, 2015, JMLR WORKSH CONF PRO, V38, P562
  • [10] Li Zhiyuan, 2021, INT C LEARNING REPRE