FNOSEG3D: RESOLUTION-ROBUST 3D IMAGE SEGMENTATION WITH FOURIER NEURAL OPERATOR

被引：1

作者：

Wong, Ken C. L. ^{[1
]}

Wang, Hongzhi ^{[1
]}

Syeda-Mahmood, Tanveer ^{[1
]}

机构：

[1] IBM Res, Almaden Res Ctr, San Jose, CA 95120 USA

来源：

2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI | 2023年

关键词：

Image segmentation; deep learning; neural operator; Fourier transform; zero-shot super-resolution; TRANSFORMER;

D O I：

10.1109/ISBI53787.2023.10230586

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Due to the computational complexity of 3D medical image segmentation, training with downsampled images is a common remedy for out-of-memory errors in deep learning. Nevertheless, as standard spatial convolution is sensitive to variations in image resolution, the accuracy of a convolutional neural network trained with downsampled images can be suboptimal when applied on the original resolution. To address this limitation, we introduce FNOSeg3D, a 3D segmentation model robust to training image resolution based on the Fourier neural operator (FNO). The FNO is a deep learning framework for learning mappings between functions in partial differential equations, which has the appealing properties of zero-shot super-resolution and global receptive field. We improve the FNO by reducing its parameter requirement and enhancing its learning capability through residual connections and deep supervision, and these result in our FNOSeg3D model which is parameter efficient and resolution robust. When tested on the BraTS'19 dataset, it achieved superior robustness to training image resolution than other tested models with less than 1% of their model parameters.

引用

页数：5

共 15 条

[1] Ba JL, 2016, arXiv
[2] Bakas S, 2019, Arxiv, DOI [arXiv:1811.02629, 10.48550/arXiv.1811.02629, DOI 10.48550/ARXIV.1811.02629]
[3] UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation
Gao, Yunhe
Zhou, Mu
Metaxas, Dimitris N.
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 61 - 71
[4] UNETR: Transformers for 3D Medical Image Segmentation
Hatamizadeh, Ali
Tang, Yucheng
Nath, Vishwesh
Yang, Dong
Myronenko, Andriy
Landman, Bennett
Roth, Holger R.
Xu, Daguang
[J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1748 - 1758
[5] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[6] Deep Learning Techniques for Medical Image Segmentation: Achievements and Challenges
Hesamian, Mohammad Hesam
Jia, Wenjing
He, Xiangjian
Kennedy, Paul
[J]. JOURNAL OF DIGITAL IMAGING, 2019, 32 (04) : 582 - 596
[7] Kingma DP, 2014, ADV NEUR IN, V27
[8] Klambauer G., 2017, Advances in neural information processing systems, P971
[9] Lee CY, 2015, JMLR WORKSH CONF PRO, V38, P562
[10] Li Zhiyuan, 2021, INT C LEARNING REPRE

← 1 2 →