Demystifying the effect of receptive field size in U-Net models for medical image segmentation

被引:0
作者
Loos, Vincent [1 ]
Pardasani, Rohit [2 ]
Awasthi, Navchetan [1 ,3 ]
机构
[1] Univ Amsterdam, Informat Inst, Fac Sci Math & Comp Sci, Amsterdam, Netherlands
[2] Gen Elect Healthcare, Bengaluru, Karnataka, India
[3] Amsterdam UMC, Dept Biomed Engn & Phys, Amsterdam, Netherlands
关键词
effective receptive field; receptive field; segmentation; theoretical receptive field; U-Net;
D O I
10.1117/1.JMI.11.5.054004
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: Medical image segmentation is a critical task in healthcare applications, and U-Nets have demonstrated promising results in this domain. We delve into the understudied aspect of receptive field (RF) size and its impact on the U-Net and attention U-Net architectures used for medical imaging segmentation. Approach: We explore several critical elements including the relationship among RF size, characteristics of the region of interest, and model performance, as well as the balance between RF size and computational costs for U-Net and attention U-Net methods for different datasets. We also propose a mathematical notation for representing the theoretical receptive field (TRF) of a given layer in a network and propose two new metrics, namely, the effective receptive field (ERF) rate and the object rate, to quantify the fraction of significantly contributing pixels within the ERF against the TRF area and assessing the relative size of the segmentation object compared with the TRF size, respectively. Results: The results demonstrate that there exists an optimal TRF size that successfully strikes a balance between capturing a wider global context and maintaining computational efficiency, thereby optimizing model performance. Interestingly, a distinct correlation is observed between the data complexity and the required TRF size; segmentation based solely on contrast achieved peak performance even with smaller TRF sizes, whereas more complex segmentation tasks necessitated larger TRFs. Attention U-Net models consistently outperformed their U-Net counterparts, highlighting the value of attention mechanisms regardless of TRF size. Conclusions: These insights present an invaluable resource for developing more efficient U-Net-based architectures for medical imaging and pave the way for future exploration of other segmentation architectures. A tool is also developed, which calculates the TRF for a U-Net (and attention U-Net) model and also suggests an appropriate TRF size for a given model and dataset.
引用
收藏
页数:25
相关论文
共 34 条
  • [1] Araujo A., 2019, Distill, V4, pe21, DOI DOI 10.23915/DISTILL.00021
  • [2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [3] Behboodi B, 2020, IEEE ENG MED BIO, P2117, DOI [10.1109/EMBC44109.2020.9175846, 10.1109/embc44109.2020.9175846]
  • [4] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [5] Daniel A. J., 2021, T2-weighted kidney MRI segmentation
  • [6] Automated renal segmentation in healthy and chronic kidney disease subjects using a convolutional neural network
    Daniel, Alexander J.
    Buchanan, Charlotte E.
    Allcock, Thomas
    Scerri, Daniel
    Cox, Eleanor F.
    Prestwich, Benjamin L.
    Francis, Susan T.
    [J]. MAGNETIC RESONANCE IN MEDICINE, 2021, 86 (02) : 1125 - 1136
  • [7] Danilov V., 2022, Mendeley Data, V10
  • [8] Dumoulin V, 2018, Arxiv, DOI arXiv:1603.07285
  • [9] How Bandwidth Selection Algorithms Impact Exploratory Data Analysis Using Kernel Density Estimation
    Harpole, Jared K.
    Woods, Carol M.
    Rodebaugh, Thomas L.
    Levinson, Cheri A.
    Lenze, Eric J.
    [J]. PSYCHOLOGICAL METHODS, 2014, 19 (03) : 428 - 443
  • [10] Deep Learning Techniques for Medical Image Segmentation: Achievements and Challenges
    Hesamian, Mohammad Hesam
    Jia, Wenjing
    He, Xiangjian
    Kennedy, Paul
    [J]. JOURNAL OF DIGITAL IMAGING, 2019, 32 (04) : 582 - 596