Challenges in Using Deep Neural Networks Across Multiple Readers in Delineating Prostate Gland Anatomy

被引:0
作者
Abudalou, Shatha [1 ,2 ]
Choi, Jung [3 ]
Gage, Kenneth [3 ]
Pow-Sang, Julio [4 ]
Yilmaz, Yasin [2 ]
Balagurunathan, Yoganand [1 ,2 ,3 ,4 ]
机构
[1] H Lee Moffitt Canc Ctr & Res Inst, Dept Machine Learning, Tampa, FL 33612 USA
[2] Univ S Florida, Dept Elect Engn, Tampa, FL 33620 USA
[3] H Lee Moffitt Canc Ctr & Res Inst, Dept Diagnost Radiol, Tampa, FL 33612 USA
[4] H Lee Moffitt Canc Ctr & Res Inst, Dept Genitourinary Oncol, Tampa, FL 33612 USA
来源
JOURNAL OF IMAGING INFORMATICS IN MEDICINE | 2025年
关键词
Reproducibility of deep network; Prostate gland segmentation; Multi-reader variability; SEGMENTATION; MRI; ALGORITHMS;
D O I
10.1007/s10278-025-01504-8
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Deep learning methods provide enormous promise in automating manually intense tasks such as medical image segmentation and provide workflow assistance to clinical experts. Deep neural networks (DNN) require a significant amount of training examples and a variety of expert opinions to capture the nuances and the context, a challenging proposition in oncological studies (H. Wang et al., Nature, vol. 620, no. 7972, pp. 47-60, Aug 2023). Inter-reader variability among clinical experts is a real-world problem that severely impacts the generalization of DNN reproducibility. This study proposes quantifying the variability in DNN performance using expert opinions and exploring strategies to train the network and adapt between expert opinions. We address the inter-reader variability problem in the context of prostate gland segmentation using a well-studied DNN, the 3D U-Net model. Reference data includes magnetic resonance imaging (MRI, T2-weighted) with prostate glandular anatomy annotations from two expert readers (R#1, n = 342 and R#2, n = 204). 3D U-Net was trained and tested with individual expert examples (R#1 and R#2) and had an average Dice coefficient of 0.825 (CI, [0.81 0.84]) and 0.85 (CI, [0.82 0.88]), respectively. Combined training with a representative cohort proportion (R#1, n = 100 and R#2, n = 150) yielded enhanced model reproducibility across readers, achieving an average test Dice coefficient of 0.863 (CI, [0.85 0.87]) for R#1 and 0.869 (CI, [0.87 0.88]) for R#2. We re-evaluated the model performance across the gland volumes (large, small) and found improved performance for large gland size with an average Dice coefficient to be at 0.846 [CI, 0.82 0.87] and 0.872 [CI, 0.86 0.89] for R#1 and R#2, respectively, estimated using fivefold cross-validation. Performance for small gland sizes diminished with average Dice of 0.8 [0.79, 0.82] and 0.8 [0.79, 0.83] for R#1 and R#2, respectively.
引用
收藏
页数:16
相关论文
共 65 条
[1]  
Abdalbagi F., 2020, Signal & Image Processing: An International Journal (SIPIJ), V11
[2]   Prostate158-An expert-annotated 3T MRI dataset and algorithm for prostate cancer detection [J].
Adams, Lisa C. ;
Makowski, Marcus R. ;
Engel, Guenther ;
Rattunde, Maximilian ;
Busch, Felix ;
Asbach, Patrick ;
Niehues, Stefan M. ;
Vinayahalingam, Shankeeth ;
van Ginneken, Bram ;
Litjens, Geert ;
Bressem, Keno K. .
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 148
[3]  
Albayrak A, 2016, INT SYMP COMP INTELL, P335, DOI 10.1109/CINTI.2016.7846429
[4]   Automatic prostate and prostate zones segmentation of magnetic resonance images using DenseNet-like U-net [J].
Aldoj, Nader ;
Biavati, Federico ;
Michallek, Florian ;
Stober, Sebastian ;
Dewey, Marc .
SCIENTIFIC REPORTS, 2020, 10 (01)
[5]   Medical Image Analysis using Convolutional Neural Networks: A Review [J].
Anwar, Syed Muhammad ;
Majid, Muhammad ;
Qayyum, Adnan ;
Awais, Muhammad ;
Alnowami, Majdi ;
Khan, Muhammad Khurram .
JOURNAL OF MEDICAL SYSTEMS, 2018, 42 (11)
[6]   Lung and Infection CT-Scan-Based Segmentation with 3D UNet Architecture and Its Modification [J].
Asnawi, Mohammad Hamid ;
Pravitasari, Anindya Apriliyanti ;
Darmawan, Gumgum ;
Hendrawati, Triyani ;
Yulita, Intan Nurma ;
Suprijadi, Jadi ;
Nugraha, Farid Azhar Lutfi .
HEALTHCARE, 2023, 11 (02)
[7]   Comparing 3D, 2.5D, and 2D Approaches to Brain Image Auto-Segmentation [J].
Avesta, Arman ;
Hossain, Sajid ;
Lin, MingDe ;
Aboian, Mariam ;
Krumholz, Harlan M. ;
Aneja, Sanjay .
BIOENGINEERING-BASEL, 2023, 10 (02)
[8]   On the usage of average Hausdorff distance for segmentation performance assessment: hidden error when used for ranking [J].
Aydin, Orhun Utku ;
Taha, Abdel Aziz ;
Hilbert, Adam ;
Khalil, Ahmed A. ;
Galinovic, Ivana ;
Fiebach, Jochen B. ;
Frey, Dietmar ;
Madai, Vince Istvan .
EUROPEAN RADIOLOGY EXPERIMENTAL, 2021, 5 (01)
[9]   A multi-object deep neural network architecture to detect prostate anatomy in T2-weighted MRI: Performance evaluation [J].
Baldeon-Calisto, Maria ;
Wei, Zhouping ;
Abudalou, Shatha ;
Yilmaz, Yasin ;
Gage, Kenneth ;
Pow-Sang, Julio ;
Balagurunathan, Yoganand .
FRONTIERS IN NUCLEAR MEDICINE, 2023, 2
[10]   Quality checkpoints in the MRI-directed prostate cancer diagnostic pathway [J].
Barrett, Tristan ;
de Rooij, Maarten ;
Giganti, Francesco ;
Allen, Clare ;
Barentsz, Jelle O. ;
Padhani, Anwar R. .
NATURE REVIEWS UROLOGY, 2023, 20 (01) :9-22