Does deep learning software improve the consistency and performance of radiologists with various levels of experience in assessing bi-parametric prostate MRI?

被引:3
作者
Arslan, Aydan [1 ]
Alis, Deniz [2 ]
Erdemli, Servet [2 ]
Seker, Mustafa Ege [3 ]
Zeybel, Gokberk [3 ]
Sirolu, Sabri [4 ]
Kurtcan, Serpil [5 ]
Karaarslan, Ercan [2 ]
机构
[1] Umraniye Training & Res Hosp, Dept Radiol, Istanbul, Turkiye
[2] Acibadem Mehmet Ali Aydinlar Univ, Sch Med, Dept Radiol, Istanbul, Turkiye
[3] Acibadem Mehmet Ali Aydinlar Univ, Sch Med, Istanbul, Turkiye
[4] Istanbul Sisli Hamidiye Etfal Training & Res Hosp, Dept Radiol, Istanbul, Turkiye
[5] Acibadem Healthcare Grp, Dept Radiol, Istanbul, Turkiye
关键词
Deep learning; Magnetic resonance imaging; Prostate cancer; CANCER; ACCURACY; SYSTEM;
D O I
10.1186/s13244-023-01386-w
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
ObjectiveTo investigate whether commercially available deep learning (DL) software improves the Prostate Imaging-Reporting and Data System (PI-RADS) scoring consistency on bi-parametric MRI among radiologists with various levels of experience; to assess whether the DL software improves the performance of the radiologists in identifying clinically significant prostate cancer (csPCa).MethodsWe retrospectively enrolled consecutive men who underwent bi-parametric prostate MRI at a 3 T scanner due to suspicion of PCa. Four radiologists with 2, 3, 5, and > 20 years of experience evaluated the bi-parametric prostate MRI scans with and without the DL software. Whole-mount pathology or MRI/ultrasound fusion-guided biopsy was the reference. The area under the receiver operating curve (AUROC) was calculated for each radiologist with and without the DL software and compared using De Long's test. In addition, the inter-rater agreement was investigated using kappa statistics.ResultsIn all, 153 men with a mean age of 63.59 +/- 7.56 years (range 53-80) were enrolled in the study. In the study sample, 45 men (29.80%) had clinically significant PCa. During the reading with the DL software, the radiologists changed their initial scores in 1/153 (0.65%), 2/153 (1.3%), 0/153 (0%), and 3/153 (1.9%) of the patients, yielding no significant increase in the AUROC (p > 0.05). Fleiss' kappa scores among the radiologists were 0.39 and 0.40 with and without the DL software (p = 0.56).ConclusionsThe commercially available DL software does not increase the consistency of the bi-parametric PI-RADS scoring or csPCa detection performance of radiologists with varying levels of experience.
引用
收藏
页数:10
相关论文
共 34 条
  • [1] MRI-Targeted, Systematic, and Combined Biopsy for Prostate Cancer Diagnosis
    Ahdoot, Michael
    Wilbur, Andrew R.
    Reese, Sarah E.
    Lebastchi, Amir H.
    Mehralivand, Sherif
    Gomella, Patrick T.
    Bloom, Jonathan
    Gurram, Sandeep
    Siddiqui, Minhaj
    Pinsky, Paul
    Parnes, Howard
    Linehan, W. Marston
    Merino, Maria
    Choyke, Peter L.
    Shih, Joanna H.
    Turkbey, Baris
    Wood, Bradford J.
    Pinto, Peter A.
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2020, 382 (10) : 917 - 928
  • [2] Diagnostic accuracy of multi-parametric MRI and TRUS biopsy in prostate cancer (PROMIS): a paired validating confirmatory study
    Ahmed, Hashim U.
    Bosaily, Ahmed El-Shater
    Brown, Louise C.
    Gabe, Rhian
    Kaplan, Richard
    Parmar, Mahesh K.
    Collaco-Moraes, Yolanda
    Ward, Katie
    Hindley, Richard G.
    Freeman, Alex
    Kirkham, Alex P.
    Oldroyd, Robert
    Parker, Chris
    Emberton, Mark
    [J]. LANCET, 2017, 389 (10071) : 815 - 822
  • [3] Semi-automatic classification of prostate cancer on multi-parametric MR imaging using a multi-channel 3D convolutional neural network
    Aldoj, Nader
    Lukas, Steffen
    Dewey, Marc
    Penzkofer, Tobias
    [J]. EUROPEAN RADIOLOGY, 2020, 30 (02) : 1243 - 1253
  • [4] American Urologic Association, 2022, OPT TECHN PROST BIOP
  • [5] End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography
    Ardila, Diego
    Kiraly, Atilla P.
    Bharadwaj, Sujeeth
    Choi, Bokyung
    Reicher, Joshua J.
    Peng, Lily
    Tse, Daniel
    Etemadi, Mozziyar
    Ye, Wenxing
    Corrado, Greg
    Naidich, David P.
    Shetty, Shravya
    [J]. NATURE MEDICINE, 2019, 25 (06) : 954 - +
  • [6] Prebiopsy Biparametric MRI for Clinically Significant Prostate Cancer Detection With PI-RADS Version 2: A Multicenter Study
    Choi, Moon Hyung
    Kim, Chan Kyo
    Lee, Young Joon
    Jung, Seung Eun
    [J]. AMERICAN JOURNAL OF ROENTGENOLOGY, 2019, 212 (04) : 839 - 846
  • [7] A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES
    COHEN, J
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) : 37 - 46
  • [8] Machine learning for the identification of clinically significant prostate cancer on MRI: a meta-analysis
    Cuocolo, Renato
    Cipullo, Maria Brunella
    Stanzione, Arnaldo
    Romeo, Valeria
    Green, Roberta
    Cantoni, Valeria
    Ponsiglione, Andrea
    Ugga, Lorenzo
    Imbriaco, Massimo
    [J]. EUROPEAN RADIOLOGY, 2020, 30 (12) : 6877 - 6887
  • [9] ESUR/ESUI consensus statements on multi-parametric MRI for the detection of clinically significant prostate cancer: quality requirements for image acquisition, interpretation and radiologists' training
    de Rooij, Maarten
    Israel, Bas
    Tummers, Marcia
    Ahmed, Hashim U.
    Barrett, Tristan
    Giganti, Francesco
    Hamm, Bernd
    Logager, Vibeke
    Padhani, Anwar
    Panebianco, Valeria
    Puech, Philippe
    Richenberg, Jonathan
    Rouviere, Olivier
    Salomon, Georg
    Schoots, Ivo
    Veltman, Jeroen
    Villeirs, Geert
    Walz, Jochen
    Barentsz, Jelle O.
    [J]. EUROPEAN RADIOLOGY, 2020, 30 (10) : 5404 - 5416
  • [10] Dong Yang, 2017, Medical Image Computing and Computer Assisted Intervention MICCAI 2017. 20th International Conference. Proceedings: LNCS 10435, P507, DOI 10.1007/978-3-319-66179-7_58