A Deep Learning Model to Triage Screening Mammograms: A Simulation Study

被引:126
|
作者
Yala, Adam [1 ]
Schuster, Tal [1 ]
Miles, Randy [2 ]
Barzilay, Regina [1 ]
Lehman, Constance [2 ]
机构
[1] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[2] Harvard Med Sch, Dept Radiol, Massachusetts Gen Hosp, 55 Fruit St,WAC 240, Boston, MA 02114 USA
关键词
COMPUTER-AIDED DETECTION; BREAST-CANCER; RANDOMIZED-TRIAL; PERFORMANCE; MORTALITY; 10-YEAR;
D O I
10.1148/radiol.2019182908
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: Recent deep learning (DL) approaches have shown promise in improving sensitivity but have not addressed limitations in radiologist specificity or efficiency. Purpose: To develop a DL model to triage a portion of mammograms as cancer free, improving performance and workflow efficiency. Materials and Methods: In this retrospective study, 223 109 consecutive screening mammograms performed in 66 661 women from January 2009 to December 2016 were collected with cancer outcomes obtained through linkage to a regional tumor registry. This cohort was split by patient into 212 272, 25 999, and 26 540 mammograms from 56 831, 7021, and 7176 patients for training,validation, and testing, respectively. A DL model was developed to triage mammograms as cancer free and evaluated on the test set. A DL-triage workflow was simulated in which radiologists skipped mammograms triaged as cancer free (interpreting them as negative for cancer) and read mammograms not triaged as cancer free by using the original interpreting radiologists' assessments.Sensitivities, specificities, and percentage of mammograms read were calculated, with and without the DL-triage-simulated workflow. Statistics were computed across 5000 bootstrap samples to assess confidence intervals (CIs). Specificities were compared by using a two-tailed t test (P < .05) and sensitivities were compared by using a one-sided t test with a non inferiority margin of 5% (P < .05). Results: The test set included 7176 women (mean age, 57.8 years +/- 10.9 [standard deviation]). When reading all mammograms, radiologists obtained a sensitivity and specificity of 90.6% (173 of 191; 95% CI: 86.6%, 94.7%) and 93.5% (24 625 of 26 349; 95% CI: 93.3%, 93.9%). In the DL-simulated workflow, the radiologists obtained a sensitivity and specificity of 90.1% (172 of 191; 95% CI: 86.0%, 94.3%) and 94.2% (24 814 of 26 349; 95% CI: 94.0%, 94.6%) while reading 80.7% (21 420 of 26 540) of the mammograms. The simulated workflow improved specificity (P = .002) and obtained a noninferior sensitivity with a margin of 5% (P < .001). Conclusion: This deep learning model has the potential to reduce radiologist workload and significantly improve specificity without harming sensitivity. (C) RSNA, 2019
引用
收藏
页码:38 / 46
页数:9
相关论文
共 50 条
  • [31] Classification of Breast Cancer in Mammograms with Deep Learning Adding a Fifth Class
    Castro-Tapia, Salvador
    Castaneda-Miranda, Celina Lizeth
    Olvera-Olvera, Carlos Alberto
    Guerrero-Osuna, Hector A.
    Ortiz-Rodriguez, Jose Manuel
    Martinez-Blanco, Ma. del Rosario
    Diaz-Florez, German
    Mendiola-Santibanez, Jorge Domingo
    Solis-Sanchez, Luis Octavio
    APPLIED SCIENCES-BASEL, 2021, 11 (23):
  • [32] Deep learning based screening model for hip diseases on plain radiographs
    Park, Jung-Wee
    Ryu, Seung Min
    Kim, Hong-Seok
    Lee, Young-Kyun
    Yoo, Jeong Joon
    PLOS ONE, 2025, 20 (02):
  • [33] Variations in screening outcome among pairs of screening radiologists at non-blinded double reading of screening mammograms: a population-based study
    Klompenhouwer, E. G.
    Duijm, L. E. M.
    Voogd, A. C.
    den Heeten, G. J.
    Nederend, J.
    Jansen, F. H.
    Broeders, M. J. M.
    EUROPEAN RADIOLOGY, 2014, 24 (05) : 1097 - 1104
  • [34] Can a Machine Learn from Radiologists’ Visual Search Behaviour and Their Interpretation of Mammograms—a Deep-Learning Study
    Suneeta Mall
    Patrick C. Brennan
    Claudia Mello-Thoms
    Journal of Digital Imaging, 2019, 32 : 746 - 760
  • [35] An end-to-end deep learning model can detect the gist of the abnormal in prior mammograms as perceived by experienced radiologists
    Gandomkar, Ziba
    Ekpo, Ernest
    Lewis, Sarah
    Suleiman, Moayyad
    Siviengphanom, Somphone
    Li, Tong
    Brennan, Patrick
    MEDICAL IMAGING 2021: IMAGE PERCEPTION, OBSERVER PERFORMANCE, AND TECHNOLOGY ASSESSMENT, 2021, 11599
  • [36] Breast Mammograms Diagnosis Using Deep Learning: State of Art Tutorial Review
    Naeem, Osama Bin
    Saleem, Yasir
    Khan, M. Usman Ghani
    Khan, Amjad Rehman
    Saba, Tanzila
    Bahaj, Saeed Ali
    Ayesha, Noor
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2024, 31 (04) : 2431 - 2449
  • [37] Automated Mass Detection in Mammograms using Cascaded Deep Learning and Random Forests
    Dhungel, Neeraj
    Carneiro, Gustavo
    Bradley, Andrew P.
    2015 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2015, : 160 - 167
  • [38] Supplementary Breast Ultrasound Screening in Asian Women with Negative But Dense Mammograms-A Pilot Study
    Leong, Lester C. H.
    Gogna, Apoorva
    Pant, Rita
    Ng, Fook Cheong
    Sim, Llewellyn S. J.
    ANNALS ACADEMY OF MEDICINE SINGAPORE, 2012, 41 (10) : 432 - 439
  • [39] Mass detection in mammograms using pre-trained deep learning models
    Agarwal, Richa
    Diaz, Oliver
    Llado, Xavier
    Marti, Robert
    14TH INTERNATIONAL WORKSHOP ON BREAST IMAGING (IWBI 2018), 2018, 10718
  • [40] Deep learning modeling using normal mammograms for predicting breast cancer risk
    Arefan, Dooman
    Mohamed, Aly A.
    Berg, Wendie A.
    Zuley, Margarita L.
    Sumkin, Jules H.
    Wu, Shandong
    MEDICAL PHYSICS, 2020, 47 (01) : 110 - 118