Using deep learning to assist readers during the arbitration process: a lesion-based retrospective evaluation of breast cancer screening performance

被引:8
|
作者
Kerschke, Laura [1 ]
Weigel, Stefanie [2 ,3 ]
Rodriguez-Ruiz, Alejandro [4 ]
Karssemeijer, Nico [4 ,5 ]
Heindel, Walter [2 ,3 ]
机构
[1] Univ Munster, Inst Biostat & Clin Res, IBKF, Schmeddingstr 56, D-48149 Munster, Germany
[2] Univ Munster, Clin Radiol & Reference Ctr Mammog Muenster, Albert Schweitzer Campus 1, D-48149 Munster, Germany
[3] Univ Hosp Muenster, Albert Schweitzer Campus 1, D-48149 Munster, Germany
[4] ScreenPoint Med BV, Toernooiveld 300, NL-6525 EC Nijmegen, Netherlands
[5] Radboud Univ Nijmegen, Dept Radiol & Nucl Med, Med Ctr, Geert Grootepl Zuid 10, NL-6525 GA Nijmegen, Netherlands
关键词
Breast cancer; Screening; Mammography; Artificial intelligence; DUCTAL CARCINOMA; AI;
D O I
10.1007/s00330-021-08217-w
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Objectives To evaluate if artificial intelligence (AI) can discriminate recalled benign from recalled malignant mammographic screening abnormalities to improve screening performance. Methods A total of 2257 full-field digital mammography screening examinations, obtained 2011-2013, of women aged 50-69 years which were recalled for further assessment of 295 malignant out of 305 truly malignant lesions and 2289 benign lesions after independent double-reading with arbitration, were included in this retrospective study. A deep learning AI system was used to obtain a score (0-95) for each recalled lesion, representing the likelihood of breast cancer. The sensitivity on the lesion level and the proportion of women without false-positive ratings (non-FPR) resulting under AI were estimated as a function of the classification cutoff and compared to that of human readers. Results Using a cutoff of 1, AI decreased the proportion of women with false-positives from 89.9 to 62.0%, non-FPR 11.1% vs. 38.0% (difference 26.9%, 95% confidence interval 25.1-28.8%; p < .001), preventing 30.1% of reader-induced false-positive recalls, while reducing sensitivity from 96.7 to 91.1% (5.6%, 3.1-8.0%) as compared to human reading. The positive predictive value of recall (PPV-1) increased from 12.8 to 16.5% (3.7%, 3.5-4.0%). In women with mass-related lesions (n = 900), the non-FPR was 14.2% for humans vs. 36.7% for AI (22.4%, 19.8-25.3%) at a sensitivity of 98.5% vs. 97.1% (1.5%, 0-3.5%). Conclusion The application of AI during consensus conference might especially help readers to reduce false-positive recalls of masses at the expense of a small sensitivity reduction. Prospective studies are needed to further evaluate the screening benefit of AI in practice.
引用
收藏
页码:842 / 852
页数:11
相关论文
共 50 条
  • [41] Multi-Input Deep Learning Approach for Breast Cancer Screening Using Thermal Infrared Imaging and Clinical Data
    Tsietso, Dennies
    Yahya, Abid
    Samikannu, Ravi
    Tariq, Muhammad Usman
    Babar, Muhammad
    Qureshi, Basit
    Koubaa, Anis
    IEEE ACCESS, 2023, 11 : 52101 - 52116
  • [42] Community-Based Breast Cancer Screening Using Digital Breast Tomosynthesis Versus Digital Mammography: Comparison of Screening Performance and Tumor Characteristics
    Regen-Tuero, Helaina C.
    Ram, Shruthi
    Gass, Jennifer S.
    Lourenco, Ana P.
    AMERICAN JOURNAL OF ROENTGENOLOGY, 2022, 218 (02) : 249 - 256
  • [43] Performance of breast cancer screening using digital breast tomosynthesis: results from the prospective population-based Oslo Tomosynthesis Screening Trial
    Skaane, Per
    Sebuodegard, Sofie
    Bandos, Andriy I.
    Gur, David
    Osteras, Bjorn Helge
    Gullien, Randi
    Hofvind, Solveig
    BREAST CANCER RESEARCH AND TREATMENT, 2018, 169 (03) : 489 - 496
  • [44] Diagnostic Performance of Deep Learning-Based Lesion Detection Algorithm in CT for Detecting Hepatic Metastasis from Colorectal Cancer
    Kim, Kiwook
    Kim, Sungwon
    Han, Kyunghwa
    Bae, Heejin
    Shin, Jaeseung
    Lim, Joon Seok
    KOREAN JOURNAL OF RADIOLOGY, 2021, 22 (06) : 912 - 921
  • [45] Deep learning-based breast cancer diagnosis with multiview of mammography screening to reduce false positive recall rate
    Karagoz, Meryem Altin
    Nalbantoglu, O. Ufuk
    Karaboga, Dervis
    Akay, Bahriye
    Basturk, Alper
    Ulutabanca, Halil
    Dogan, Serap
    Coskun, Damla
    Demir, Osman
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2024, 32 (03) : 382 - 402
  • [46] Deep Learning- and Expert Knowledge-Based Feature Extraction and Performance Evaluation in Breast Histopathology Images
    Kode, Hepseeba
    Barkana, Buket D.
    CANCERS, 2023, 15 (12)
  • [47] External Evaluation of a Mammography-based Deep Learning Model for Predicting Breast Cancer in an Ethnically Diverse Population
    Omoleye, Olasubomi J.
    Woodard, Anna E.
    Howard, Frederick M.
    Zhao, Fangyuan
    Yoshimatsu, Toshio F.
    Zheng, Yonglan
    Pearson, Alexander T.
    Levental, Maksinz
    Aribisala, Benjamin S.
    Kulkarni, Kirti
    Karczmar, Gregory S.
    Olopade, Obfunmilayo, I
    Abe, Hiroyuki
    Huo, Dezheng
    RADIOLOGY-ARTIFICIAL INTELLIGENCE, 2023, 5 (06)
  • [48] Mammography and ultrasound based dual modality classification of breast cancer using a hybrid deep learning approach
    Atrey, Kushangi
    Singh, Bikesh Kumar
    Bodhey, Narendra K.
    Pachori, Ram Bilas
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [49] Breast Cancer Classification From Histopathological Images Using Patch-Based Deep Learning Modeling
    Hirra, Irum
    Ahmad, Mubashir
    Hussain, Ayaz
    Ashraf, M. Usman
    Saeed, Iftikhar Ahmed
    Qadri, Syed Furqan
    Alghamdi, Ahmed M.
    Alfakeeh, Ahmed S.
    IEEE ACCESS, 2021, 9 : 24273 - 24287
  • [50] Machine learning-based diagnostic evaluation of shear-wave elastography in BI-RADS category 4 breast cancer screening: a multicenter, retrospective study
    Tang, Yi
    Liang, Minjie
    Tao, Li
    Deng, Minjun
    Li, Tianfu
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2022, 12 (02) : 1223 - 1234