Machine Learning for Workflow Applications in Screening Mammography: Systematic Review and Meta-Analysis

被引:69
作者
Hickman, Sarah E. [1 ]
Woitek, Ramona [1 ,3 ,4 ]
Le, Elizabeth Phuong Vi [2 ]
Im, Yu Ri [2 ]
Luxhoj, Carina Mouritsen [2 ]
Aviles-Rivero, Angelica, I [5 ]
Baxter, Gabrielle C. [1 ]
MacKay, James W. [1 ,6 ]
Gilbert, Fiona J. [1 ,3 ]
机构
[1] Univ Cambridge, Sch Clin Med, Dept Radiol, Box 218,Cambridge Biomed Campus, Cambridge CB2 0QQ, England
[2] Univ Cambridge, Sch Clin Med, Dept Med, Box 218,Cambridge Biomed Campus, Cambridge CB2 0QQ, England
[3] Cambridge Univ Hosp Natl Hlth Serv Fdn Trust, Addenbrookes Hosp, Dept Radiol, Cambridge, England
[4] Med Univ Vienna, Dept Biomed Imaging & Image Guided Therapy, Vienna, Austria
[5] Univ Cambridge, Dept Pure Math & Math Stat, Cambridge, England
[6] Univ East Anglia, Norwich Med Sch, Norwich, Norfolk, England
基金
英国工程与自然科学研究理事会;
关键词
COMPUTER-AIDED DETECTION; ARTIFICIAL-INTELLIGENCE; CANCER; PERFORMANCE; ACCURACY; TOOL;
D O I
10.1148/radiol.2021210391
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: Advances in computer processing and improvements in data availability have led to the development of machine learning (ML) techniques for mammographic imaging. Purpose: To evaluate the reported performance of stand-alone ML applications for screening mammography workflow. Materials and Methods: Ovid Embase, Ovid Medline, Cochrane Central Register of Controlled Trials, Scopus, and Web of Science literature databases were searched for relevant studies published from January 2012 to September 2020. The study was registered with the PROSPERO International Prospective Register of Systematic Reviews (protocol no. CRD42019156016). Stand-alone technology was defined as a ML algorithm that can be used independently of a human reader. Studies were quality assessed using the Quality Assessment of Diagnostic Accuracy Studies 2 and the Prediction Model Risk of Bias Assessment Tool, and reporting was evaluated using the Checklist for Artificial Intelligence in Medical Imaging. A primary meta-analysis included the top-performing algorithm and corresponding reader performance from which pooled summary estimates for the area under the receiver operating characteristic curve (AUC) were calculated using a bivariate model. Results: Fourteen articles were included, which detailed 15 studies for stand-alone detection (n = 8) and triage (n = 7). Triage studies reported that 17%-91% of normal mammograms identified could be read by adapted screening, while "missing" an estimated 0%-7% of cancers. In total, an estimated 185 252 cases from three countries with more than 39 readers were included in the primary meta-analysis. The pooled sensitivity, specificity, and AUC was 75.4% (95% CI: 65.6, 83.2; P =.11), 90.6% (95% CI: 82.9, 95.0; P =.40), and 0.89 (95% CI: 0.84, 0.98), respectively, for algorithms, and 73.0% (95% CI: 60.7, 82.6), 88.6% (95% CI: 72.4, 95.8), and 0.85 (95% CI: 0.78, 0.97), respectively, for readers. Conclusion: Machine learning (ML) algorithms that demonstrate a stand-alone application in mammographic screening -workflows achieve or even exceed human reader detection performance and improve efficiency. However, this evidence is from a small number of retrospective studies. Therefore, further rigorous independent external prospective testing of ML algorithms to assess performance at preassigned thresholds is required to support these claims. (C) RSNA, 2021
引用
收藏
页码:88 / 104
页数:17
相关论文
共 48 条
[1]  
Balta C, 2020, P SPIE 11513 15 INT
[2]  
Canty A., 2020, BOOT BOOTSTRAP R S P
[3]  
ClinicalTrials.gov, EXPT US INN COMP VIS
[4]  
ClinicalTrials.gov, Development of Artificial Intelligence System for Detection and Diagnosis of Breast Lesion Using Mammography
[5]   US Preventive Services Task Force and Breast Cancer Screening [J].
DeAngelis, Catherine D. ;
Fontanarosa, Phil B. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2010, 303 (02) :172-173
[6]  
Dembrower K, 2020, LANCET DIGIT HEALTH, V2, pE468, DOI 10.1016/S2589-7500(20)30185-0
[7]  
Diab AR, 2019, ARXIV191211027CSEESS, P1
[8]  
Doebler P., 2020, MAD MET DIAGN ACC R
[9]  
Geras KJ., 2017, High-resolution breast cancer screening with multi-view deep convolutional neural networks
[10]   Single reading with computer-aided detection for screening mammography [J].
Gilbert, Fiona J. ;
Astley, Susan M. ;
Gillan, Maureen G. C. ;
Agbaje, Olorunsola F. ;
Wallis, Matthew G. ;
James, Jonathan ;
Boggis, Caroline R. M. ;
Duffy, Stephen W. .
NEW ENGLAND JOURNAL OF MEDICINE, 2008, 359 (16) :1675-1684