Artificial Intelligence in Fracture Detection: A Systematic Review and Meta-Analysis

被引:121
作者
Kuo, Rachel Y. L. [1 ]
Harrison, Conrad [1 ]
Curran, Terry-Ann [2 ]
Jones, Benjamin [3 ]
Freethy, Alexander [2 ]
Cussons, David [4 ]
Stewart, Max [1 ]
Collins, Gary S. [1 ,5 ]
Furniss, Dominic [1 ]
机构
[1] Botnar Res Ctr, Nuffield Dept Orthoped Rheumatol & Musculoskeleta, Old Rd Headington, Oxford OX3 7LD, England
[2] John Radcliffe Hosp, Dept Plast Surg, Oxford, England
[3] Royal Berkshire Hosp, Dept Vasc Surg, Reading, Berks, England
[4] Stoke Mandeville Hosp, Dept Plast Surg, Aylesbury, Bucks, England
[5] Univ Oxford, UK EQUATOR Ctr, Ctr Stat Med, Nuffield Dept Orthopaed Rheumatol & Musculoskelet, Oxford, England
关键词
AUTOMATED CLASSIFICATION; RIB FRACTURES; DEEP; ACCURACY; ALGORITHM; SAMPLE; BIAS;
D O I
10.1148/radiol.211785
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: Patients with fractures are a common emergency presentation and may be misdiagnosed at radiologic imaging. An increasing number of studies apply artificial intelligence (AI) techniques to fracture detection as an adjunct to clinician diagnosis. Purpose: To perform a systematic review and meta-analysis comparing the diagnostic performance in fracture detection between AI and clinicians in peer-reviewed publications and the gray literature (ie, articles published on preprint repositories). Materials and Methods: A search of multiple electronic databases between January 2018 and July 2020 (updated June 2021) was performed that included any primary research studies that developed and/or validated AI for the purposes of fracture detection at any imaging modality and excluded studies that evaluated image segmentation algorithms. Meta-analysis with a hierarchical model to calculate pooled sensitivity and specificity was used. Risk of bias was assessed by using a modified Prediction Model Study Risk of Bias Assessment Tool, or PROBAST, checklist. Results: Included for analysis were 42 studies, with 115 contingency tables extracted from 32 studies (55 061 images). Thirty-seven studies identified fractures on radiographs and five studies identified fractures on CT images. For internal validation test sets, the pooled sensitivity was 92% (95% CI: 88, 93) for AI and 91% (95% CI: 85, 95) for clinicians, and the pooled specificity was 91% (95% CI: 88, 93) for AI and 92% (95% CI: 89, 92) for clinicians. For external validation test sets, the pooled sensitivity was 91% (95% CI: 84, 95) for AI and 94% (95% CI: 90, 96) for clinicians, and the pooled specificity was 91% (95% CI: 81, 95) for AI and 94% (95% CI: 91, 95) for clinicians. There were no statistically significant differences between clinician and AI performance. There were 22 of 42 (52%) studies that were judged to have high risk of bias. Meta-regression identified multiple sources of heterogeneity in the data, including risk of bias and fracture type. Conclusion: Artificial intelligence (AI) and clinicians had comparable reported diagnostic performance in fracture detection, suggesting that AI technology holds promise as a diagnostic adjunct in future clinical practice. (C) RSNA, 2022
引用
收藏
页码:50 / 62
页数:13
相关论文
共 71 条
[1]   Computer vs human: Deep learning versus perceptual training for the detection of neck of femur fractures [J].
Adams, Matthew ;
Chen, Weijia ;
Holcdorf, David ;
McCusker, Mark W. ;
Howe, Piers D. L. ;
Gaillard, Frank .
JOURNAL OF MEDICAL IMAGING AND RADIATION ONCOLOGY, 2019, 63 (01) :27-32
[2]   Trends in Fracture Incidence: A Population-Based Study Over 20 Years [J].
Amin, Shreyasee ;
Achenbach, Sara J. ;
Atkinson, Elizabeth J. ;
Khosla, Sundeep ;
Melton, L. Joseph, III .
JOURNAL OF BONE AND MINERAL RESEARCH, 2014, 29 (03) :581-589
[3]   Sample-Size Determination Methodologies for Machine Learning in Medical Imaging Research: A Systematic Review [J].
Balki, Indranil ;
Amirabadi, Afsaneh ;
Levman, Jacob ;
Martel, Anne L. ;
Emersic, Ziga ;
Meden, Blaz ;
Garcia-Pedrero, Angel ;
Ramirez, Saul C. ;
Kong, Dehan ;
Moody, Alan R. ;
Tyrrell, Pascal N. .
CANADIAN ASSOCIATION OF RADIOLOGISTS JOURNAL-JOURNAL DE L ASSOCIATION CANADIENNE DES RADIOLOGISTES, 2019, 70 (04) :344-353
[4]   2018 ACR Commission on Human Resources Workforce Survey [J].
Bender, Claire E. ;
Bansal, Swati ;
Wolfman, Darcy ;
Parikh, Jay R. .
JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2019, 16 (04) :508-512
[5]   Fracture incidence in adults in relation to age and gender: A study of 27,169 fractures in the Swedish Fracture Register in a well-defined catchment area [J].
Bergh, Camilla ;
Wennergren, David ;
Moller, Michael ;
Brisby, Helena .
PLOS ONE, 2020, 15 (12)
[6]   Femoral neck fracture detection in X-ray images using deep learning and genetic algorithm approaches [J].
Beyaz, Salih ;
Acici, Koray ;
Sumer, Emre .
JOINT DISEASES AND RELATED SURGERY, 2020, 31 (02) :175-183
[7]   Detection and localization of distal radius fractures: Deep learning system versus radiologists [J].
Bluethgen, Christian ;
Becker, Anton S. ;
de Martini, Ilaria Vittoria ;
Meier, Andreas ;
Martini, Katharina ;
Frauenfelder, Thomas .
EUROPEAN JOURNAL OF RADIOLOGY, 2020, 126
[8]   Application of deep learning algorithm to detect and visualize vertebral fractures on plain frontal radiographs [J].
Chen, Hsuan-Yu ;
Hsu, Benny Wei-Yun ;
Yin, Yu-Kai ;
Lin, Feng-Huei ;
Yang, Tsung-Han ;
Yang, Rong-Sen ;
Lee, Chih-Kuo ;
Tseng, Vincent S. .
PLOS ONE, 2021, 16 (01)
[9]   A scalable physician-level deep learning algorithm detects universal trauma on pelvic radiographs [J].
Cheng, Chi-Tung ;
Wang, Yirui ;
Chen, Huan-Wu ;
Hsiao, Po-Meng ;
Yeh, Chun-Nan ;
Hsieh, Chi-Hsun ;
Miao, Shun ;
Xiao, Jing ;
Liao, Chien-Hung ;
Lu, Le .
NATURE COMMUNICATIONS, 2021, 12 (01)
[10]   A Human-Algorithm Integration System for Hip Fracture Detection on Plain Radiography: System Development and Validation Study [J].
Cheng, Chi-Tung ;
Chen, Chih-Chi ;
Cheng, Fu-Jen ;
Chen, Huan-Wu ;
Su, Yi-Siang ;
Yeh, Chun-Nan ;
Chung, I-Fang ;
Liao, Chien-Hung .
JMIR MEDICAL INFORMATICS, 2020, 8 (11)