Machine learning prediction models in orthopedic surgery: A systematic review in transparent reporting

Cited by: 49
Authors
Groot, Olivier Q. [1]
Ogink, Paul T. [2]
Lans, Amanda [1]
Twining, Peter K. [1]
Kapoor, Neal D. [1]
DiGiovanni, William [1]
Bindels, Bas J. J. [2]
Bongers, Michiel E. R. [1]
Oosterhoff, Jacobien H. F. [1]
Karhade, Aditya V. [1]
Oner, F. C. [2]
Verlaan, Jorrit-Jan [2]
Schwab, Joseph H. [1]
Affiliations
[1] Harvard Med Sch, Massachusetts Gen Hosp, Dept Orthoped Surg, Orthoped Oncol Serv, 55 Fruit St, Boston, MA 02114 USA
[2] Univ Utrecht, Univ Med Ctr Utrecht, Dept Orthoped Surg, Utrecht, Netherlands
Keywords
machine learning; orthopedics; prediction models; randomized trials; quality; statement; curve; risk; tool
DOI
10.1002/jor.25036
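Chinese Library Classification (CLC) number
R826.8 [Plastic surgery]; R782.2 [Oral and maxillofacial plastic surgery]; R726.2 [Pediatric plastic surgery]; R62 [Plastic surgery (reconstructive surgery)]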
Subject classification code
Abstract
Machine learning (ML) studies are becoming increasingly popular in orthopedics but lack a critical appraisal of their adherence to peer-reviewed guidelines. The objective of this review was to (1) evaluate the quality and transparent reporting of ML prediction models in orthopedic surgery based on the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) statement, and (2) assess risk of bias with the Prediction model Risk Of Bias ASsessment Tool (PROBAST). A systematic review was performed to identify all ML prediction studies published in orthopedic surgery through June 18, 2020. After screening 7138 studies, 59 studies met the study criteria and were included. Two reviewers independently extracted data, and discrepancies were resolved by discussion with at least two additional reviewers present. Across all studies, the overall median completeness for the TRIPOD checklist was 53% (interquartile range 47%-60%). The overall risk of bias was low in 44% (n = 26), high in 41% (n = 24), and unclear in 15% (n = 9). High overall risk of bias was driven by incomplete reporting of performance measures, inadequate handling of missing data, and use of small datasets with inadequate outcome numbers. Although the number of ML studies in orthopedic surgery is increasing rapidly, over 40% of the existing models are at high risk of bias. Furthermore, over half incompletely reported their methods and/or performance measures. Until these issues are adequately addressed to give patients and providers trust in ML models, a considerable gap remains between the development of ML prediction models and their implementation in orthopedic practice.
Pages: 475-483
Page count: 9
Related papers
30 in total
[1]   Impact of the mandatory implementation of reporting guidelines on reporting quality in a surgical journal: A before and after study [J].
Agha, Riaz Ahmed ;
Fowler, Alexander J. ;
Limb, Christopher ;
Whitehurst, Katharine ;
Coe, Robert ;
Sagoo, Harkiran ;
Jafree, Daniyal J. ;
Chandrakumar, Charmilie ;
Gundogan, Buket .
INTERNATIONAL JOURNAL OF SURGERY, 2016, 30 :169-172
[2]   Epidemiology and reporting of randomised trials published in PubMed journals [J].
Chan, AW ;
Altman, DG .
LANCET, 2005, 365 (9465) :1159-1162
[3]   Reporting of artificial intelligence prediction models [J].
Collins, Gary S. ;
Moons, Karel G. M. .
LANCET, 2019, 393 (10181) :1577-1579
[4]   Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD): The TRIPOD Statement [J].
Collins, Gary S. ;
Reitsma, Johannes B. ;
Altman, Douglas G. ;
Moons, Karel G. M. .
EUROPEAN UROLOGY, 2015, 67 (06) :1142-1151
[5]   Use and misuse of the receiver operating characteristic curve in risk prediction [J].
Cook, Nancy R. .
CIRCULATION, 2007, 115 (07) :928-935
[6]   Impact and perceived value of journal reporting guidelines among Radiology authors and reviewers [J].
Dewey, Marc ;
Levine, Deborah ;
Bossuyt, Patrick M. ;
Kressel, Herbert Y. .
EUROPEAN RADIOLOGY, 2019, 29 (08) :3986-3995
[7]   du Sert N.P., 2020, BRIT J PHARMACOL, V4, DOI 10.1111/bph.15193
[8]   Does Artificial Intelligence Outperform Natural Intelligence in Interpreting Musculoskeletal Radiological Studies? A Systematic Review [J].
Groot, Olivier Q. ;
Bongers, Michiel E. R. ;
Ogink, Paul T. ;
Senders, Joeky T. ;
Karhade, Aditya V. ;
Bramer, Jos A. M. ;
Verlaan, Jorrit-Jan ;
Schwab, Joseph H. .
CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2020, 478 (12) :2751-2764
[9]   Poor reporting of multivariable prediction model studies: towards a targeted implementation strategy of the TRIPOD statement [J].
Heus, Pauline ;
Damen, Johanna A. A. G. ;
Pajouheshnia, Romin ;
Scholten, Rob J. P. M. ;
Reitsma, Johannes B. ;
Collins, Gary S. ;
Altman, Douglas G. ;
Moons, Karel G. M. ;
Hooft, Lotty .
BMC MEDICINE, 2018, 16
[10]   Comparative repeatability of guide-pin axis positioning in computer-assisted and manual femoral head resurfacing arthroplasty [J].
Hodgson, A. ;
Helmy, N. ;
Masri, B. A. ;
Greidanus, N. V. ;
Inkpen, K. B. ;
Duncan, C. P. ;
Garbuz, D. S. ;
Anglin, C. .
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART H-JOURNAL OF ENGINEERING IN MEDICINE, 2007, 221 (H7) :713-724