Algorithm Versus Expert: Machine Learning Versus Surgeon-Predicted Symptom Improvement After Carpal Tunnel Release

被引:1
作者
Loos, Nina Louisa [1 ,2 ]
Hoogendam, Lisa [1 ,2 ,3 ]
Souer, John Sebastiaan [3 ]
van Uchelen, Jeroen Hein [3 ]
Slijper, Harm Pieter [3 ]
Wouters, Robbert Maarten [1 ,2 ]
Selles, Ruud Willem [1 ,2 ]
机构
[1] Erasmus MC, Dept Rehabil Med, Room EE-1589,POB 2040, NL-3000 CA Rotterdam, Netherlands
[2] Erasmus MC, Dept Plast & Reconstruct Surg & Hand Surg, Room EE-1589,POB 2040, NL-3000 CA Rotterdam, Netherlands
[3] Xpert Clin, Hand & Wrist Ctr, Eindhoven, Netherlands
关键词
Prediction; Carpal tunnel release; BCTQ; Hand surgery; Machine learning; Carpal tunnel syndrome; CLINICIAN PREDICTION; SURVIVAL; OUTCOMES; THERAPY; SCALE;
D O I
10.1227/neu.0000000000002848
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
BACKGROUND AND OBJECTIVES: Surgeons rely on clinical experience when making predictions about treatment effects. Incorporating algorithm-based predictions of symptom improvement after carpal tunnel release (CTR) could support medical decision-making. However, these algorithm-based predictions need to outperform predictions made by surgeons to add value. We compared predictions of a validated prediction model for symptom improvement after CTR with predictions made by surgeons. METHODS: This cohort study included 97 patients scheduled for CTR. Preoperatively, surgeons estimated each patient's probability of improvement 6 months after surgery, defined as reaching the minimally clinically important difference on the Boston Carpal Tunnel Syndrome Symptom Severity Score. We assessed model and surgeon performance using calibration (calibration belts), discrimination (area under the curve [AUC]), sensitivity, and specificity. In addition, we assessed the net benefit of decision-making based on the prediction model's estimates vs the surgeon's judgement. RESULTS: The surgeon predictions had poor calibration and suboptimal discrimination (AUC 0.62, 95%-CI 0.49-0.74), while the prediction model showed good calibration and appropriate discrimination (AUC 0.77, 95%-CI 0.66-0.89, P = .05). The accuracy of surgeon predictions was 0.65 (95%-CI 0.37-0.78) vs 0.78 (95%-CI 0.67-0.89) for the prediction model (P = .03). The sensitivity of surgeon predictions and the prediction model was 0.72 (95%-CI 0.15-0.96) and 0.85 (95%-CI 0.62-0.97), respectively (P = .04). The specificity of the surgeon predictions was similar to the model's specificity (P = .25). The net benefit analysis showed better decision-making based on the prediction model compared with the surgeons' decision-making (ie, more correctly predicted improvements and/or fewer incorrectly predicted improvements). CONCLUSION: The prediction model outperformed surgeon predictions of improvement after CTR in terms of calibration, accuracy, and sensitivity. Furthermore, the net benefit analysis indicated that using the prediction model instead of relying solely on surgeon decision-making increases the number of patients who will improve after CTR, without increasing the number of unnecessary surgeries.
引用
收藏
页码:110 / 117
页数:8
相关论文
共 36 条
[1]   The Brief Illness Perception Questionnaire [J].
Broadbent, Elizabeth ;
Petrie, Keith J. ;
Main, Jodie ;
Weinman, John .
JOURNAL OF PSYCHOSOMATIC RESEARCH, 2006, 60 (06) :631-637
[2]   Machine Learning Improves Upon Clinicians' Prediction of End Stage Kidney Disease [J].
Chuah, Aaron ;
Walters, Giles ;
Christiadi, Daniel ;
Karpe, Krishna ;
Kennard, Alice ;
Singer, Richard ;
Talaulikar, Girish ;
Ge, Wenbo ;
Suominen, Hanna ;
Andrews, T. Daniel ;
Jiang, Simon .
FRONTIERS IN MEDICINE, 2022, 9
[3]   Cardiovascular Disease Risk Assessment: Insights from Framingham [J].
D'Agostino, Ralph B. ;
Pencina, Michael J. ;
Massaro, Joseph M. ;
Coady, Sean .
GLOBAL HEART, 2013, 8 (01) :11-23
[4]   Which Factors Are Associated With Satisfaction With Treatment Results in Patients With Hand and Wrist Conditions? A Large Cohort Analysis [J].
De Ridder, Willemijn Anna ;
Wouters, Robbert Maarten ;
Hoogendam, Lisa ;
Vermeulen, Guus Maarten ;
Slijper, Harm Pieter ;
Selles, Ruud Willem .
CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2022, 480 (07) :1287-1301
[5]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[6]   Psychometric properties of the credibility/expectancy questionnaire [J].
Devilly, GJ ;
Borkovec, TD .
JOURNAL OF BEHAVIOR THERAPY AND EXPERIMENTAL PSYCHIATRY, 2000, 31 (02) :73-86
[7]   A Comparison of the Accuracy of Clinician Prediction of Survival Versus the Palliative Prognostic Index [J].
Farinholt, Paige ;
Park, Minjeong ;
Guo, Ying ;
Bruera, Eduardo ;
Hui, David .
JOURNAL OF PAIN AND SYMPTOM MANAGEMENT, 2018, 55 (03) :792-797
[8]   Factors Affecting Willingness to Undergo Carpal Tunnel Release [J].
Gong, Hyun Sik ;
Baek, Goo Hyun ;
Oh, Joo Han ;
Lee, Young Ho ;
Jeon, Suk Ha ;
Chung, Moon Sang .
JOURNAL OF BONE AND JOINT SURGERY-AMERICAN VOLUME, 2009, 91A (09) :2130-2136
[9]   THE MEANING AND USE OF THE AREA UNDER A RECEIVER OPERATING CHARACTERISTIC (ROC) CURVE [J].
HANLEY, JA ;
MCNEIL, BJ .
RADIOLOGY, 1982, 143 (01) :29-36
[10]  
Hoogendam L., Prediction of Symptom Improvement After Carpal Tunnel Surgery