Machine learning improves mortality risk prediction after cardiac surgery Systematic review and meta-analysis

被引:77
作者
Benedetto, Umberto [1 ]
Dimagli, Arnaldo [1 ]
Sinha, Shubhra [1 ]
Cocomello, Lucia [1 ]
Gibbison, Ben [1 ]
Caputo, Massimo [1 ]
Gaunt, Tom [2 ]
Lyon, Matt [2 ]
Holmes, Chris [3 ]
Angelini, Gianni D. [1 ]
机构
[1] Univ Bristol, Bristol Heart Inst, Dept Translat Hlth Sci, London, England
[2] Univ Bristol, Populat Hlth Sci, London, England
[3] Univ Oxford, Dept Stat, Oxford, England
基金
英国医学研究理事会;
关键词
risk model; prediction; mortality; machine learning; logistic regression; meta-analysis; ARTIFICIAL NEURAL-NETWORKS; BYPASS GRAFT-SURGERY; EUROSCORE II; LOGISTIC-REGRESSION; SOCIETY; MODELS; PERFORMANCE; VENTILATION;
D O I
10.1016/j.jtcvs.2020.07.105
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Interest in the usefulness of machine learning (ML) methods for outcomes prediction has continued to increase in recent years. However, the advantage of advanced ML model over traditional logistic regression (LR) remains controversial. We performed a systematic review and meta-analysis of studies comparing the discrimination accuracy between ML models versus LR in predicting operative mortality following cardiac surgery. Methods: The present systematic review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analysis statement. Discrimination ability was assessed using the C-statistic. Pooled C-statistics and its 95% credibility interval for ML models and LR were obtained were obtained using a Bayesian framework. Pooled estimates for ML models and LR were compared to inform on difference between the 2 approaches. Results: We identified 459 published citations of which 15 studies met inclusion criteria and were used for the quantitative and qualitative analysis. When the best ML model from individual study was used, meta-analytic estimates showed that ML were associated with a significantly higher C-statistic (ML, 0.88; 95% credibility interval, 0.83-0.93 vs LR, 0.81; 95% credibility interval, 0.77-0.85; P = .03). When individual ML algorithms were instead selected, we found a nonsignificant trend toward better prediction with each of ML algorithms. We found no evidence of publication bias (P = .70). Conclusions: The present findings suggest that when compared with LR, ML models provide better discrimination in mortality prediction after cardiac surgery. However, the magnitude and clinical influence of such an improvement remains uncertain.
引用
收藏
页码:2075 / +
页数:22
相关论文
共 50 条
[1]   Comparison of EuroSCORE II, Original EuroSCORE, and The Society of Thoracic Surgeons Risk Score in Cardiac Surgery Patients [J].
Ad, Niv ;
Holmes, Sari D. ;
Patel, Jay ;
Pritchard, Graciela ;
Shuman, Deborah J. ;
Halpin, Linda .
ANNALS OF THORACIC SURGERY, 2016, 102 (02) :573-579
[2]   A Comparison of a Machine Learning Model with EuroSCORE II in Predicting Mortality after Elective Cardiac Surgery: A Decision Curve Analysis [J].
Allyn, Jerome ;
Allou, Nicolas ;
Augustin, Pascal ;
Philip, Ivan ;
Martinet, Olivier ;
Belghiti, Myriem ;
Provenchere, Sophie ;
Montravers, Philippe ;
Ferdynus, Cyril .
PLOS ONE, 2017, 12 (01)
[3]   A Database-driven Decision Support System: Customized Mortality Prediction [J].
Celi, Leo Anthony ;
Galvin, Sean ;
Davidzon, Guido ;
Lee, Joon ;
Scott, Daniel ;
Mark, Roger .
JOURNAL OF PERSONALIZED MEDICINE, 2012, 2 (04) :138-148
[4]  
Chong Chee-Fah, 2003, AMIA Annu Symp Proc, P160
[5]   A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models [J].
Christodoulou, Evangelia ;
Ma, Jie ;
Collins, Gary S. ;
Steyerberg, Ewout W. ;
Verbakel, Jan Y. ;
Van Calster, Ben .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2019, 110 :12-22
[6]   A framework for meta-analysis of prediction model studies with binary and time-to-event outcomes [J].
Debray, Thomas P. A. ;
Damen, Johanna A. A. G. ;
Riley, Richard D. ;
Snell, Kym ;
Reitsma, Johannes B. ;
Hooft, Lotty ;
Collins, Gary S. ;
Moons, Karel G. M. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2019, 28 (09) :2768-2786
[7]   Artificial neural networks [J].
Drew, PJ ;
Monson, JRT .
SURGERY, 2000, 127 (01) :3-11
[8]   Two new mathematical models for prediction of early mortality risk in coronary artery bypass graft surgery [J].
Ghavidel, Alireza Alizadeh ;
Javadikasgari, Hoda ;
Maleki, Majid ;
Karbassi, Arsha ;
Omrani, Gholamreza ;
Noohi, Feridoun .
JOURNAL OF THORACIC AND CARDIOVASCULAR SURGERY, 2014, 148 (04) :1291-+
[9]   The diagnostic odds ratio: a single indicator of test performance [J].
Glas, AS ;
Lijmer, JG ;
Prins, MH ;
Bonsel, GJ ;
Bossuyt, PMM .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2003, 56 (11) :1129-1135
[10]   An Empirical Overview of the No Free Lunch Theorem and Its Effect on Real-World Machine Learning Classification [J].
Gomez, David ;
Rojas, Alfonso .
NEURAL COMPUTATION, 2016, 28 (01) :216-228