Extensive phenotype data and machine learning in prediction of mortality in acute coronary syndrome - the MADDEC study

被引:49
作者
Hernesniemi, Jussi A. [1 ,2 ]
Mahdiani, Shadi [1 ,3 ]
Tynkkynen, Juho A. [1 ,4 ]
Lyytikainen, Leo-Pekka [1 ,2 ,5 ,6 ]
Mishra, Pashupati P. [5 ,6 ]
Lehtimaki, Terho [1 ,5 ,6 ]
Eskola, Markku [2 ]
Nikus, Kjell [1 ,2 ]
Antila, Kari [3 ]
Oksala, Niku [1 ,5 ,6 ,7 ]
机构
[1] Tampere Univ, Fac Med & Hlth Technol, Tampere, Finland
[2] Tampere Univ Hosp, Tays Heart Hosp, Dept Cardiol, Ensitie 4, Tampere 33520, Finland
[3] VTT Tech Res Ctr Finland, Tampere, Finland
[4] Kanta Hame Cent Hosp, Dept Radiol, Hameenlinna, Finland
[5] Tampere Univ, Fac Med & Hlth Technol, Fimlab Labs, Dept Clin Chem, Tampere, Finland
[6] Tampere Univ, Fac Med & Hlth Technol, Finnish Cardiovasc Res Ctr Tampere, Tampere, Finland
[7] Tampere Univ Hosp, Vasc & Intervent Radiol Ctr, Tampere, Finland
基金
芬兰科学院; 欧盟地平线“2020”;
关键词
Machine learning; risk factors; mortality; acute coronary syndrome; HOSPITAL DISCHARGE REGISTER; VALIDATION; FAILURE; MODEL;
D O I
10.1080/07853890.2019.1596302
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective: Investigation of the clinical potential of extensive phenotype data and machine learning (ML) in the prediction of mortality in acute coronary syndrome (ACS). Methods: The value of ML and extensive clinical data was analyzed in a retrospective registry study of 9066 consecutive ACS patients (January 2007 to October 2017). Main outcome was six-month mortality. Prediction models were developed using two ML methods, logistic regression and extreme gradient boosting (xgboost). The models were fitted in training set of patients treated in 2007-2014 and 2017 (81%, n = 7344) and validated in a separate validation set of patients treated in 2015-2016 with full GRACE score data available for comparison of model accuracy (19%, n = 1722). Results: Overall, six-month mortality was 7.3% (n = 660). Several variables were found to be significantly associated with six-month mortality by both ML methods. The xgboost scored the best performance: AUC 0.890 (0.864-0.916). The AUC values for logistic regression and GRACE score were 0.867(0.837-0.897) and 0.822 (0.785-0.859), respectively. The AUC value of xgboost was better when compared to logistic regression (p = .012) and GRACE score (p < .00001). Conclusions: The use of extensive phenotype data and novel machine learning improves prediction of mortality in ACS over traditional GRACE score.KEY MESSAGES The collection of extensive cardiovascular phenotype data from electronic health records as well as from data recorded by physicians can be used highly effectively in prediction of mortality after acute coronary syndrome. Supervised machine learning methods such as logistic regression and extreme gradient boosting using extensive phenotype data significantly outperform conventional risk assessment by the current golden standard GRACE score. Integration of electronic health records and the use of supervised machine learning methods can be easily applied in a single centre level to model the risk of mortality.
引用
收藏
页码:156 / 163
页数:8
相关论文
共 27 条
[1]   Rationale and design of the GRACE (Global Registry of Acute Coronary Events) Project:: A multinational registry of patients hospitalized with acute coronary syndromes [J].
Agnelli, G ;
Avezum, A ;
Brieger, D ;
Budaj, A ;
Cannon, CP ;
Goldberg, RJ ;
Goodman, S ;
Gulba, DC ;
Granger, C ;
Kennelly, BM ;
Gurfinkel, E ;
López-Sendón, J ;
Klein, W ;
Montalescot, G ;
Van de Werf, F .
AMERICAN HEART JOURNAL, 2001, 141 (02) :190-199
[2]   Rank-Based Inverse Normal Transformations are Increasingly Used, But are They Merited? [J].
Beasley, T. Mark ;
Erickson, Stephen ;
Allison, David B. .
BEHAVIOR GENETICS, 2009, 39 (05) :580-595
[3]   TIMI, GRACE and alternative risk scores in Acute Coronary Syndromes: A meta-analysis of 40 derivation studies on 216,552 patients and of 42 validation studies on 31,625 patients [J].
D'Ascenzo, Fabrizio ;
Biondi-Zoccai, Giuseppe ;
Moretti, Claudio ;
Bollati, Mario ;
Omede, Pierluigi ;
Sciuto, Filippo ;
Presutti, Davide G. ;
Modena, Maria Grazia ;
Gasparini, Mauro ;
Reed, Matthew J. ;
Sheiban, Imad ;
Gaita, Fiorenzo .
CONTEMPORARY CLINICAL TRIALS, 2012, 33 (03) :507-514
[4]   Prediction of risk of death and myocardial infarction in the six months after presentation with acute coronary syndrome: prospective multinational observational study (GRACE) [J].
Fox, Keith A. A. ;
Dabbous, Omar H. ;
Goldberg, Robert J. ;
Pieper, Karen S. ;
Eagle, Kim A. ;
Van de Werf, Frans ;
Avezum, Alvaro ;
Goodman, Shaun G. ;
Flather, Marcus D. ;
Anderson, Frederick A., Jr. ;
Granger, Christopher B. .
BMJ-BRITISH MEDICAL JOURNAL, 2006, 333 (7578) :1091-1094
[5]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[6]   Stochastic gradient boosting [J].
Friedman, JH .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 38 (04) :367-378
[7]  
Goff DC, 2014, CIRCULATION, V129, pS49, DOI [10.1161/01.cir.0000437741.48606.98, 10.1016/j.jacc.2013.11.005]
[8]   Predictors of hospital mortality in the global registry of acute coronary events [J].
Granger, CB ;
Goldberg, RJ ;
Dabbous, O ;
Pieper, KS ;
Eagle, KA ;
Cannon, CP ;
Van de Werf, F ;
Avezum, A ;
Goodman, SG ;
Flather, MD ;
Fox, KAA .
ARCHIVES OF INTERNAL MEDICINE, 2003, 163 (19) :2345-2353
[9]   Big data from electronic health records for early and late translational cardiovascular research: challenges and potential [J].
Hemingway, Harry ;
Asselbergs, Folkert W. ;
Danesh, John ;
Dobson, Richard ;
Maniadakis, Nikolaos ;
Maggioni, Aldo ;
van Thiel, Ghislaine J. M. ;
Cronin, Maureen ;
Brobert, Gunnar ;
Vardas, Panos ;
Anker, Stefan D. ;
Grobbee, Diederick E. ;
Denaxas, Spiros .
EUROPEAN HEART JOURNAL, 2018, 39 (16) :1481-+
[10]  
Hernesniemi JA, EMBEC NBC 2017 IFMBE, V65