Machine learning can predict survival of patients with heart failure from serum creatinine and ejection fraction alone

被引:249
作者
Chicco, Davide [1 ]
Jurman, Giuseppe [2 ]
机构
[1] Krembil Res Inst, Toronto, ON, Canada
[2] Fdn Bruno Kessler, Trento, Italy
关键词
Cardiovascular heart diseases; Heart failure; Serum creatinine; Ejection fraction; Medical records; Feature ranking; Feature selection; Biostatistics; Machine learning; Data mining; Biomedical informatics; LOGISTIC-REGRESSION; AMBULATORY PATIENTS; CLASSIFICATION; MORTALITY; DIAGNOSIS; SCORE; SELECTION; OUTCOMES; MODELS; RULES;
D O I
10.1186/s12911-020-1023-5
中图分类号
R-058 [];
学科分类号
摘要
Background Cardiovascular diseases kill approximately 17 million people globally every year, and they mainly exhibit as myocardial infarctions and heart failures. Heart failure (HF) occurs when the heart cannot pump enough blood to meet the needs of the body.Available electronic medical records of patients quantify symptoms, body features, and clinical laboratory test values, which can be used to perform biostatistics analysis aimed at highlighting patterns and correlations otherwise undetectable by medical doctors. Machine learning, in particular, can predict patients' survival from their data and can individuate the most important features among those included in their medical records. Methods In this paper, we analyze a dataset of 299 patients with heart failure collected in 2015. We apply several machine learning classifiers to both predict the patients survival, and rank the features corresponding to the most important risk factors. We also perform an alternative feature ranking analysis by employing traditional biostatistics tests, and compare these results with those provided by the machine learning algorithms. Since both feature ranking approaches clearly identify serum creatinine and ejection fraction as the two most relevant features, we then build the machine learning survival prediction models on these two factors alone. Results Our results of these two-feature models show not only that serum creatinine and ejection fraction are sufficient to predict survival of heart failure patients from medical records, but also that using these two features alone can lead to more accurate predictions than using the original dataset features in its entirety. We also carry out an analysis including the follow-up month of each patient: even in this case, serum creatinine and ejection fraction are the most predictive clinical features of the dataset, and are sufficient to predict patients' survival. Conclusions This discovery has the potential to impact on clinical practice, becoming a new supporting tool for physicians when predicting if a heart failure patient will survive or not. Indeed, medical doctors aiming at understanding if a patient will survive after heart failure may focus mainly on serum creatinine and ejection fraction.
引用
收藏
页数:16
相关论文
共 116 条
[1]   Survival analysis of heart failure patients: A case study [J].
Ahmad, Tanvir ;
Munir, Assia ;
Bhatti, Sajjad Haider ;
Aftab, Muhammad ;
Raza, Muhammad Ali .
PLOS ONE, 2017, 12 (07)
[2]   Machine Learning Methods Improve Prognostication, Identify Clinically Distinct Phenotypes, and Detect Heterogeneity in Response to Therapy in a Large Cohort of Heart Failure Patients [J].
Ahmad, Tariq ;
Lund, Lars H. ;
Rao, Pooja ;
Ghosh, Rohit ;
Warier, Prashant ;
Vaccaro, Benjamin ;
Dahlstrom, Ulf ;
O'Connor, Christopher M. ;
Felker, G. Michael ;
Desai, Nihar R. .
JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2018, 7 (08)
[3]   Effect of elevated admission serum creatinine and its worsening on outcome in hospitalized patients with decompensated heart failure [J].
Akhter, MW ;
Aronson, D ;
Bitar, F ;
Khan, S ;
Singh, H ;
Singh, RP ;
Burger, AJ ;
Elkayam, U .
AMERICAN JOURNAL OF CARDIOLOGY, 2004, 94 (07) :957-+
[4]   Clinical applications of machine learning in cardiovascular disease and its relevance to cardiac imaging [J].
Al'Aref, Subhi J. ;
Anchouche, Khalil ;
Singh, Gurpreet ;
Slomka, Piotr J. ;
Kolli, Kranthi K. ;
Kumar, Amit ;
Pandey, Mohit ;
Maliakal, Gabriel ;
van Rosendael, Alexander R. ;
Beecy, Ashley N. ;
Berman, Daniel S. ;
Leipsic, Jonathan ;
Nieman, Koen ;
Andreini, Daniele ;
Pontone, Gianluca ;
Schoepf, U. Joseph ;
Shaw, Leslee J. ;
Chang, Hyuk-Jae ;
Narula, Jagat ;
Bax, Jeroen J. ;
Guan, Yuanfang ;
Min, James K. .
EUROPEAN HEART JOURNAL, 2019, 40 (24) :1975-+
[5]   Risk Prediction Models for Mortality in Ambulatory Patients With Heart Failure A Systematic Review [J].
Alba, Ana C. ;
Agoritsas, Thomas ;
Jankowski, Milosz ;
Courvoisier, Delphine ;
Walter, Stephen D. ;
Guyatt, Gordon H. ;
Ross, Heather J. .
CIRCULATION-HEART FAILURE, 2013, 6 (05) :881-889
[6]   Improving support vector machine classifiers by modifying kernel functions [J].
Amari, S ;
Wu, S .
NEURAL NETWORKS, 1999, 12 (06) :783-789
[7]   Cardiovascular Event Prediction by Machine Learning The Multi-Ethnic Study of Atherosclerosis [J].
Ambale-Venkatesh, Bharath ;
Yang, Xiaoying ;
Wu, Colin O. ;
Liu, Kiang ;
Hundley, W. Gregory ;
McClelland, Robyn ;
Gomes, Antoinette S. ;
Folsom, Aaron R. ;
Shea, Steven ;
Guallar, Eliseo ;
Bluemke, David A. ;
Lima, Joao A. C. .
CIRCULATION RESEARCH, 2017, 121 (09) :1092-+
[8]  
[Anonymous], RANDOM FORESTS GINI
[9]  
[Anonymous], SURVIVAL ANAL HEART
[10]  
[Anonymous], HIST ACAD ROYALE SCI