Feature selection and transformation by machine learning reduce variable numbers and improve prediction for heart failure readmission or death

被引:39
作者
Awan, Saqib E. [1 ]
Bennamoun, Mohammed [1 ]
Sohel, Ferdous [1 ,2 ]
Sanfilippo, Frank M. [3 ]
Chow, Benjamin J. [4 ]
Dwivedi, Girish [5 ,6 ,7 ]
机构
[1] Univ Western Australia, Dept Comp Sci & Software Engn, Perth, WA, Australia
[2] Murdoch Univ, Discipline Informat Technol Math & Stat, Perth, WA, Australia
[3] Univ Western Australia, Sch Populat & Global Hlth, Perth, WA, Australia
[4] Univ Ottawa, Heart Inst, Ottawa, ON, Canada
[5] Univ Western Australia, Harry Perkins Inst Med Res, Perth, WA, Australia
[6] Univ Western Australia, Fiona Stanley Hosp, Perth, WA, Australia
[7] Univ Western Australia, Med Sch, Perth, WA, Australia
基金
英国医学研究理事会;
关键词
D O I
10.1371/journal.pone.0218760
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background The prediction of readmission or death after a hospital discharge for heart failure (HF) remains a major challenge. Modern healthcare systems, electronic health records, and machine learning (ML) techniques allow us to mine data to select the most significant variables (allowing for reduction in the number of variables) without compromising the performance of models used for prediction of readmission and death. Moreover, ML methods based on transformation of variables may potentially further improve the performance. Objective To use ML techniques to determine the most relevant and also transform variables for the prediction of 30-day readmission or death in HF patients. Methods We identified all Western Australian patients aged 65 years and above admitted for HF between 2003-2008 in linked administrative data. We evaluated variables associated with HF readmission or death using standard statistical and ML based selection techniques. We also tested the new variables produced by transformation of the original variables. We developed multi-layer perceptron prediction models and compared their predictive performance using metrics such as Area Under the receiver operating characteristic Curve (AUC), sensitivity and specificity. Results Following hospital discharge, the proportion of 30-day readmissions or death was 23.7% in our cohort of 10,757 HF patients. The prediction model developed by us using a smaller set of variables (n = 8) had comparable performance (AUC 0.62) to the traditional model (n = 47, AUC 0.62). Transformation of the original 47 variables further improved (p<0.001) the performance of the predictive model (AUC 0.66). Conclusions A small set of variables selected using ML matched the performance of the model that used the full set of 47 variables for predicting 30-day readmission or death in HF patients. Model performance can be further significantly improved by transforming the original variables using ML methods.
引用
收藏
页数:13
相关论文
共 28 条
[1]  
[Anonymous], SCI INF C SAI 2014
[2]  
[Anonymous], INT ENCY STAT SCI
[3]   Machine learning in heart failure: ready for prime time [J].
Awan, Saqib Ejaz ;
Sohel, Ferdous ;
Sanfilippo, Frank Mario ;
Bennamoun, Mohammed ;
Dwivedi, Girish .
CURRENT OPINION IN CARDIOLOGY, 2018, 33 (02) :190-195
[4]  
Awan SE, 2019, ESC HEART FAILURE
[5]   Feature selection in machine learning: A new perspective [J].
Cai, Jie ;
Luo, Jiawei ;
Wang, Shulin ;
Yang, Sheng .
NEUROCOMPUTING, 2018, 300 :70-79
[6]  
Cateni S, 2013, MULTIVARIATE ANAL MA, P103
[7]   Potentially Avoidable 30-Day Hospital Readmissions in Medical Patients Derivation and Validation of a Prediction Model [J].
Donze, Jacques ;
Aujesky, Drahomir ;
Williams, Deborah ;
Schnipper, Jeffrey L. .
JAMA INTERNAL MEDICINE, 2013, 173 (08) :632-638
[8]   A comparison of models for predicting early hospital readmissions [J].
Futoma, Joseph ;
Morris, Jonathan ;
Lucas, Joseph .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 56 :229-238
[9]   Long-term use and cost-effectiveness of secondary prevention drugs for heart disease in Western Australian seniors (WAMACH): a study protocol [J].
Gunnell, Anthony S. ;
Knuiman, Matthew W. ;
Geelhoed, Elizabeth ;
Hobbs, Michael S. T. ;
Katzenellenbogen, Judith M. ;
Hung, Joseph ;
Rankin, Jamie M. ;
Nedkoff, Lee ;
Briffa, Thomas G. ;
Ortiz, Michael ;
Gillies, Malcolm ;
Cordingley, Anne ;
Messer, Mitch ;
Gardner, Christian ;
Lopez, Derrick ;
Atkins, Emily ;
Mai, Qun ;
Sanfilippo, Frank M. .
BMJ OPEN, 2014, 4 (09)
[10]   The Hospital Readmissions Reduction Program-learning from failure of a healthcare policy [J].
Gupta, Ankur ;
Fonarow, Gregg C. .
EUROPEAN JOURNAL OF HEART FAILURE, 2018, 20 (08) :1169-1174