Feature Engineering for ICU Mortality Prediction Based on Hourly to Bi-Hourly Measurements

被引:12
作者
Amer, Ahmed Y. A. [1 ,2 ]
Vranken, Julie [3 ,4 ,5 ]
Wouters, Femke [3 ,4 ,5 ]
Mesotten, Dieter [3 ,4 ,5 ]
Vandervoort, Pieter [3 ,4 ,5 ]
Storms, Valerie [3 ,4 ,5 ]
Luca, Stijn [6 ]
Vanrumste, Bart [1 ]
Aerts, Jean-Marie [2 ]
机构
[1] Katholieke Univ Leuven, E MEDIA, Dept Elect Engn ESAT STADIUS, ESAT TC, Campus Grp T, B-3000 Leuven, Belgium
[2] Katholieke Univ Leuven, Measure Model & Manage Bioresponses M3 BIORES, Dept Biosyst, B-3000 Leuven, Belgium
[3] Hasselt Univ, Fac Med & Life Sci, B-3500 Hasselt, Belgium
[4] Ziekenhuis Oost Limburg, Dept Anesthesiol, Dept Cardiol, B-3600 Genk, Belgium
[5] Dept Future Hlth, B-3600 Genk, Belgium
[6] Univ Ghent, Dept Data Anal & Math Modelling, B-9000 Ghent, Belgium
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 17期
关键词
feature engineering; intensive care unit; mortality prediction; hard-margin support vector machines; ACUTE INFLAMMATORY RESPONSE; REDUCED MATHEMATICAL-MODEL; LOW PULSE PRESSURE; SEPSIS;
D O I
10.3390/app9173525
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Mortality prediction for intensive care unit (ICU) patients is a challenging problem that requires extracting discriminative and informative features. This study presents a proof of concept for exploring features that can provide clinical insight. Through a feature engineering approach, it is attempted to improve ICU mortality prediction in field conditions with low frequently measured data (i.e., hourly to bi-hourly). Features are explored by investigating the vital signs measurements of ICU patients, labelled with mortality or survival at discharge. The vital signs of interest in this study are heart and respiration rate, oxygen saturation and blood pressure. The latter comprises systolic, diastolic and mean arterial pressure. In the feature exploration process, it is aimed to extract simple and interpretable features that can provide clinical insight. For this purpose, a classifier is required that maximises the margin between the two classes (i.e., survival and mortality) with minimum tolerance to misclassification errors. Moreover, it preferably has to provide a linear decision surface in the original feature space without mapping to an unlimited dimensionality feature space. Therefore, a linear hard margin support vector machine (SVM) classifier is suggested. The extracted features are grouped in three categories: statistical, dynamic and physiological. Each category plays an important role in enhancing classification error performance. After extracting several features within the three categories, a manual feature fine-tuning is applied to consider only the most efficient features. The final classification, considering mortality as the positive class, resulted in an accuracy of 91.56%, sensitivity of 90.59%, precision of 86.52% and F-1-score of 88.50%. The obtained results show that the proposed feature engineering approach and the extracted features are valid to be considered and further enhanced for the mortality prediction purpose. Moreover, the proposed feature engineering approach moved the modelling methodology from black-box modelling to grey-box modelling in combination with the powerful classifier of SVMs.
引用
收藏
页数:17
相关论文
共 37 条
[1]   From data patterns to mechanistic models in acute critical illness [J].
Aerts, Jean-Marie ;
Haddad, Wassim M. ;
An, Gary ;
Vodovotz, Yoram .
JOURNAL OF CRITICAL CARE, 2014, 29 (04) :604-610
[2]  
Akin S, 2018, EUR HEART J, V39, P5690
[3]   A widened pulse pressure: a potential valuable prognostic indicator of mortality in patients with sepsis [J].
Al-khalisy, Hassan ;
Nikiforov, Ivan ;
Jhajj, Manjit ;
Kodali, Namratha ;
Cheriyath, Pramil .
JOURNAL OF COMMUNITY HOSPITAL INTERNAL MEDICINE PERSPECTIVES, 2015, 5 (06)
[4]  
Alves T, 2018, IEEE INT CONF BIG DA, P1328, DOI 10.1109/BigData.2018.8621927
[5]  
[Anonymous], 2012, Learning_from_data
[6]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[7]   Unravelling post-ICU mortality: predictors and causes of death [J].
Braber, Annemarije ;
van Zanten, Arthur R. H. .
EUROPEAN JOURNAL OF ANAESTHESIOLOGY, 2010, 27 (05) :486-490
[8]   A Database-driven Decision Support System: Customized Mortality Prediction [J].
Celi, Leo Anthony ;
Galvin, Sean ;
Davidzon, Guido ;
Lee, Joon ;
Scott, Daniel ;
Mark, Roger .
JOURNAL OF PERSONALIZED MEDICINE, 2012, 2 (04) :138-148
[9]   A Clinical Database-Driven Approach to Decision Support: Predicting Mortality Among Patients with Acute Kidney Injury [J].
Celi, Leo Anthony G. ;
Tang, Robin J. ;
Villarroel, Mauricio C. ;
Davidzon, Guido A. ;
Lester, William T. ;
Chueh, Henry C. .
JOURNAL OF HEALTHCARE ENGINEERING, 2011, 2 (01) :97-109
[10]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411