Early Sepsis Prediction Using Ensemble Learning With Deep Features and Artificial Features Extracted From Clinical Electronic Health Records

Cited by: 23
Authors
He, Zhengling [1,2]
Du, Lidong [2,3]
Zhang, Pengfei [1,2]
Zhao, Rongjian [2,3]
Chen, Xianxiang [1,2,3]
Fang, Zhen [1,2,3]
Affiliations
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, State Key Lab Transducer Technol, Beijing, Peoples R China
[3] Chinese Acad Med Sci, Res Unit Personalized Management Chron Resp Dis, Beijing, Peoples R China
Keywords
clinical electronic health records; ensemble learning; long short-term memory neural network; sepsis
DOI
10.1097/CCM.0000000000004644
Chinese Library Classification number
R4 [Clinical Medicine]
Discipline classification codes
1002; 100602
Abstract
Objectives: Sepsis is caused by infection and a subsequent overreaction of the immune system, and it severely threatens human life. Early prediction is important for the treatment of sepsis. This report aims to develop a method that predicts sepsis 6 hours ahead on the basis of clinical electronic health records. Data Sources: The challenge data were released by the PhysioNet/Computing in Cardiology Challenge 2019 and were obtained from ICU patients in three separate hospital systems. Part of the data from two datasets, covering 40,336 subjects, is publicly available, and the remainder is used as a hidden test set. A normalized utility score defined by the organizing committee is used to evaluate model performance. Study Selection: Supervised machine learning is applied to tackle this challenge. Specifically, we establish the prediction model under an ensemble learning framework by integrating artificial features based on clinical prior knowledge of sepsis with deep features automatically extracted by a long short-term memory neural network. Data Extraction: Forty clinical variables, including eight vital signs, 26 laboratory values, and six demographics, were measured and recorded once an hour for each individual, and a binary label (0 or 1) was provided for each record. Data Synthesis: The proposed model was evaluated by 30-fold cross-validation. The sensitivity, specificity, and normalized utility score were 0.641 ± 0.022, 0.844 ± 0.007, and 0.401 ± 0.019, respectively, on the publicly available datasets. The final normalized utility score obtained by our team (UCAS_DataMiner) was 0.313 on the full hidden test set (0.406, 0.373, and -0.215 on test sets A, B, and C, respectively). Conclusions: We achieved prediction of sepsis onset 6 hours ahead on the basis of clinical electronic health records by ensemble learning. The results indicate that the proposed model performs well in the early prediction of sepsis. In particular, ensemble learning yielded a significant (p < 0.01) improvement in performance over every single model.
Pages: E1337-E1342
Number of pages: 6
Related papers
15 in total
[1]  
[Anonymous], 2013, International Journal of Computer Science and Network
[2]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[3]   Prediction of Sepsis in the Intensive Care Unit With Minimal Electronic Health Record Data: A Machine Learning Approach [J].
Desautels, Thomas ;
Calvert, Jacob ;
Hoffman, Jana ;
Jay, Melissa ;
Kerem, Yaniv ;
Shieh, Lisa ;
Shimabukuro, David ;
Chettipally, Uli ;
Feldman, Mitchell D. ;
Barton, Chris ;
Wales, David J. ;
Das, Ritankar .
JMIR MEDICAL INFORMATICS, 2016, 4 (03) :67-81
[4]  
Hochreiter S., 1997, Neural Computation, V9, P1735
[5]  
Ke GL, 2017, ADV NEUR IN, V30
[6]   Duration of hypotension before initiation of effective antimicrobial therapy is the critical determinant of survival in human septic shock [J].
Kumar, Anand ;
Roberts, Daniel ;
Wood, Kenneth E. ;
Light, Bruce ;
Parrillo, Joseph E. ;
Sharma, Satendra ;
Suppes, Robert ;
Feinstein, Daniel ;
Zanotti, Sergio ;
Taiberg, Leo ;
Gurka, David ;
Kumar, Aseem ;
Cheang, Mary .
CRITICAL CARE MEDICINE, 2006, 34 (06) :1589-1596
[7]  
Nemati S, 2018, CRIT CARE MED, V46, P547, DOI [10.1097/CCM.0000000000002936, 10.1097/ccm.0000000000002936]
[8]   A Survey on Transfer Learning [J].
Pan, Sinno Jialin ;
Yang, Qiang .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (10) :1345-1359
[9]   Early Prediction of Sepsis From Clinical Data: The PhysioNet/Computing in Cardiology Challenge 2019 [J].
Reyna, Matthew A. ;
Josef, Christopher S. ;
Jeter, Russell ;
Shashikumar, Supreeth P. ;
Westover, M. Brandon ;
Nemati, Shamim ;
Clifford, Gari D. ;
Sharma, Ashish .
CRITICAL CARE MEDICINE, 2020, 48 (02) :210-217
[10]   Ensemble learning: A survey [J].
Sagi, Omer ;
Rokach, Lior .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (04)