Learning from class-imbalance and heterogeneous data for 30-day hospital readmission

被引:19
作者
Du, Guodong [1 ]
Zhang, Jia [1 ]
Li, Shaozi [1 ]
Li, Candong [2 ]
机构
[1] Xiamen Univ, Dept Artificial Intelligence, Xiamen 361005, Peoples R China
[2] Fujian Univ Tradit Chinese Med, Coll Tradit Chinese Med, Fuzhou 350122, Peoples R China
关键词
30-day readmission prediction; Heterogeneous data; Class-imbalance data; Sample weight learning; Large margin property; FEATURE-SELECTION; PREDICTION; FRAMEWORK; MODELS; TIME;
D O I
10.1016/j.neucom.2020.08.064
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting 30-day hospital readmission is a core research task in the development of personalized healthcare. However, the imbalanced class distribution and the heterogeneity of electronic health records are the major challenges to establish an effective machine learning model for this task. To overcome these issues, we propose a new 30-day readmission prediction algorithm to improve the performance. First, we solve the problem of class-imbalance readmission prediction by learning sample weights based on hypothesis margin loss. At the same time, we consider the character of data heterogeneity, and learn the weights of heterogeneous data sources to improve the generalization ability. Based on this, we construct an optimization framework, which involves two variables, i.e., sample weights and source weights. By iterative optimization, we obtain the prediction result for readmission. Finally, we conduct experiments on three real-world readmission datasets to verify the effectiveness of the proposed method. The experimental results show that the proposed algorithm has the advantages to deal with the task of 30-day hospital readmission prediction. (C) 2020 Published by Elsevier B.V.
引用
收藏
页码:27 / 35
页数:9
相关论文
共 58 条
[11]   Prediction of 30-Day Readmission: An Improved Gradient Boosting Decision Tree Approach [J].
Du, Guodong ;
Ma, Lei ;
Hu, Jin-Shan ;
Zhang, Junpeng ;
Xiang, Yan ;
Shao, Dangguo ;
Wang, Hongbin .
JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2019, 9 (03) :620-627
[12]  
Dumpala SH, 2018, IJCAI, P2100
[13]   A comparison of models for predicting early hospital readmissions [J].
Futoma, Joseph ;
Morris, Jonathan ;
Lucas, Joseph .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 56 :229-238
[14]   StageNet: Stage-Aware Neural Networks for Health Risk Prediction [J].
Gao, Junyi ;
Xiao, Cao ;
Wang, Yasha ;
Tang, Wen ;
Glass, Lucas M. ;
Sun, Jimeng .
WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, :530-540
[15]   Neural and statistical predictors for time to readmission in emergency departments: A case study [J].
Garmendia, Asier ;
Grana, Manuel ;
Manuel Lopez-Guede, Jose ;
Rios, Sebastian .
NEUROCOMPUTING, 2019, 354 :3-9
[16]  
Gilad-Bachrach R., 2004, MACH LEARN P 21 INT, P1
[17]   iFusion: Towards efficient intelligence fusion for deep learning from real-time and heterogeneous data [J].
Guo, Kehua ;
Xu, Tao ;
Kui, Xiaoyan ;
Zhang, Ruifang ;
Chi, Tao .
INFORMATION FUSION, 2019, 51 (215-223) :215-223
[18]   Joint multi-label classification and label correlations with missing labels and feature selection [J].
He, Zhi-Fen ;
Yang, Ming ;
Gao, Yang ;
Liu, Hui-Dong ;
Yin, Yilong .
KNOWLEDGE-BASED SYSTEMS, 2019, 163 :145-158
[19]   Diagnosis-specific readmission risk prediction using electronic health data: a retrospective cohort study [J].
Hebert, Courtney ;
Shivade, Chaitanya ;
Foraker, Randi ;
Wasserman, Jared ;
Roth, Caryn ;
Mekhjian, Hagop ;
Lemeshow, Stanley ;
Embi, Peter .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2014, 14
[20]  
Hosseinzadeh A., 2013, PROC 25 INNOV APPL A, P1532