Learning from class-imbalance and heterogeneous data for 30-day hospital readmission

被引:18
作者
Du, Guodong [1 ]
Zhang, Jia [1 ]
Li, Shaozi [1 ]
Li, Candong [2 ]
机构
[1] Xiamen Univ, Dept Artificial Intelligence, Xiamen 361005, Peoples R China
[2] Fujian Univ Tradit Chinese Med, Coll Tradit Chinese Med, Fuzhou 350122, Peoples R China
关键词
30-day readmission prediction; Heterogeneous data; Class-imbalance data; Sample weight learning; Large margin property; FEATURE-SELECTION; PREDICTION; FRAMEWORK; MODELS; TIME;
D O I
10.1016/j.neucom.2020.08.064
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting 30-day hospital readmission is a core research task in the development of personalized healthcare. However, the imbalanced class distribution and the heterogeneity of electronic health records are the major challenges to establish an effective machine learning model for this task. To overcome these issues, we propose a new 30-day readmission prediction algorithm to improve the performance. First, we solve the problem of class-imbalance readmission prediction by learning sample weights based on hypothesis margin loss. At the same time, we consider the character of data heterogeneity, and learn the weights of heterogeneous data sources to improve the generalization ability. Based on this, we construct an optimization framework, which involves two variables, i.e., sample weights and source weights. By iterative optimization, we obtain the prediction result for readmission. Finally, we conduct experiments on three real-world readmission datasets to verify the effectiveness of the proposed method. The experimental results show that the proposed algorithm has the advantages to deal with the task of 30-day hospital readmission prediction. (C) 2020 Published by Elsevier B.V.
引用
收藏
页码:27 / 35
页数:9
相关论文
共 58 条
  • [11] Dumpala S. H., 2018, IJCAI, P2100
  • [12] A comparison of models for predicting early hospital readmissions
    Futoma, Joseph
    Morris, Jonathan
    Lucas, Joseph
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 56 : 229 - 238
  • [13] StageNet: Stage-Aware Neural Networks for Health Risk Prediction
    Gao, Junyi
    Xiao, Cao
    Wang, Yasha
    Tang, Wen
    Glass, Lucas M.
    Sun, Jimeng
    [J]. WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 530 - 540
  • [14] Neural and statistical predictors for time to readmission in emergency departments: A case study
    Garmendia, Asier
    Grana, Manuel
    Manuel Lopez-Guede, Jose
    Rios, Sebastian
    [J]. NEUROCOMPUTING, 2019, 354 : 3 - 9
  • [15] Gilad-Bachrach R., 2004, MACH LEARN P 21 INT, P1
  • [16] iFusion: Towards efficient intelligence fusion for deep learning from real-time and heterogeneous data
    Guo, Kehua
    Xu, Tao
    Kui, Xiaoyan
    Zhang, Ruifang
    Chi, Tao
    [J]. INFORMATION FUSION, 2019, 51 (215-223) : 215 - 223
  • [17] Joint multi-label classification and label correlations with missing labels and feature selection
    He, Zhi-Fen
    Yang, Ming
    Gao, Yang
    Liu, Hui-Dong
    Yin, Yilong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 145 - 158
  • [18] Diagnosis-specific readmission risk prediction using electronic health data: a retrospective cohort study
    Hebert, Courtney
    Shivade, Chaitanya
    Foraker, Randi
    Wasserman, Jared
    Roth, Caryn
    Mekhjian, Hagop
    Lemeshow, Stanley
    Embi, Peter
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2014, 14
  • [19] Hosseinzadeh A., 2013, PROC 25 INNOV APPL A, P1532
  • [20] Large-margin nearest neighbor classifiers via sample weight learning
    Hu, Qinghua
    Zhu, Pengfei
    Yang, Yongbin
    Yu, Daren
    [J]. NEUROCOMPUTING, 2011, 74 (04) : 656 - 660