An improved support vector machine-based diabetic readmission prediction

Cited by: 68
Authors
Cui, Shaoze [1]
Wang, Dujuan [2]
Wang, Yanzhang [1]
Yu, Pay-Wen [3]
Jin, Yaochu [1, 4]
Affiliations
[1] Dalian Univ Technol, Sch Management Sci & Engn, Dalian 116023, Peoples R China
[2] Sichuan Univ, Business Sch, Chengdu 610064, Sichuan, Peoples R China
[3] Fu Jen Catholic Univ, Dept Phys Educ, New Taipei 24205, Taiwan
[4] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, Surrey, England
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
Hospital readmission; Diabetes; Support vector machine; Synthetic minority over-sampling; Feature selection; DATA MINING APPROACH; 30-DAY READMISSION; IMBALANCED CLASSIFICATION; LOGISTIC-REGRESSION; HEART-FAILURE; LACE INDEX; REHOSPITALIZATION; HOSPITALIZATION; SELECTION; RISK;
DOI
10.1016/j.cmpb.2018.10.012
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Background and objective: In healthcare systems, the cost of unplanned readmissions accounts for a large proportion of total hospital payments, and hospital-specific readmission rates have become a critical issue around the world. Quantifying and identifying unplanned readmission risk early will improve the quality of care during hospitalization and reduce the occurrence of readmission. In clinical practice, medical workers generally use the LACE score to evaluate patient readmission risk, but this method usually performs poorly. With this in mind, this study presents a novel method that combines a support vector machine with a genetic algorithm to build a risk prediction model that handles feature selection and imbalanced data at the same time. The model aims to provide decision support for clinicians during the discharge management of patients with diabetes.
Method: The experiments were conducted on a set of 8756 medical records with 50 different features related to diabetic readmission. After the data were preprocessed, an effective SMOTE-based method was proposed to address the imbalanced data problem. To further improve prediction performance, a hybrid feature selection mechanism was devised to select the important features. Subsequently, an improved support vector machine-based (SVM-based) method was developed, and a genetic algorithm was used to tune the sensitive parameter of the algorithm. Finally, five-fold cross-validation was applied to compare the performance of the proposed method with that of other methods (LACE score, logistic regression, naive Bayes, decision tree, and feed-forward neural networks).
Results: Experimental results indicate that the proposed SVM-based method achieves an accuracy of 81.02%, a sensitivity of 82.89%, and a specificity of 79.23%, and that it outperforms other popular algorithms in identifying diabetic patients who may be readmitted.
Conclusions: Our research can improve the performance of clinical decision support systems for diabetic readmission, which can in turn reduce both the likelihood of readmission and the waste of medical resources. (C) 2018 Elsevier B.V. All rights reserved.
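Two ideas named in the abstract can be illustrated with a short Python sketch: the interpolation step at the core of SMOTE-style oversampling, and the three reported evaluation measures (accuracy, sensitivity, specificity). This is not the authors' implementation. The abstract does not specify their SMOTE-based variant, so the function names, the neighbour count `k`, and the toy data below are all hypothetical and chosen only for illustration.

```python
import random


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours (the core SMOTE idea).

    Assumes `minority` holds at least two samples of equal dimension.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority samples by squared Euclidean distance.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return synthetic


def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity (recall on class 1), specificity (recall on class 0).

    Assumes both classes occur in y_true; otherwise a division by zero occurs.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Hypothetical toy data: four minority-class points with two features each.
minority = [[1.0, 2.0], [1.5, 1.8], [2.0, 2.2], [1.2, 2.5]]
synthetic = smote_like_oversample(minority, n_new=6)

# Hypothetical labels (1 = readmitted) and predictions for eight patients.
metrics = binary_metrics([1, 1, 1, 0, 0, 0, 0, 1], [1, 1, 0, 0, 0, 1, 0, 1])
```

Because each synthetic point lies on a line segment between two real minority samples, oversampling of this kind adds no values outside the observed feature ranges. In a cross-validated comparison such as the one the abstract describes, oversampling would normally be applied to the training folds only, so that synthetic points do not leak into the held-out fold.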
Pages: 123-135
Number of pages: 13