Predicting heart transplantation outcomes through data analytics

被引:75
作者
Dag, Ali [1 ]
Oztekin, Asil [2 ]
Yucel, Ahmet [3 ]
Bulur, Serkan [4 ]
Megahed, Fadel M. [5 ]
机构
[1] Univ South Dakota, Beacom Sch Business, Vermillion, SD 57069 USA
[2] Univ Massachusetts, Operat & Informat Syst, Lowell, MA 01854 USA
[3] Auburn Univ, Dept Math, Auburn, AL 36849 USA
[4] Univ Alabama Birmingham, Div Cardiovasc Dis, Birmingham, AL 35233 USA
[5] Miami Univ, Farmer Sch Business, Oxford, OH 45056 USA
关键词
Data mining; Health-care analytics; Medical decision making; Transplants; Unbalanced data; United Network for Organ Sharing (UNOS); CARDIAC TRANSPLANTATION; INTERNATIONAL SOCIETY; DONOR AGE; SURVIVAL; RISK; KIDNEY; TIME; ALGORITHM; MORTALITY; REGISTRY;
D O I
10.1016/j.dss.2016.10.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting the survival of heart transplant patients is an important, yet challenging problem since it plays a crucial role in understanding the matching procedure between a donor and a recipient. Data mining models can be used to effectively analyze and extract novel information from large/complex transplantation datasets. The objective of this study is to predict the 1-, 5-, and 9-year patient's graft survival following a heart transplant surgery via the deployment of analytical models that are based on four powerful classification algorithms (i.e. decision trees, artificial neural networks, support vector machines, and logistic regression). Since the datasets used in this study has a much larger number of survival cases than deaths for 1- and 5-year survival analysis and vice versa for 9-year survival analysis, random under sampling (RUS) and synthetic minority over-sampling (SMOTE) are employed to overcome the data-imbalance problems. The results indicate that logistic regression combined with SMOTE achieves the best classification for the 1-, 5-, and 9-year outcome prediction, with area-under-the-curve (AUC) values of 0.624, 0.676, and 0.838, respectively. By applying sensitivity analysis to the data analytical models, the most important predictors and their associated contribution for the 1-, 5-, and 9-year graft survival of heart transplant patients are identified. By doing so, variables, whose importance changes over time, are differentiated. Not only this proposed hybrid approach gives superior results over the literature but also the models and identification of the variables present important retrospective findings, which can be the basis for a prospective medical study. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:42 / 52
页数:11
相关论文
共 69 条
[41]   The Effect of Age, Diagnosis, and Previous Surgery in Children and Adults Undergoing Heart Transplantation for Congenital Heart Disease [J].
Lamour, Jacqueline M. ;
Kanter, Kirk R. ;
Naftel, David C. ;
Chrisant, Maryanne R. ;
Morrow, William R. ;
Clemson, Barry S. ;
Kirklin, James K. .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2009, 54 (02) :160-165
[42]   Single and multiple time-point prediction models in kidney transplant outcomes [J].
Lin, Ray S. ;
Horn, Susan D. ;
Hurdle, John F. ;
Goldfarb-Rumyantzev, Alexander S. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2008, 41 (06) :944-952
[43]  
Lopez-Sendon J., 2011, MEDICOGRAPHIA, V33, P363
[44]  
Lund L. H., J HEART LUNG TRANSPL, V32
[45]   Listing criteria for heart transplantation: International Society for Heart and Lung Transplantation guidelines for the care of cardiac transplant candidates - 2006 [J].
Mehra, Mandeep R. ;
Kobashigawa, Jon ;
Starling, Randall ;
Russell, Stuart ;
Uber, Patricia A. ;
Parameshwar, Jayan ;
Mohacsi, Paul ;
Augustine, Sharon ;
Aaronson, Keith ;
Barr, Mark .
JOURNAL OF HEART AND LUNG TRANSPLANTATION, 2006, 25 (09) :1024-1042
[46]   Artificial neural networks: powerful tools for modeling chaotic behavior in the nervous system [J].
Molaie, Malihe ;
Falahian, Razieh ;
Gharibzadeh, Shahriar ;
Jafari, Sajad ;
Sprott, Julien C. .
FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2014, 8
[47]   Infectious complications among 620 consecutive heart transplant patients at Stanford University Medical Center [J].
Montoya, JG ;
Giraldo, LF ;
Efron, B ;
Stinson, EB ;
Gamberg, P ;
Hung, S ;
Giannetti, N ;
Miller, J ;
Remington, JS .
CLINICAL INFECTIOUS DISEASES, 2001, 33 (05) :629-640
[48]   Algorithm to determine the outcome of patients with acute liver failure: a data-mining analysis using decision trees [J].
Nakayama, Nobuaki ;
Oketani, Makoto ;
Kawamura, Yoshihiro ;
Inao, Mie ;
Nagoshi, Sumiko ;
Fujiwara, Kenji ;
Tsubouchi, Hirohito ;
Mochida, Satoshi .
JOURNAL OF GASTROENTEROLOGY, 2012, 47 (06) :664-677
[49]  
Olshen L.B. J. F. R., 1984, Classification and regression trees
[50]   Development of a structural equation modeling-based decision tree methodology for the analysis of lung transplantations [J].
Oztekin, Asil ;
Kong, Zhenyu James ;
Delen, Dursun .
DECISION SUPPORT SYSTEMS, 2011, 51 (01) :155-166