Predicting mortality among patients with liver cirrhosis in electronic health records with machine learning

被引:26
作者
Guo, Aixia [1 ]
Mazumder, Nikhilesh R. [2 ,3 ]
Ladner, Daniela P. [3 ,4 ]
Foraker, Randi E. [1 ,5 ]
机构
[1] Washington Univ, Sch Med, Inst Informat I2, St Louis, MO 63130 USA
[2] Northwestern Mem Hosp, Div Gastroenterol, Chicago, IL 60611 USA
[3] Northwestern Univ, Northwestern Univ Transplant Outcomes Res Collabo, Comprehens Transplant Ctr, Feinberg Sch Med, Chicago, IL 60611 USA
[4] Northwestern Med, Dept Surg, Div Transplant, Chicago, IL USA
[5] Washington Univ, Sch Med, Dept Internal Med, St Louis, MO 63110 USA
关键词
MODEL; TRANSPLANTATION; SCORES;
D O I
10.1371/journal.pone.0256428
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Objective Liver cirrhosis is a leading cause of death and effects millions of people in the United States. Early mortality prediction among patients with cirrhosis might give healthcare providers more opportunity to effectively treat the condition. We hypothesized that laboratory test results and other related diagnoses would be associated with mortality in this population. Our another assumption was that a deep learning model could outperform the current Model for End Stage Liver disease (MELD) score in predicting mortality. Materials and methods We utilized electronic health record data from 34,575 patients with a diagnosis of cirrhosis from a large medical center to study associations with mortality. Three time-windows of mortality (365 days, 180 days and 90 days) and two cases with different number of variables (all 41 available variables and 4 variables in MELD-NA) were studied. Missing values were imputed using multiple imputation for continuous variables and mode for categorical variables. Deep learning and machine learning algorithms, i.e., deep neural networks (DNN), random forest (RF) and logistic regression (LR) were employed to study the associations between baseline features such as laboratory measurements and diagnoses for each time window by 5-fold cross validation method. Metrics such as area under the receiver operating curve (AUC), overall accuracy, sensitivity, and specificity were used to evaluate models. Results Performance of models comprising all variables outperformed those with 4 MELD-NA variables for all prediction cases and the DNN model outperformed the LR and RF models. For example, the DNN model achieved an AUC of 0.88, 0.86, and 0.85 for 90, 180, and 365-day mortality respectively as compared to the MELD score, which resulted in corresponding AUCs of 0.81, 0.79, and 0.76 for the same instances. The DNN and LR models had a significantly better f1 score compared to MELD at all time points examined. Conclusion Other variables such as alkaline phosphatase, alanine aminotransferase, and hemoglobin were also top informative features besides the 4 MELD-Na variables. Machine learning and deep learning models outperformed the current standard of risk prediction among patients with cirrhosis. Advanced informatics techniques showed promise for risk prediction in patients with cirrhosis.
引用
收藏
页数:12
相关论文
共 33 条
[1]  
Alber M, 2019, J MACH LEARN RES, V20
[2]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[3]   Evidence-based incorporation of serum sodium concentration into MELD [J].
Biggins, Scott W. ;
Kim, W. Ray ;
Terrault, Norah A. ;
Saab, Sammy ;
Balan, Vijay ;
Schiano, Thomas ;
Benson, Joanne ;
Therneau, Terry ;
Kremers, Walter ;
Wiesner, Russell ;
Kamath, Patrick ;
Klintmalm, Goran .
GASTROENTEROLOGY, 2006, 130 (06) :1652-1660
[4]   Defining a Patient Population With Cirrhosis: An Automated Algorithm With Natural Language Processing [J].
Chang, Edward K. ;
Yu, Christine Y. ;
Clarke, Robin ;
Hackbarth, Andrew ;
Sanders, Timothy ;
Esrailian, Eric ;
Hommes, Daniel W. ;
Runyon, Bruce A. .
JOURNAL OF CLINICAL GASTROENTEROLOGY, 2016, 50 (10) :889-894
[5]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[6]   Liver cirrhosis and cancer: comparison of mortality [J].
Chung, Wankyo ;
Jo, Changik ;
Chung, Woo Jin ;
Kim, Dong Joon .
HEPATOLOGY INTERNATIONAL, 2018, 12 (03) :269-276
[7]   Artificial neural network is superior to MELD in predicting mortality of patients with end-stage liver disease [J].
Cucchetti, A. ;
Vivarelli, M. ;
Heaton, N. D. ;
Phillips, S. ;
Piscaglia, F. ;
Bolondi, L. ;
La Barba, G. ;
Foxton, M. R. ;
Rela, M. ;
O'Grady, J. ;
Pinna, A. D. .
GUT, 2007, 56 (02) :253-258
[8]  
Damico G., NATURAL HIST PROGNOS, DOI [10.1016/j.jhep.2005.10.013, DOI 10.1016/J.JHEP.2005.10.013]
[9]   Are Synthetic Data Derivatives the Future of Translational Medicine? [J].
Foraker, Randi ;
Mann, Douglas L. ;
Payne, Philip R. O. .
JACC-BASIC TO TRANSLATIONAL SCIENCE, 2018, 3 (05) :716-718
[10]   Spot the difference: comparing results of analyses from real patient data and synthetic derivatives [J].
Foraker, Randi E. ;
Yu, Sean C. ;
Gupta, Aditi ;
Michelson, Andrew P. ;
Soto, Jose A. Pineda ;
Colvin, Ryan ;
Loh, Francis ;
Kollef, Marin H. ;
Maddox, Thomas ;
Evanoff, Bradley ;
Dror, Hovav ;
Zamstein, Noa ;
Lai, Albert M. ;
Payne, Philip R. O. .
JAMIA OPEN, 2020, 3 (04) :557-566