Comparison of Machine Learning Algorithms for Predicting Hospital Readmissions and Worsening Heart Failure Events in Patients With Heart Failure With Reduced Ejection Fraction: Modeling Study

被引:16
作者
Ru, Boshu [1 ]
Tan, Xi [1 ]
Liu, Yu [1 ]
Kannapur, Kartik [2 ]
Ramanan, Dheepan [2 ]
Kessler, Garin [2 ,3 ]
Lautsch, Dominik [1 ]
Fonarow, Gregg [4 ,5 ]
机构
[1] Merck & Co Inc, Rahway, NJ USA
[2] Amazon Web Serv Inc, Seattle, WA USA
[3] Georgetown Univ, Sch Continuing Studies, Washington, DC USA
[4] Univ Calif Los Angeles, Ahmanson UCLA Cardiomyopathy Ctr, Los Angeles, CA USA
[5] Univ Calif Los Angeles, Ahmanson UCLA Cardiomyopathy Ctr, 10833 LeConte Ave, Los Angeles, CA 90095 USA
关键词
deep learning; machine learning; hospital readmission; heart failure; heart failure with reduced ejection fraction; worsening heart failure event; Bidirectional Encoder Representations From Transformers; BERT; clinical registry; medical claims; real-world data;
D O I
10.2196/41775
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Heart failure (HF) is highly prevalent in the United States. Approximately one-third to one-half of HF cases are categorized as HF with reduced ejection fraction (HFrEF). Patients with HFrEF are at risk of worsening HF, have a high risk of adverse outcomes, and experience higher health care use and costs. Therefore, it is crucial to identify patients with HFrEF who are at high risk of subsequent events after HF hospitalization.Objective: Machine learning (ML) has been used to predict HF-related outcomes. The objective of this study was to compare different ML prediction models and feature construction methods to predict 30-, 90-, and 365-day hospital readmissions and worsening HF events (WHFEs).Methods: We used the Veradigm PINNACLE outpatient registry linked to Symphony Health's Integrated Dataverse data from July 1, 2013, to September 30, 2017. Adults with a confirmed diagnosis of HFrEF and HF-related hospitalization were included. WHFEs were defined as HF-related hospitalizations or outpatient intravenous diuretic use within 1 year of the first HF hospitalization. We used different approaches to construct ML features from clinical codes, including frequencies of clinical classification software (CCS) categories, Bidirectional Encoder Representations From Transformers (BERT) trained with CCS sequences (BERT + CCS), BERT trained on raw clinical codes (BERT + raw), and prespecified features based on clinical knowledge. A multilayer perceptron neural network, extreme gradient boosting (XGBoost), random forest, and logistic regression prediction models were applied and compared.Results: A total of 30,687 adult patients with HFrEF were included in the analysis; 11.41% (3184/27,917) of adults experienced a hospital readmission within 30 days of their first HF hospitalization, and nearly half (9231/21,562, 42.81%) of the patients experienced at least 1 WHFE within 1 year after HF hospitalization. The prediction models and feature combinations with the best area under the receiver operating characteristic curve (AUC) for each outcome were XGBoost with CCS frequency (AUC=0.595) for 30-day readmission, random forest with CCS frequency (AUC=0.630) for 90-day readmission, XGBoost with CCS frequency (AUC=0.649) for 365-day readmission, and XGBoost with CCS frequency (AUC=0.640) for WHFEs. Our ML models could discriminate between readmission and WHFE among patients with HFrEF. Our model performance was mediocre, especially for the 30-day readmission events, most likely owing to limitations of the data, including an imbalance between positive and negative cases and high missing rates of many clinical variables and outcome definitions.Conclusions: We predicted readmissions and WHFEs after HF hospitalizations in patients with HFrEF. Features identified by data-driven approaches may be comparable with those identified by clinical domain knowledge. Future work may be warranted to validate and improve the models using more longitudinal electronic health records that are complete, are comprehensive, and have a longer follow-up time.(JMIR Form Res 2023;7:e41775) doi: 10.2196/41775
引用
收藏
页数:17
相关论文
共 51 条
[1]  
[Anonymous], 2013, Applied logistic regression
[2]  
[Anonymous], 2022, HEART FAIL
[3]  
[Anonymous], STUD VER PART HEART
[4]   Machine learning-based prediction of heart failure readmission or death: implications of choosing the right model and the right metrics [J].
Awan, Saqib Ejaz ;
Bennamoun, Mohammed ;
Sohel, Ferdous ;
Sanfilippo, Frank Mario ;
Dwivedi, Girish .
ESC HEART FAILURE, 2019, 6 (02) :428-435
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Clinical and Economic Burden of Chronic Heart Failure and Reduced Ejection Fraction Following a Worsening Heart Failure Event [J].
Butler, Javed ;
Djatche, Laurence M. ;
Sawhney, Baanie ;
Chakladar, Sreya ;
Yang, Lingfeng ;
Brady, Joanne E. ;
Yang, Mei .
ADVANCES IN THERAPY, 2020, 37 (09) :4015-4032
[7]   Clinical Course of Patients With Worsening Heart Failure With Reduced Ejection Fraction [J].
Butler, Javed ;
Yang, Mei ;
Manzi, Massimiliano Alfonzo ;
Hess, Gregory P. ;
Patel, Mahesh J. ;
Rhodes, Thomas ;
Givertz, Michael M. .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2019, 73 (08) :935-944
[8]   Recurrent Neural Networks for Early Detection of Heart Failure From Longitudinal Electronic Health Record Data Implications for Temporal Modeling With Respect to Time Before Diagnosis, Data Density, Data Quantity, and Data Type [J].
Chen, Robert ;
Stewart, Walter F. ;
Sun, Jimeng ;
Ng, Kenney ;
Yan, Xiaowei .
CIRCULATION-CARDIOVASCULAR QUALITY AND OUTCOMES, 2019, 12 (10)
[9]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[10]   GRAM: Graph-based Attention Model for Healthcare Representation Learning [J].
Choi, Edward ;
Bahadori, Mohammad Taha ;
Song, Le ;
Stewart, Walter F. ;
Sun, Jimeng .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :787-795