Early prediction of end-stage kidney disease using electronic health record data: a machine learning approach with a 2-year horizon

被引:0
作者
Petousis, Panayiotis [1 ,7 ]
Wilson, James M. [2 ]
Gelvezon, Alex, V [3 ]
Alam, Shafiul [3 ]
Jain, Ankur [3 ]
Prichard, Laura [3 ]
Elashoff, David A. [4 ]
Raja, Naveen [5 ]
Bui, Alex A. T. [6 ]
机构
[1] Univ Calif Los Angeles UCLA, UCLA Hlth Clin & Translat Sci Inst, David Geffen Sch Med, Los Angeles, CA 90024 USA
[2] Univ Calif Los Angeles UCLA, David Geffen Sch Med, Dept Med, Los Angeles, CA 90024 USA
[3] Univ Calif Los Angeles UCLA, David Geffen Sch Med, UCLA Hlth Off Hlth Informat & Analyt, Los Angeles, CA 90024 USA
[4] Univ Calif Los Angeles UCLA, Biostat & Computat Med, David Geffen Sch Med, Los Angeles, CA 90024 USA
[5] Univ Calif Los Angeles UCLA, David Geffen Sch Med, Los Angeles, CA 90024 USA
[6] Univ Calif Los Angeles UCLA, David Geffen Sch Med, Dept Radiol Sci, Med & Imaging Informat MII Grp, Los Angeles, CA 90024 USA
[7] Univ Calif Los Angeles UCLA, UCLA Hlth Clin & Translat Sci Inst, David Geffen Sch Med, 924 Westwood Blvd Suite,420 Los Angeles, Los Angeles, CA 90024 USA
关键词
machine learning deployment; early prediction ESKD model; electronic health record; end-stage kidney disease (ESKD);
D O I
10.1093/jamiaopen/ooae015
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives In the United States, end-stage kidney disease (ESKD) is responsible for high mortality and significant healthcare costs, with the number of cases sharply increasing in the past 2 decades. In this study, we aimed to reduce these impacts by developing an ESKD model for predicting its occurrence in a 2-year period.Materials and Methods We developed a machine learning (ML) pipeline to test different models for the prediction of ESKD. The electronic health record was used to capture several kidney disease-related variables. Various imputation methods, feature selection, and sampling approaches were tested. We compared the performance of multiple ML models using area under the ROC curve (AUCROC), area under the Precision-Recall curve (PR-AUC), and Brier scores for discrimination, precision, and calibration, respectively. Explainability methods were applied to the final model.Results Our best model was a gradient-boosting machine with feature selection and imputation methods as additional components. The model exhibited an AUCROC of 0.97, a PR-AUC of 0.33, and a Brier score of 0.002 on a holdout test set. A chart review analysis by expert physicians indicated clinical utility.Discussion and Conclusion An ESKD prediction model can identify individuals at risk for ESKD and has been successfully deployed within our health system. End-stage kidney disease (ESKD) poses a substantial burden for mortality rate and healthcare costs in the United States. We developed and evaluated a machine learning (ML) model for predicting ESKD in 2 years using electronic health record (EHR) data. Various models were tested by leveraging EHR data and employing an ML pipeline. The developed model outperforms existing kidney failure models. Through a chart review, expert nephrologists affirmed the clinical utility of the model in predicting the outcome of complex cases. This model has been successfully integrated into our academic institution as part of a dashboard with visualizations and explainability for the model's predictions. In conclusion, the developed ESKD prediction model demonstrates the ability to identify individuals at risk for ESKD. Any future reduction in mortality and healthcare costs would showcase the effectiveness of our model.
引用
收藏
页数:9
相关论文
共 32 条
  • [1] [Anonymous], 2023, REPORT C MEDICARE PA
  • [2] Machine learning to predict end stage kidney disease in chronic kidney disease
    Bai, Qiong
    Su, Chunyan
    Tang, Wen
    Li, Yike
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [3] Bergstra J, 2012, J MACH LEARN RES, V13, P281
  • [4] Burrows NR, 2022, MMWR-MORBID MORTAL W, V71, P412, DOI 10.15585/mmwr.mm7111a3
  • [5] Machine Learning Improves Upon Clinicians' Prediction of End Stage Kidney Disease
    Chuah, Aaron
    Walters, Giles
    Christiadi, Daniel
    Karpe, Krishna
    Kennard, Alice
    Singer, Richard
    Talaulikar, Girish
    Ge, Wenbo
    Suominen, Hanna
    Andrews, T. Daniel
    Jiang, Simon
    [J]. FRONTIERS IN MEDICINE, 2022, 9
  • [6] Epidemiology of end-stage kidney disease
    Gupta, Ryan
    Woo, Karen
    Yi, Jeniann A.
    [J]. SEMINARS IN VASCULAR SURGERY, 2021, 34 (01) : 71 - 78
  • [7] Assessing Missing Data Assumptions in EHR-Based Studies: A Complex and Underappreciated Task
    Haneuse, Sebastien
    Arterburn, David
    Daniels, Michael J.
    [J]. JAMA NETWORK OPEN, 2021, 4 (02)
  • [8] hcup-us.ahrq.go, Chronic Condition Indicator Refined (CCIR) for ICD-10-CM
  • [9] /hcup-us.ahrq.go, HCUP-US Tools and Software Page CCS-Services and Procedures
  • [10] hcup-us.ahrq.go, Clinical Classifications Software (CCS) for ICD-9-CM