Machine Learning for Prediction of Patients on Hemodialysis with an Undetected SARS-CoV-2 Infection

被引:11
作者
Monaghan, Caitlin K. [1 ]
Larkin, John W. [1 ]
Chaudhuri, Sheetal [1 ,2 ]
Han, Hao [1 ]
Jiao, Yue [1 ]
Bermudez, Kristine M. [3 ]
Weinhandl, Eric D. [3 ]
Dahne-Steuber, Ines A. [3 ]
Belmonte, Kathleen [4 ]
Neri, Luca [5 ]
Kotanko, Peter [6 ,7 ]
Kooman, Jeroen P. [2 ]
Hymes, Jeffrey L. [3 ]
Kossmann, Robert J. [3 ]
Usvyat, Len A. [1 ]
Maddux, Franklin W. [8 ]
机构
[1] Fresenius Med Care, Global Med Off, 920 Winter St, Waltham, MA 02451 USA
[2] Maastricht Univ, Med Ctr, Div Nephrol, Maastricht, Netherlands
[3] Fresenius Med Care North Amer, Med Off, Waltham, MA USA
[4] Fresenius Kidney Care, Nursing & Clin Serv, Waltham, MA USA
[5] Fresenius Med Care, EMEA Med Off, Bad Homburg, Germany
[6] Renal Res Inst, Res Div, New York, NY USA
[7] Icahn Sch Med Mt Sinai, Div Nephrol, New York, NY 10029 USA
[8] Fresenius Med Care AG & Co KGaA, Global Med Off, Bad Homburg, Germany
来源
KIDNEY360 | 2021年 / 2卷 / 03期
关键词
dialysis; artificial intelligence; coronavirus; COVID-19; end stage kidney disease; machine learning; prediction; SARS-CoV-2; DIALYSIS PATIENTS; COVID-19; RISK; CARE; AI;
D O I
10.34067/KID.0003802020
中图分类号
R5 [内科学]; R69 [泌尿科学(泌尿生殖系疾病)];
学科分类号
1002 ; 100201 ;
摘要
Background We developed a machine learning (ML) model that predicts the risk of a patient on hemodialysis (HD) having an undetected SARS-CoV-2 infection that is identified after the following >= 3 days. Methods As part of a healthcare operations effort, we used patient data from a national network of dialysis clinics (February-September 2020) to develop an ML model (XGBoost) that uses 81 variables to predict the likelihood of an adult patient on HD having an undetected SARS-CoV-2 infection that is identified in the subsequent >= 3 days. We used a 60%:20%:20% randomized split of COVID-19-positive samples for the training, validation, and testing datasets. Results We used a select cohort of 40,490 patients on HD to build the ML model (11,166 patients who were COVID-19 positive and 29,324 patients who were unaffected controls). The prevalence of COVID-19 in the cohort (28% COVID-19 positive) was by design higher than the HD population. The prevalence of COVID-19 was set to 10% in the testing dataset to estimate the prevalence observed in the national HD population. The threshold for classifying observations as positive or negative was set at 0.80 to minimize false positives. Precision for the model was 0.52, the recall was 0.07, and the lift was 5.3 in the testing dataset. Area under the receiver operating characteristic curve (AUROC) and area under the precision-recall curve (AUPRC) for the model was 0.68 and 0.24 in the testing dataset, respectively. Top predictors of a patient on HD having a SARS-CoV-2 infection were the change in interdialytic weight gain from the previous month, mean pre-HD body temperature in the prior week, and the change in post-HD heart rate from the previous month. Conclusions The developed ML model appears suitable for predicting patients on HD at risk of having COVID-19 at least 3 days before there would be a clinical suspicion of the disease.
引用
收藏
页码:456 / 468
页数:13
相关论文
共 45 条
  • [1] Population-Based Estimates of Chronic Conditions Affecting Risk for Complications from Coronavirus Disease, United States
    Adams, Mary L.
    Katz, David L.
    Grandpre, Joseph
    [J]. EMERGING INFECTIOUS DISEASES, 2020, 26 (08) : 1831 - 1833
  • [2] Artificial intelligence and machine learning to fight COVID-19
    Alimadadi, Ahmad
    Aryal, Sachin
    Manandhar, Ishan
    Munroe, Patricia B.
    Joe, Bina
    Cheng, Xi
    [J]. PHYSIOLOGICAL GENOMICS, 2020, 52 (04) : 200 - 202
  • [3] Prevalence of SARS-CoV-2 antibodies in a large nationwide sample of patients on dialysis in the USA: a cross-sectional study
    Anand, Shuchi
    Montez-Rath, Maria
    Han, Jialin
    Bozeman, Julie
    Kerschmann, Russell
    Beyer, Paul
    Parsonnet, Julie
    Chertow, Glenn M.
    [J]. LANCET, 2020, 396 (10259) : 1335 - 1344
  • [4] [Anonymous], 2019, Am J Kidney Dis, DOI 10.1053/j.ajkd.2019.09.002
  • [5] [Anonymous], 2020, Johns Hopkins Coronavirus Resource Center
  • [6] Recommendations for the prevention, mitigation and containment of the emerging SARS-CoV-2 (COVID-19) pandemic in haemodialysis centres
    Basile, Carlo
    Combe, Christian
    Pizzarelli, Francesco
    Covic, Adrian
    Davenport, Andrew
    Kanbay, Mehmet
    Kirmizis, Dimitrios
    Schneditz, Daniel
    van der Sande, Frank
    Mitra, Sandip
    [J]. NEPHROLOGY DIALYSIS TRANSPLANTATION, 2020, 35 (05) : 737 - 741
  • [7] Pneumonia of unknown aetiology in Wuhan, China: potential for international spread via commercial air travel
    Bogoch, Isaac I.
    Watts, Alexander
    Thomas-Bachli, Andrea
    Huber, Carmen
    Kraemer, Moritz U. G.
    Khan, Kamran
    [J]. JOURNAL OF TRAVEL MEDICINE, 2020, 27 (02)
  • [8] CDC COVID-19 Response Team, 2020, MMWR-MORBID MORTAL W, V69, P343, DOI [10.15585/mmwr.mm6912e2, 10.15585/mmwr.mm6915e4]
  • [9] Centers for Medicare & Medicaid Services, 2020, PREL MED COVID 19 DA
  • [10] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794