Predicting Coronavirus Disease 2019 Infection Risk and Related Risk Drivers in Nursing Homes: A Machine Learning Approach

被引:29
|
作者
Sun, Christopher L. F. [1 ,2 ]
Zuccarelli, Eugenio [1 ,3 ]
Zerhouni, El Ghali A. [1 ,3 ]
Lee, Jason [1 ,4 ]
Muller, James [5 ]
Scott, Karen M. [1 ]
Lujan, Alida M. [1 ]
Levi, Retsef [1 ]
机构
[1] MIT, Sloan Sch Management, 100 Main St,Room E62-562, Cambridge, MA 02142 USA
[2] Massachusetts Gen Hosp, Healthcare Syst Engn, Boston, MA 02114 USA
[3] MIT, Operat Res Ctr, Cambridge, MA 02142 USA
[4] MIT, Sch Engn, Cambridge, MA 02142 USA
[5] Muller Consulting & Data Analyt LLC, Washington, DC USA
基金
加拿大健康研究院;
关键词
Nursing homes; COVID-19; infection prevention; health policy; long-term care facility; risk modeling; machine-learning; COVID-19; DISPARITIES; QUALITY;
D O I
10.1016/j.jamda.2020.08.030
中图分类号
R592 [老年病学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 100203 ;
摘要
Objective: Inform coronavirus disease 2019 (COVID-19) infection prevention measures by identifying and assessing risk and possible vectors of infection in nursing homes (NHs) using a machine-learning approach. Design: This retrospective cohort study used a gradient boosting algorithm to evaluate risk of COVID-19 infection (ie, presence of at least 1 confirmed COVID-19 resident) in NHs. Setting and Participants: The model was trained on outcomes from 1146 NHs in Massachusetts, Georgia, and New Jersey, reporting COVID-19 case data on April 20, 2020. Risk indices generated from the model using data from May 4 were prospectively validated against outcomes reported on May 11 from 1021 NHs in California. Methods: Model features, pertaining to facility and community characteristics, were obtained from a self-constructed dataset based on multiple public and private sources. The model was assessed via out-of-sample area under the receiver operating characteristic curve (AUC), sensitivity, and specificity in the training (via 10-fold cross-validation) and validation datasets. Results: The mean AUC, sensitivity, and specificity of the model over 10-fold cross-validation were 0.729 [95% confidence interval (CI) 0.690.0.767], 0.670 (95% CI 0.477.0.862), and 0.611 (95% CI 0.412-0.809), respectively. Prospective out-of-sample validation yielded similar performance measures (AUC 0.721; sensitivity 0.622; specificity 0.713). The strongest predictors of COVID-19 infection were identified as the NH's county's infection rate and the number of separate units in the NH; other predictors included the county's population density, historical Centers of Medicare and Medicaid Services cited health deficiencies, and the NH's resident density (in persons per 1000 square feet). In addition, the NH's historical percentage of non-Hispanic white residents was identified as a protective factor. Conclusions and Implications: A machine-learning model can help quantify and predict NH infection risk. The identified risk factors support the early identification and management of presymptomatic and asymptomatic individuals (eg, staff) entering the NH from the surrounding community and the development of financially sustainable staff testing initiatives in preventing COVID-19 infection. (C) 2020 AMDA - The Society for Post-Acute and Long-Term Care Medicine.
引用
收藏
页码:1533 / +
页数:12
相关论文
共 50 条
  • [21] Exploring Coronavirus Disease 2019 Risk Factors: A Text Network Analysis Approach
    Kang, Min-Ah
    Lee, Soo-Kyoung
    JOURNAL OF CLINICAL MEDICINE, 2025, 14 (06)
  • [22] Risk factors associated with COVID-19 infection and mortality in nursing homes
    Telleria, Idoia Beobide
    Uriguen, Alexander Ferro
    Lucas, Esther Laso
    Menicucci, Cinzia Sannino
    Barroso, Maria Enriquez
    Arregui, Adolfo Lopez de Munain
    ATENCION PRIMARIA, 2022, 54 (10):
  • [23] Machine Learning: Predicting Credit Risk
    Melo, Rafael Almeida Pereira
    Guimaraes, Paulo Henrique Sales
    Melo, Marcel Irving Pereira
    SIGMAE, 2024, 13 (04): : 219 - 230
  • [24] Construction and validation of a machine learning-based nomogram: A tool to predict the risk of getting severe coronavirus disease 2019 (COVID-19)
    Yao, Zhixian
    Zheng, Xinyi
    Zheng, Zhong
    Wu, Ke
    Zheng, Junhua
    IMMUNITY INFLAMMATION AND DISEASE, 2021, 9 (02) : 595 - 607
  • [25] Machine learning in the diagnosis of asthma phenotypes during coronavirus disease 2019 pandemic
    Gawlewicz-Mroczka, Agnieszka
    Pytlewski, Adam
    Celejewska-Wojcik, Natalia
    Cmiel, Adam
    Gielicz, Anna
    Sanak, Marek
    Mastalerz, Lucyna
    CLINICAL AND TRANSLATIONAL ALLERGY, 2022, 12 (10)
  • [26] Interpretable machine learning for predicting chronic kidney disease progression risk
    Zheng, Jin-Xin
    Li, Xin
    Zhu, Jiang
    Guan, Shi-Yang
    Zhang, Shun-Xian
    Wang, Wei-Ming
    DIGITAL HEALTH, 2024, 10
  • [27] Predicting postoperative pulmonary infection risk in patients with diabetes using machine learning
    Zhao, Chunxiu
    Xiang, Bingbing
    Zhang, Jie
    Yang, Pingliang
    Liu, Qiaoli
    Wang, Shun
    FRONTIERS IN PHYSIOLOGY, 2024, 15
  • [28] Candidemia and Severe Coronavirus Disease 2019: Which Risk Factors Are Modifiable?
    McCarty, Todd
    CLINICAL INFECTIOUS DISEASES, 2022, 74 (05) : 812 - 813
  • [29] Stroke Risk Related to Coronavirus Disease-2019 What Have We Learned?
    Pourciau, Pamela
    Smith, Britta C.
    CRITICAL CARE NURSING CLINICS OF NORTH AMERICA, 2023, 35 (01) : 53 - 65
  • [30] A Machine Learning-Based Model for Predicting the Risk of Cardiovascular Disease
    Hsiao, Chiu-Han
    Yu, Po-Chun
    Hsieh, Chia-Ying
    Zhong, Bing-Zi
    Tsai, Yu-Ling
    Cheng, Hao-min
    Chang, Wei-Lun
    Lin, Frank Yeong-Sung
    Huang, Yennun
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 1, 2022, 449 : 364 - 374