Predicting Coronavirus Disease 2019 Infection Risk and Related Risk Drivers in Nursing Homes: A Machine Learning Approach

被引:29
|
作者
Sun, Christopher L. F. [1 ,2 ]
Zuccarelli, Eugenio [1 ,3 ]
Zerhouni, El Ghali A. [1 ,3 ]
Lee, Jason [1 ,4 ]
Muller, James [5 ]
Scott, Karen M. [1 ]
Lujan, Alida M. [1 ]
Levi, Retsef [1 ]
机构
[1] MIT, Sloan Sch Management, 100 Main St,Room E62-562, Cambridge, MA 02142 USA
[2] Massachusetts Gen Hosp, Healthcare Syst Engn, Boston, MA 02114 USA
[3] MIT, Operat Res Ctr, Cambridge, MA 02142 USA
[4] MIT, Sch Engn, Cambridge, MA 02142 USA
[5] Muller Consulting & Data Analyt LLC, Washington, DC USA
基金
加拿大健康研究院;
关键词
Nursing homes; COVID-19; infection prevention; health policy; long-term care facility; risk modeling; machine-learning; COVID-19; DISPARITIES; QUALITY;
D O I
10.1016/j.jamda.2020.08.030
中图分类号
R592 [老年病学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 100203 ;
摘要
Objective: Inform coronavirus disease 2019 (COVID-19) infection prevention measures by identifying and assessing risk and possible vectors of infection in nursing homes (NHs) using a machine-learning approach. Design: This retrospective cohort study used a gradient boosting algorithm to evaluate risk of COVID-19 infection (ie, presence of at least 1 confirmed COVID-19 resident) in NHs. Setting and Participants: The model was trained on outcomes from 1146 NHs in Massachusetts, Georgia, and New Jersey, reporting COVID-19 case data on April 20, 2020. Risk indices generated from the model using data from May 4 were prospectively validated against outcomes reported on May 11 from 1021 NHs in California. Methods: Model features, pertaining to facility and community characteristics, were obtained from a self-constructed dataset based on multiple public and private sources. The model was assessed via out-of-sample area under the receiver operating characteristic curve (AUC), sensitivity, and specificity in the training (via 10-fold cross-validation) and validation datasets. Results: The mean AUC, sensitivity, and specificity of the model over 10-fold cross-validation were 0.729 [95% confidence interval (CI) 0.690.0.767], 0.670 (95% CI 0.477.0.862), and 0.611 (95% CI 0.412-0.809), respectively. Prospective out-of-sample validation yielded similar performance measures (AUC 0.721; sensitivity 0.622; specificity 0.713). The strongest predictors of COVID-19 infection were identified as the NH's county's infection rate and the number of separate units in the NH; other predictors included the county's population density, historical Centers of Medicare and Medicaid Services cited health deficiencies, and the NH's resident density (in persons per 1000 square feet). In addition, the NH's historical percentage of non-Hispanic white residents was identified as a protective factor. Conclusions and Implications: A machine-learning model can help quantify and predict NH infection risk. The identified risk factors support the early identification and management of presymptomatic and asymptomatic individuals (eg, staff) entering the NH from the surrounding community and the development of financially sustainable staff testing initiatives in preventing COVID-19 infection. (C) 2020 AMDA - The Society for Post-Acute and Long-Term Care Medicine.
引用
收藏
页码:1533 / +
页数:12
相关论文
共 50 条
  • [31] Leading indicators and maritime safety: predicting future risk with a machine learning approach
    Lutz Kretschmann
    Journal of Shipping and Trade, 5 (1)
  • [32] The risk factors for long term cardiovascular symptoms in patients after coronavirus disease 2019 infection
    Liu, Tingting
    Kang, Haofei
    ANNALS OF MEDICINE, 2024, 56 (01)
  • [33] PrOsteoporosis: predicting osteoporosis risk using NHANES data and machine learning approach
    Si, Zebing
    Zhang, Di
    Wang, Huajun
    Zheng, Xiaofei
    BMC RESEARCH NOTES, 2025, 18 (01)
  • [34] Mathematical Modeling to Inform Vaccination Strategies and Testing Approaches for Coronavirus Disease 2019 (COVID-19) in Nursing Homes
    Kahn, Rebecca
    Holmdahl, Inga
    Reddy, Sujan
    Jernigan, John
    Mina, Michael J.
    Slayton, Rachel B.
    CLINICAL INFECTIOUS DISEASES, 2022, 74 (04) : 597 - 603
  • [35] Predicting the Risk of Diabetes and Heart Disease with Machine Learning Classifiers: The Mediation Analysis
    Verma, Ajay
    Jain, Manisha
    MEASUREMENT-INTERDISCIPLINARY RESEARCH AND PERSPECTIVES, 2024,
  • [36] Ethnicity as a Risk Factor for the Novel Coronavirus Disease 2019 (COVID-19)
    Hamad, Abubaker M.
    Al-Mohammed, Hi
    Alkhorayee, Mohammed
    Abbas, Mohammed Y.
    Sulieman, Abdelmoneim
    JOURNAL OF RESEARCH IN MEDICAL AND DENTAL SCIENCE, 2021, 9 (12): : 66 - 75
  • [37] Machine learning models for predicting the risk factor of carotid plaque in cardiovascular disease
    Bin, Chengling
    Li, Qin
    Tang, Jing
    Dai, Chaorong
    Jiang, Ting
    Xie, Xiufang
    Qiu, Min
    Chen, Lumiao
    Yang, Shaorong
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2023, 10
  • [38] A Geography of Risk: Structural Racism and Coronavirus Disease 2019 Mortality in the United States
    Zalla, Lauren C.
    Martin, Chantel L.
    Edwards, Jessie K.
    Gartner, Danielle R.
    Noppert, Grace A.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2021, 190 (08) : 1439 - 1446
  • [39] A machine learning approach to high-risk cardiac surgery risk scoring
    Rogers, Michael P.
    Janjua, Haroon
    Fishberger, Gregory
    Harish, Abhinav
    Sujka, Joseph
    Toloza, Eric M.
    DeSantis, Anthony J.
    Hooker, Robert L.
    Pietrobon, Ricardo
    Lozonschi, Lucian
    Kuo, Paul C.
    JOURNAL OF CARDIAC SURGERY, 2022, 37 (12) : 4612 - 4620
  • [40] Machine Learning Approaches for Predicting Risk of Cardiometabolic Disease among University Students
    Musleh, Dhiaa
    Alkhwaja, Ali
    Alkhwaja, Ibrahim
    Alghamdi, Mohammed
    Abahussain, Hussam
    Albugami, Mohammed
    Alfawaz, Faisal
    El-Ashker, Said
    Al-Hariri, Mohammed
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (03)