Development of a respiratory virus risk model with environmental data based on interpretable machine learning methods

被引:0
|
作者
Shi, Shuting [1 ]
Lin, Haowen [2 ]
Jiang, Leiming [3 ]
Zeng, Zhiqi [3 ,4 ]
Lin, Chuixu [1 ]
Li, Pei [1 ]
Li, Yinghua [1 ]
Yang, Zifeng [5 ]
机构
[1] Guangzhou KingMed Diagnost Grp Co Ltd, Guangzhou, Peoples R China
[2] Guangdong Acad Med Sci, Guangdong Prov Peoples Hosp, Guangdong Cardiovasc Inst, Guangzhou, Peoples R China
[3] Guangzhou Med Univ, KingMed Sch Lab Med, Guangzhou, Peoples R China
[4] Guangzhou Med Univ, Guangzhou key Lab Clin rapid Diag & early warning, Guangzhou, Peoples R China
[5] Guangzhou Med Univ, Guangzhou Inst Resp Hlth,Affiliated Hosp 1, Natl Clin Res Ctr Resp Dis, State Key Lab Resp Dis, Guangzhou, Peoples R China
来源
NPJ CLIMATE AND ATMOSPHERIC SCIENCE | 2025年 / 8卷 / 01期
关键词
NITROGEN-DIOXIDE; INFECTIONS; AIRWAY;
D O I
10.1038/s41612-025-00894-4
中图分类号
P4 [大气科学(气象学)];
学科分类号
0706 ; 070601 ;
摘要
In recent years, numerous studies have explored the relationship between atmospheric conditions and respiratory viral infections. However, these investigations have faced certain limitations, such as the use of modestly sized datasets, a restricted geographical focus, and an emphasis on a limited number of respiratory pathogens. This study aimed to develop a nationwide respiratory virus infection risk prediction model through machine learning approach. We utilized the CRFC algorithm, a random forest-based method for multi-label classification, to predict the presence of various respiratory viruses. The model integrated binary classification outcomes for each virus category and incorporated air quality and meteorological data to enhance its accuracy. The data was collected from 31 regions in China between 2016 and 2021, encompassing pathogen detection, air quality indices, and meteorological measurements. The model's performance was evaluated using ROC curves, AUC scores, and precision-recall curves. Our model demonstrated robust performance across various metrics, with an average overall accuracy of 0.76, macro sensitivity of 0.75, macro precision of 0.77, and an average AUC score of 0.9. The SHAP framework was employed to interpret the model's predictions, revealing significant contributions from parameters such as age, NO2 levels, and meteorological conditions. Our model provides a reliable tool for predicting respiratory virus risks, with a comprehensive integration of environmental and clinical data. The model's performance metrics indicate its potential utility in clinical decision-making and public health planning. Future work will focus on refining the model and expanding its applicability to diverse populations and settings.
引用
收藏
页数:11
相关论文
共 43 条
  • [1] Linezolid induced thrombocytopenia in critically ill patients: Risk factors and development of a machine learning-based prediction model
    Maray, Ivan
    Rodriguez-Ferreras, Adrian
    Alvarez-Asteinza, Cristina
    Alaguero-Calero, Miguel
    Valledor, Pablo
    Fernandez, Javier
    JOURNAL OF INFECTION AND CHEMOTHERAPY, 2022, 28 (09) : 1249 - 1254
  • [2] A Machine Learning Model for Detecting Respiratory Problems using Voice Recognition
    Das, Soubhik
    2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
  • [3] Machine learning for the identification of respiratory viral attachment machinery from sequences data
    Walker, Kenji C.
    Shwarts, Maia
    Demidikin, Stepan
    Chakravarty, Arijit
    Joseph-McCarthy, Diane
    PLOS ONE, 2023, 18 (03):
  • [4] Development and Validation of a Machine Learning Model for Early Detection of Untreated Infection
    Buell, Kevin G.
    Carey, Kyle A.
    Dussault, Nicole
    Parker, William F.
    Dumanian, Jay
    Bhavani, Sivasubramanium V.
    Gilbert, Emily R.
    Winslow, Christopher J.
    Shah, Nirav S.
    Afshar, Majid
    Edelson, Dana P.
    Churpek, Matthew M.
    CRITICAL CARE EXPLORATIONS, 2024, 6 (10)
  • [5] The Development and Validation of a Machine Learning Model to Predict Bacteremia and Fungemia in Hospitalized Patients Using Electronic Health Record Data
    Bhavani, Sivasubramanium V.
    Lonjers, Zachary
    Carey, Kyle A.
    Afshar, Majid
    Gilbert, Emily R.
    Shah, Nirav S.
    Huang, Elbert S.
    Churpek, Matthew M.
    CRITICAL CARE MEDICINE, 2020, 48 (11) : E1020 - E1028
  • [6] Novel statistical time series data augmentation and machine learning based classification of unobtrusive respiration data for respiration Digital Twin model
    Khan, Sagheer
    Alzaabi, Aaesha
    Ratnarajah, Tharmalingam
    Arslan, Tughrul
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 168
  • [7] Test and validation of methods to sample and detect human virus from environmental surfaces using norovirus as a model virus
    Ibfelt, T.
    Frandsen, T.
    Permin, A.
    Andersen, L. P.
    Schultz, A. C.
    JOURNAL OF HOSPITAL INFECTION, 2016, 92 (04) : 378 - 384
  • [8] Machine learning forecasts for seasonal epidemic peaks: Lessons learnt from an atypical respiratory syncytial virus season
    Morbey, Roger A.
    Todkill, Daniel
    Watson, Conall
    Elliot, Alex J.
    PLOS ONE, 2023, 18 (09):
  • [9] Development of a predictive machine learning model for pathogen profiles in patients with secondary immunodeficiency
    Liu, Qianning
    Chen, Yifan
    Xie, Peng
    Luo, Ying
    Wang, Buxuan
    Meng, Yuanxi
    Zhong, Jiaqian
    Mei, Jiaqi
    Zou, Wei
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [10] Constructing a prediction model for difficult intubation of obese patients based on machine learning
    Zhou, Cheng-Mao
    Xue, Qiong
    Ye, Hao-Tian
    Wang, Ying
    Tong, Jianhua
    Ji, Mu-Huo
    Yang, Jian-Jun
    JOURNAL OF CLINICAL ANESTHESIA, 2021, 72