Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study

被引:15
作者
Benito-Leon, Julian [1 ]
Dolores del Castillo, Ma [2 ]
Estirado, Alberto [3 ]
Ghosh, Ritwik [4 ]
Dubey, Souvik [5 ]
Serrano, J. Ignacio [2 ]
机构
[1] Univ Hosp 12 Octubre, Dept Neurol, Ave Cordoba S-N, Madrid 28041, Spain
[2] CSIC UPM, Neural & Cognit Engn Grp, Ctr Automat & Robot, Arganda Del Rey, Spain
[3] HM Hosp, Madrid, Spain
[4] Burdwan Med Coll & Hosp, Dept Gen Med, Burdwan, W Bengal, India
[5] Bangur Inst Neurosci, Dept Neuromed, Kolkata, India
关键词
COVID-19; machine learning; outcome; severity; subgroup; emergency; detection; intervention; testing; data set; characterization; LACTATE-DEHYDROGENASE; MORTALITY; DIAGNOSIS;
D O I
10.2196/25988
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Early detection and intervention are the key factors for improving outcomes in patients with COVID-19. Objective: The objective of this observational longitudinal study was to identify nonoverlapping severity subgroups (ie, clusters) among patients with COVID-19, based exclusively on clinical data and standard laboratory tests obtained during patient assessment in the emergency department. Methods: We applied unsupervised machine learning to a data set of 853 patients with COVID-19 from the HM group of hospitals (HM Hospitales) in Madrid, Spain. Age and sex were not considered while building the clusters, as these variables could introduce biases in machine learning algorithms and raise ethical implications or enable discrimination in triage protocols. Results: From 850 clinical and laboratory variables, four tests-the serum levels of aspartate transaminase (AST), lactate dehydrogenase (LDH), C-reactive protein (CRP), and the number of neutrophils-were enough to segregate the entire patient pool into three separate clusters. Further, the percentage of monocytes and lymphocytes and the levels of alanine transaminase (ALT) distinguished cluster 3 patients from the other two clusters. The highest proportion of deceased patients; the highest levels of AST, ALT, LDH, and CRP; the highest number of neutrophils; and the lowest percentages of monocytes and lymphocytes characterized cluster 1. Cluster 2 included a lower proportion of deceased patients and intermediate levels of the previous laboratory tests. The lowest proportion of deceased patients; the lowest levels of AST, ALT, LDH, and CRP; the lowest number of neutrophils; and the highest percentages of monocytes and lymphocytes characterized cluster 3. Conclusions: A few standard laboratory tests, deemed available in all emergency departments, have shown good discriminative power for the characterization of severity subgroups among patients with COVID-19.
引用
收藏
页数:14
相关论文
共 50 条
[31]   Using D-dimer as a Biomarker to Predict COVID-19 Disease Severity from Clinical Data of Hospitalized Patients: A Machine Learning Approach [J].
Wu, Yuqi ;
Ren, Yang ;
Wu, Dezhi ;
Xirasagar, Sudha ;
Johnson, Joseph .
2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022), 2022, :664-668
[32]   Identification of Age-Related Characteristic Genes Involved in Severe COVID-19 Infection Among Elderly Patients Using Machine Learning and Immune Cell Infiltration Analysis [J].
Li, Huan ;
Zhao, Jin ;
Xing, Yan ;
Chen, Jia ;
Wen, Ziying ;
Ma, Rui ;
Han, Fengxia ;
Huang, Boyong ;
Wang, Hao ;
Li, Cui ;
Chen, Yang ;
Ning, Xiaoxuan .
BIOCHEMICAL GENETICS, 2025, 63 (03) :2040-2060
[33]   An Unsupervised Machine Learning Clustering and Prediction of Differential Clinical Phenotypes of COVID-19 Patients Based on Blood Tests-A Hong Kong Population Study [J].
Lau, Kitty Yu-Yeung ;
Ng, Kei-Shing ;
Kwok, Ka-Wai ;
Tsia, Kevin Kin-Man ;
Sin, Chun-Fung ;
Lam, Ching-Wan ;
Vardhanabhuti, Varut .
FRONTIERS IN MEDICINE, 2022, 8
[34]   Cardiovascular and Renal Comorbidities Included into Neural Networks Predict the Outcome in COVID-19 Patients Admitted to an Intensive Care Unit: Three-Center, Cross-Validation, Age- and Sex-Matched Study [J].
Ovcharenko, Evgeny ;
Kutikhin, Anton ;
Gruzdeva, Olga ;
Kuzmina, Anastasia ;
Slesareva, Tamara ;
Brusina, Elena ;
Kudasheva, Svetlana ;
Bondarenko, Tatiana ;
Kuzmenko, Svetlana ;
Osyaev, Nikolay ;
Ivannikova, Natalia ;
Vavin, Grigory ;
Moses, Vadim ;
Danilov, Viacheslav ;
Komossky, Egor ;
Klyshnikov, Kirill .
JOURNAL OF CARDIOVASCULAR DEVELOPMENT AND DISEASE, 2023, 10 (02)
[35]   Assessment of the disease severity in patients hospitalized for COVID-19 based on the National Early Warning Score (NEWS) using statistical and machine learning methods: An electronic health records database analysis [J].
Lycholip, Valentinas ;
Puronaite, Roma ;
Skorniakov, Viktor ;
Navickas, Petras ;
Tarutyte, Gabriele ;
Trinkunas, Justas ;
Burneikaite, Greta ;
Kazenaite, Edita ;
Jankauskiene, Augustina .
TECHNOLOGY AND HEALTH CARE, 2023, 31 (06) :2513-2524
[36]   Unraveling relevant cross-waves pattern drifts in patient-hospital risk factors among hospitalized COVID-19 patients using explainable machine learning methods [J].
Lana, Fernanda Cristina Barbosa ;
Marinho, Carolina Coimbra ;
de Paiva, Bruno Barbosa Miranda ;
Valle, Lucas Rocha ;
do Nascimento, Guilherme Fonseca ;
da Rocha, Leonardo Chaves Dutra ;
Carneiro, Marcelo ;
Batista, Joanna d'Arc Lyra ;
Anschau, Fernando ;
Paraiso, Pedro Gibson ;
Bartolazzi, Frederico ;
Cimini, Christiane Correa Rodrigues ;
Schwarzbold, Alexandre Vargas ;
Rios, Danyelle Romana Alves ;
Goncalves, Marcos Andre ;
Marcolino, Milena Soriano .
BMC INFECTIOUS DISEASES, 2025, 25 (01)
[37]   Exploring the intersection of obesity and gender in COVID-19 outcomes in hospitalized Mexican patients: a comparative analysis of risk profiles using unsupervised machine learning [J].
Nezhadmoghadam, Fahimeh ;
Tamez-Pena, Jose Gerardo ;
Martinez-Ledesma, Emmanuel .
FRONTIERS IN PUBLIC HEALTH, 2024, 12
[38]   Machine learning approaches to predict the need for intensive care unit admission among Iranian COVID-19 patients based on ICD-10: A cross-sectional study [J].
Karimi, Zahra ;
Malak, Jaleh S. ;
Aghakhani, Amirhossein ;
Najafi, Mohammad S. ;
Ariannejad, Hamid ;
Zeraati, Hojjat ;
Yekaninejad, Mir S. .
HEALTH SCIENCE REPORTS, 2024, 7 (09)
[39]   Impact of COVID-19 research: a study on predicting influential scholarly documents using machine learning and a domain-independent knowledge graph [J].
Rabby, Gollam ;
D'Souza, Jennifer ;
Oelen, Allard ;
Dvorackova, Lucie ;
Svatek, Vojtech ;
Auer, Soeren .
JOURNAL OF BIOMEDICAL SEMANTICS, 2023, 14 (01)
[40]   Impact of COVID-19 research: a study on predicting influential scholarly documents using machine learning and a domain-independent knowledge graph [J].
Gollam Rabby ;
Jennifer D’Souza ;
Allard Oelen ;
Lucie Dvorackova ;
Vojtěch Svátek ;
Sören Auer .
Journal of Biomedical Semantics, 14