Constructing co-occurrence network embeddings to assist association extraction for COVID-19 and other coronavirus infectious diseases

Cited by: 16
Authors
Oniani, David [1]
Jiang, Guoqian [2]
Liu, Hongfang [2]
Shen, Feichen [2]
Affiliations
[1] Mayo Clin, Kern Ctr Sci Hlth Care Delivery, Rochester, MN 55901 USA
[2] Mayo Clin, Div Digital Hlth Sci, Rochester, MN 55901 USA
Keywords
COVID-19; coronavirus infectious diseases; co-occurrence network embeddings; association extraction; SARS-CoV
DOI
10.1093/jamia/ocaa117
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Objective: As coronavirus disease 2019 (COVID-19) emerged rapidly and grew into an unprecedented pandemic, the need for a knowledge repository for the disease became crucial. To address this need, a new machine-readable dataset, the COVID-19 Open Research Dataset (CORD-19), was released. Building on this dataset, our objective was to build computable co-occurrence network embeddings to assist association detection among COVID-19-related biomedical entities. Materials and Methods: Leveraging a Linked Data version of CORD-19 (ie, CORD-19-on-FHIR), we first used SPARQL to extract co-occurrences among chemicals, diseases, genes, and mutations and to build a co-occurrence network. We then trained the representation of the derived co-occurrence network using node2vec with 4 edge embedding operations (L1, L2, Average, and Hadamard). Six algorithms (decision tree, logistic regression, support vector machine, random forest, naive Bayes, and multilayer perceptron) were applied to evaluate performance on link prediction. An unsupervised learning strategy incorporating the t-SNE (t-distributed stochastic neighbor embedding) and DBSCAN (density-based spatial clustering of applications with noise) algorithms was also developed for case studies. Results: The random forest classifier showed the best performance on link prediction across the different network embeddings. For edge embeddings generated using the Average operation, random forest achieved the best average precision of 0.97, along with an F1 score of 0.90. For unsupervised learning, 63 clusters were formed with a silhouette score of 0.128. Significant associations were detected for 5 coronavirus infectious diseases in their corresponding subgroups. Conclusions: In this study, we constructed COVID-19-centered co-occurrence network embeddings. Results indicated that the generated embeddings were able to extract significant associations for COVID-19 and coronavirus infectious diseases.
Pages: 1259-1267
Number of pages: 9
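The abstract names four edge embedding operations (L1, L2, Average, and Hadamard) but does not define them. The sketch below assumes they follow the binary operators commonly used with node2vec, which combine the two node vectors of an edge element-wise into a single edge feature vector. The function name `edge_embedding`, the operator keys, and the toy vectors are hypothetical and are not taken from the study.

```python
import numpy as np


def edge_embedding(u, v, op):
    """Combine two node vectors into one edge feature vector.

    Assumed definitions (standard node2vec binary operators):
      average  -> (u + v) / 2
      hadamard -> u * v        (element-wise product)
      l1       -> |u - v|
      l2       -> (u - v) ** 2
    """
    u = np.asarray(u, dtype=float)
    v = np.asarray(v, dtype=float)
    if u.shape != v.shape:
        raise ValueError("node vectors must have the same shape")
    if op == "average":
        return (u + v) / 2.0
    if op == "hadamard":
        return u * v
    if op == "l1":
        return np.abs(u - v)
    if op == "l2":
        return (u - v) ** 2
    raise ValueError(f"unknown edge operator: {op}")


# Toy 3-dimensional node vectors; real node2vec embeddings would be
# learned from the co-occurrence network and have many more dimensions.
node_vectors = {
    "entity_a": np.array([1.0, 2.0, -1.0]),
    "entity_b": np.array([3.0, 0.0, 1.0]),
}

# One feature vector per operator for the candidate edge (entity_a, entity_b).
edge_features = {
    op: edge_embedding(node_vectors["entity_a"], node_vectors["entity_b"], op)
    for op in ("average", "hadamard", "l1", "l2")
}
```

Under this reading, each candidate edge yields one feature vector per operator, and those vectors would serve as inputs to the link prediction classifiers listed in the abstract.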