Constructing co-occurrence network embeddings to assist association extraction for COVID-19 and other coronavirus infectious diseases

Cited by: 16
Authors
Oniani, David [1]
Jiang, Guoqian [2]
Liu, Hongfang [2]
Shen, Feichen [2]
Affiliations
[1] Mayo Clin, Kern Ctr Sci Hlth Care Delivery, Rochester, MN 55901 USA
[2] Mayo Clin, Div Digital Hlth Sci, Rochester, MN 55901 USA
Keywords
COVID-19; coronavirus infectious diseases; co-occurrence network embeddings; association extraction; SARS-CoV
DOI
10.1093/jamia/ocaa117
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective: As coronavirus disease 2019 (COVID-19) emerged rapidly and grew into an unprecedented pandemic, the need for a knowledge repository for the disease became crucial. To address this need, a new machine-readable COVID-19 dataset, the COVID-19 Open Research Dataset (CORD-19), was released. Building on this dataset, our objective was to construct computable co-occurrence network embeddings to assist association detection among COVID-19-related biomedical entities.

Materials and Methods: Leveraging a Linked Data version of CORD-19 (ie, CORD-19-on-FHIR), we first used SPARQL to extract co-occurrences among chemicals, diseases, genes, and mutations and to build a co-occurrence network. We then trained a representation of the derived co-occurrence network using node2vec with 4 edge embedding operations (L1, L2, Average, and Hadamard). Six algorithms (decision tree, logistic regression, support vector machine, random forest, naive Bayes, and multilayer perceptron) were applied to evaluate performance on link prediction. An unsupervised learning strategy incorporating the t-SNE (t-distributed stochastic neighbor embedding) and DBSCAN (density-based spatial clustering of applications with noise) algorithms was also developed for case studies.

Results: The random forest classifier showed the best performance on link prediction across the different network embeddings. For edge embeddings generated using the Average operation, random forest achieved the best average precision of 0.97 along with an F1 score of 0.90. For unsupervised learning, 63 clusters were formed with a silhouette score of 0.128. Significant associations were detected for 5 coronavirus infectious diseases in their corresponding subgroups.

Conclusions: In this study, we constructed COVID-19-centered co-occurrence network embeddings. Results indicated that the generated embeddings were able to extract significant associations for COVID-19 and coronavirus infectious diseases.
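
The four edge embedding operations named in the abstract correspond to the binary operators commonly used with node2vec to turn two node vectors into one edge feature vector. The following is a minimal sketch of those operators, assuming NumPy; the function name `edge_embedding` and the toy vectors are illustrative only and are not taken from the authors' code.

```python
import numpy as np

def edge_embedding(u, v, op):
    """Combine two node embeddings into one edge embedding.

    `op` is one of "average", "hadamard", "l1", "l2"
    (hypothetical string keys chosen for this sketch).
    """
    u = np.asarray(u, dtype=float)
    v = np.asarray(v, dtype=float)
    if u.shape != v.shape:
        raise ValueError("node embeddings must have the same shape")
    if op == "average":
        return (u + v) / 2.0      # element-wise mean
    if op == "hadamard":
        return u * v              # element-wise product
    if op == "l1":
        return np.abs(u - v)      # element-wise absolute difference
    if op == "l2":
        return (u - v) ** 2       # element-wise squared difference
    raise ValueError(f"unknown operator: {op}")

# Toy 2-dimensional node vectors (made up for illustration).
node_a = [1.0, 2.0]
node_b = [3.0, -2.0]
features = {op: edge_embedding(node_a, node_b, op)
            for op in ("average", "hadamard", "l1", "l2")}
```

Each resulting vector has the same dimensionality as the node embeddings and can serve as the feature vector for a link prediction classifier such as the random forest mentioned in the Results.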
Pages: 1259-1267
Page count: 9