COVID-19 trial graph: a linked graph for COVID-19 clinical trials

被引:3
作者
Du, Jingcheng [1 ]
Wang, Qing [1 ]
Wang, Jingqi [1 ]
Ramesh, Prerana [1 ]
Xiang, Yang [1 ]
Jiang, Xiaoqian [1 ]
Tao, Cui [1 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, 7000 Fannin,Suite 600, Houston, TX 77030 USA
基金
美国国家卫生研究院;
关键词
clinical trial; COVID-19; eligibility criteria; graph representation;
D O I
10.1093/jamia/ocab078
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Clinical trials are an essential part of the effort to find safe and effective prevention and treatment for COVID-19. Given the rapid growth of COVID-19 clinical trials, there is an urgent need for a better clinical trial information retrieval tool that supports searching by specifying criteria, including both eligibility criteria and structured trial information. Materials and Methods: We built a linked graph for registered COVID-19 clinical trials: the COVID-19 Trial Graph, to facilitate retrieval of clinical trials. Natural language processing tools were leveraged to extract and normalize the clinical trial information from both their eligibility criteria free texts and structured information from ClinicalTrials.gov . We linked the extracted data using the COVID-19 Trial Graph and imported it to a graph database, which supports both querying and visualization. We evaluated trial graph using case queries and graph embedding. Results: The graph currently (as of October 5, 2020) contains 3392 registered COVID-19 clinical trials, with 17 480 nodes and 65 236 relationships. Manual evaluation of case queries found high precision and recall scores on retrieving relevant clinical trials searching from both eligibility criteria and trial-structured information. We observed clustering in clinical trials via graph embedding, which also showed superiority over the baseline (0.870 vs 0.820) in evaluating whether a trial can complete its recruitment successfully. Conclusions: The COVID-19 Trial Graph is a novel representation of clinical trials that allows diverse search queries and provides a graph-based visualization of COVID-19 clinical trials. High-dimensional vectors mapped by graph embedding for clinical trials would be potentially beneficial for many downstream applications, such as trial end recruitment status prediction and trial similarity comparison. Our methodology also is generalizable to other clinical trials.
引用
收藏
页码:1964 / 1969
页数:6
相关论文
共 12 条
[1]  
[Anonymous], 2020, COVID 19 MAP
[2]   A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications [J].
Cai, HongYun ;
Zheng, Vincent W. ;
Chang, Kevin Chen-Chuan .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (09) :1616-1637
[3]   Gene2vec: distributed representation of genes based on co-expression [J].
Du, Jingcheng ;
Jia, Peilin ;
Dai, Yulin ;
Tao, Cui ;
Zhao, Zhongming ;
Zhi, Degui .
BMC GENOMICS, 2019, 20 (Suppl 1)
[4]   node2vec: Scalable Feature Learning for Networks [J].
Grover, Aditya ;
Leskovec, Jure .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :855-864
[5]   Ten quick tips for effective dimensionality reduction [J].
Lan Huong Nguyen ;
Holmes, Susan .
PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (06)
[6]  
Mikolov T., 2013, ICLR, V1301, P3781
[7]  
Neo4j, 2020, Neo4j Graph Platform The Leader in Graph Databases
[8]   Constructing co-occurrence network embeddings to assist association extraction for COVID-19 and other coronavirus infectious diseases [J].
Oniani, David ;
Jiang, Guoqian ;
Liu, Hongfang ;
Shen, Feichen .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (08) :1259-1267
[9]   CLAMP - a toolkit for efficiently building customized clinical natural language processing pipelines [J].
Soysal, Ergin ;
Wang, Jingqi ;
Jiang, Min ;
Wu, Yonghui ;
Pakhomov, Serguei ;
Liu, Hongfang ;
Xu, Hua .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2018, 25 (03) :331-336
[10]   A real-time dashboard of clinical trials for COVID-19 [J].
Thorlund, Kristian ;
Dron, Louis ;
Park, Jay ;
Hsu, Grace ;
Forrest, Jamie I. ;
Mills, Edward J. .
LANCET DIGITAL HEALTH, 2020, 2 (06) :E286-E287