Systematic identification of pharmacogenomics information from clinical trials

被引:20
作者
Li, Jiao [1 ]
Lu, Zhiyong [1 ]
机构
[1] Natl Lib Med, NIH, Bethesda, MD 20894 USA
基金
美国国家卫生研究院;
关键词
Text mining; Clinical outcome; Pharmacogenomics; Clinical trial; DRUG DISCOVERY; TEXT; REGISTRATION; DESIGN; GENE;
D O I
10.1016/j.jbi.2012.04.005
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Recent progress in high-throughput genomic technologies has shifted pharmacogenomic research from candidate gene pharmacogenetics to clinical pharmacogenomics (PGx). Many clinical related questions may be asked such as 'what drug should be prescribed for a patient with mutant alleles?' Typically, answers to such questions can be found in publications mentioning the relationships of the gene-drug-disease of interest. In this work, we hypothesize that ClinicalTrials.gov is a comparable source rich in PGx related information. In this regard, we developed a systematic approach to automatically identify PGx relationships between genes, drugs and diseases from trial records in ClinicalTrials.gov. In our evaluation, we found that our extracted relationships overlap significantly with the curated factual knowledge through the literature in a PGx database and that most relationships appear on average 5 years earlier in clinical trials than in their corresponding publications, suggesting that clinical trials may be valuable for both validating known and capturing new PGx related information in a more timely manner. Furthermore, two human reviewers judged a portion of computer-generated relationships and found an overall accuracy of 74% for our text-mining approach. This work has practical implications in enriching our existing knowledge on PGx gene-drug-disease relationships as well as suggesting crosslinks between ClinicalTrials.gov and other PGx knowledge bases. Published by Elsevier Inc.
引用
收藏
页码:870 / 878
页数:9
相关论文
共 38 条
[1]  
Ahlers CB, 2007, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2007, P209
[2]   Indexing pharmacogenetic knowledge on the World Wide Web [J].
Altman, RB ;
Flockhart, DA ;
Sherry, ST ;
Oliver, DE ;
Rubin, DL ;
Klein, TE .
PHARMACOGENETICS, 2003, 13 (01) :3-5
[3]  
[Anonymous], 2006, PROC 5 INT C LANGUAG
[4]  
[Anonymous], 2003, INT MIGRATION HLTH H, P8
[5]   MutationFinder: a high-performance system for extracting point mutation mentions from text [J].
Caporaso, J. Gregory ;
Baumgartner, William A., Jr. ;
Randolph, David A. ;
Cohen, K. Bretonnel ;
Hunter, Lawrence .
BIOINFORMATICS, 2007, 23 (14) :1862-1865
[6]   Empirical evidence for selective reporting of outcomes in randomized trials -: Comparison of Protocols to published articles [J].
Chan, AW ;
Hróbjartsson, A ;
Haahr, MT ;
Gotzsche, PC ;
Altman, DG .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2004, 291 (20) :2457-2465
[7]   GAPSCORE:: finding gene and protein names one word at a time [J].
Chang, JT ;
Schütze, H ;
Altman, RB .
BIOINFORMATICS, 2004, 20 (02) :216-225
[8]  
Cohen KB, 2011, BIOCOMPUT-PAC SYM, P362
[9]   Using text to build semantic networks for pharmacogenomics [J].
Coulet, Adrien ;
Shah, Nigam H. ;
Garten, Yael ;
Musen, Mark ;
Altman, Russ B. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (06) :1009-1019
[10]  
Coulet A, 2010, BIOCOMPUT-PAC SYM, P485