Semantic inference using chemogenomics data for drug discovery

被引:10
作者
Zhu, Qian [1 ]
Sun, Yuyin [2 ]
Challa, Sashikiran [1 ]
Ding, Ying [2 ]
Lajiness, Michael S. [3 ]
Wild, David J. [1 ]
机构
[1] Indiana Univ, Sch Informat & Comp, Bloomington, IN 47408 USA
[2] Indiana Univ, Sch Lib & Informat Sci, Bloomington, IN 47408 USA
[3] Eli Lilly & Co, Indianapolis, IN 46225 USA
关键词
Clozapine; Resource Description Framework; Ergot Alkaloid; Methysergide; SPARQL Query;
D O I
10.1186/1471-2105-12-256
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Semantic Web Technology (SWT) makes it possible to integrate and search the large volume of life science datasets in the public domain, as demonstrated by well-known linked data projects such as LODD, Bio2RDF, and Chem2Bio2RDF. Integration of these sets creates large networks of information. We have previously described a tool called WENDI for aggregating information pertaining to new chemical compounds, effectively creating evidence paths relating the compounds to genes, diseases and so on. In this paper we examine the utility of automatically inferring new compound-disease associations (and thus new links in the network) based on semantically marked-up versions of these evidence paths, rule-sets and inference engines. Results: Through the implementation of a semantic inference algorithm, rule set, Semantic Web methods (RDF, OWL and SPARQL) and new interfaces, we have created a new tool called Chemogenomic Explorer that uses networks of ontologically annotated RDF statements along with deductive reasoning tools to infer new associations between the query structure and genes and diseases from WENDI results. The tool then permits interactive clustering and filtering of these evidence paths. Conclusions: We present a new aggregate approach to inferring links between chemical compounds and diseases using semantic inference. This approach allows multiple evidence paths between compounds and diseases to be identified using a rule-set and semantically annotated data, and for these evidence paths to be clustered to show overall evidence linking the compound to a disease. We believe this is a powerful approach, because it allows compound-disease relationships to be ranked by the amount of evidence supporting them.
引用
收藏
页数:12
相关论文
共 30 条
[11]   Chem2Bio2RDF: a semantic framework for linking and data mining chemogenomic and systems chemical biology data [J].
Chen, Bin ;
Dong, Xiao ;
Jiao, Dazhi ;
Wang, Huijun ;
Zhu, Qian ;
Ding, Ying ;
Wild, David J. .
BMC BIOINFORMATICS, 2010, 11
[12]  
HADACZ L, 1999, LECT NOTES COMPUTER, P353
[13]   Myocarditis and cardiomyopathy associated with clozapine [J].
Kilian, JG ;
Kerr, K ;
Lawrence, C ;
Celermajer, DS .
LANCET, 1999, 354 (9193) :1841-1845
[14]  
KOBAYASHI N, 2010, NATURE PRECEDINGS
[15]   Masked clozapine-induced cardiomyopathy [J].
Pastor, Charles A. ;
Mehta, Monica .
JOURNAL OF THE AMERICAN BOARD OF FAMILY MEDICINE, 2008, 21 (01) :70-74
[16]   Life sciences on the Semantic Web: the Neurocommons and beyond [J].
Ruttenberg, Alan ;
Rees, Jonathan A. ;
Samwald, Matthias ;
Marshall, M. Scott .
BRIEFINGS IN BIOINFORMATICS, 2009, 10 (02) :193-204
[17]  
SIMMONS JQ, 1972, BEHAV NEUROPSYCHIAT, V3, P10
[18]  
Volavka Jan, 2003, Evid Based Ment Health, V6, P93
[19]   GORouter: an RDF model for providing semantic query and inference services for Gene Ontology and its associations [J].
Xu, Qingwei ;
Shi, Yixiang ;
Lu, Qiang ;
Zhang, Guoqing ;
Luo, Qingming ;
Li, Yixue .
BMC BIOINFORMATICS, 2008, 9 (Suppl 1)
[20]   WENDI: A tool for finding non-obvious relationships between compounds and biological properties, genes, diseases and scholarly publications [J].
Zhu, Qian ;
Lajiness, Michael S. ;
Ding, Ying ;
Wild, David J. .
JOURNAL OF CHEMINFORMATICS, 2010, 2