GORouter: an RDF model for providing semantic query and inference services for Gene Ontology and its associations

被引:8
作者
Xu, Qingwei [1 ]
Shi, Yixiang [2 ,3 ]
Lu, Qiang [1 ]
Zhang, Guoqing [2 ]
Luo, Qingming [1 ]
Li, Yixue [2 ]
机构
[1] Huazhong Univ Sci & Technol, Key Lab Biomed Photon Minist Educ, Wuhan 430074, Peoples R China
[2] Shanghai Ctr Bioinformat Technol, Shanghai 200235, Peoples R China
[3] Chinese Acad Sci, Shanghai Inst Biol Sci, Key Lab Syst Biol, Bioinformat Ctr, Shanghai 200031, Peoples R China
关键词
D O I
10.1186/1471-2105-9-S1-S6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The most renowned biological ontology, Gene Ontology (GO) is widely used for annotations of genes and gene products of different organisms. However, there are shortcomings in the Resource Description Framework (RDF) data file provided by the GO consortium: 1) Lack of sufficient semantic relationships between pairs of terms coming from the three independent GO sub-ontologies, that limit the power to provide complex semantic queries and inference services based on it. 2) The term-centric view of GO annotation data and the fact that all information is stored in a single file. This makes attempts to retrieve GO annotations based on big volume datasets unmanageable. 3) No support of GOSlim. Results: We propose a RDF model, GORouter, which encodes heterogeneous original data in a uniform RDF format, creates additional ontology mappings between GO terms, and introduces a set of inference rulebases. Furthermore, we use the Oracle Network Data Model (NDM) as the native RDF data repository and the table function RDF_ MATCH to seamlessly combine the result of RDF queries with traditional relational data. As a result, the scale of GORouter is minimized; information not directly involved in semantic inference is put into relational tables. Conclusion: Our work demonstrates how to use multiple semantic web tools and techniques to provide a mixture of semantic query and inference solutions of GO and its associations. GORouter is licensed under Apache License Version 2.0, and is accessible via the website: http://www.scbit.org/gorouter/.
引用
收藏
页数:11
相关论文
共 40 条
[1]   The genome sequence of Drosophila melanogaster [J].
Adams, MD ;
Celniker, SE ;
Holt, RA ;
Evans, CA ;
Gocayne, JD ;
Amanatides, PG ;
Scherer, SE ;
Li, PW ;
Hoskins, RA ;
Galle, RF ;
George, RA ;
Lewis, SE ;
Richards, S ;
Ashburner, M ;
Henderson, SN ;
Sutton, GG ;
Wortman, JR ;
Yandell, MD ;
Zhang, Q ;
Chen, LX ;
Brandon, RC ;
Rogers, YHC ;
Blazej, RG ;
Champe, M ;
Pfeiffer, BD ;
Wan, KH ;
Doyle, C ;
Baxter, EG ;
Helt, G ;
Nelson, CR ;
Miklos, GLG ;
Abril, JF ;
Agbayani, A ;
An, HJ ;
Andrews-Pfannkoch, C ;
Baldwin, D ;
Ballew, RM ;
Basu, A ;
Baxendale, J ;
Bayraktaroglu, L ;
Beasley, EM ;
Beeson, KY ;
Benos, PV ;
Berman, BP ;
Bhandari, D ;
Bolshakov, S ;
Borkova, D ;
Botchan, MR ;
Bouck, J ;
Brokstein, P .
SCIENCE, 2000, 287 (5461) :2185-2195
[2]   COBrA: a bio-ontology editor [J].
Aitken, S ;
Korf, R ;
Webber, B ;
Bard, J .
BIOINFORMATICS, 2005, 21 (06) :825-826
[3]  
ALEXANDER SRN, 2005, RDF OBJECT TYPE REIF
[4]  
[Anonymous], 2002, WIDE WORLD SMALL HOM, DOI DOI 10.1145/511531.511532
[5]   Understanding and using the meaning of statements in a bio-ontology: recasting the Gene Ontology in OWL [J].
Aranguren, Mikel Egana ;
Bechhofer, Sean ;
Lord, Phillip ;
Sattler, Ulrike ;
Stevens, Robert .
BMC BIOINFORMATICS, 2007, 8 (1)
[6]   Enrichment of OBO ontologies [J].
Bada, Michael ;
Hunter, Lawrence .
JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (03) :300-315
[7]  
Bada N, 2004, SIGMOD REC, V33, P27, DOI 10.1145/1024694.1024699
[8]   Ontologies in biology: Design, applications and future challenges [J].
Bard, JBL ;
Rhee, SY .
NATURE REVIEWS GENETICS, 2004, 5 (03) :213-222
[9]  
BELLEAU F, 2007, BIO2RDF MASHUP BUILD
[10]  
BERNERSLEE JHT, 2001, SCI AM MAGAZINE