COnto-Diff: generation of complex evolution mappings for life science ontologies

被引:42
作者
Hartung, Michael [1 ]
Gross, Anika
Rahm, Erhard
机构
[1] Univ Leipzig, Dept Comp Sci, D-04009 Leipzig, Germany
关键词
Ontology evolution; Ontology versions; Diff; Life science ontologies; OBO; TOOL;
D O I
10.1016/j.jbi.2012.04.009
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Life science ontologies evolve frequently to meet new requirements or to better reflect the current domain knowledge. The development and adaptation of large and complex ontologies is typically performed collaboratively by several curators. To effectively manage the evolution of ontologies it is essential to identify the difference (Diff) between ontology versions. Such a Diff supports the synchronization of changes in collaborative curation, the adaptation of dependent data such as annotations, and ontology version management. We propose a novel approach COnto-Diff to determine an expressive and invertible diff evolution mapping between given versions of an ontology. Our approach first matches the ontology versions and determines an initial evolution mapping consisting of basic change operations (insert/update/delete). To semantically enrich the evolution mapping we adopt a rule-based approach to transform the basic change operations into a smaller set of more complex change operations, such as merge, split, or changes of entire subgraphs. The proposed algorithm is customizable in different ways to meet the requirements of diverse ontologies and application scenarios. We evaluate the proposed approach for large life science ontologies including the Gene Ontology and the NCI Thesaurus and compare it with PromptDiff. We further show how the Diff results can be used for version management and annotation migration in collaborative curation. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:15 / 32
页数:18
相关论文
共 47 条
[1]  
[Anonymous], 2007, Ontology matching, DOI 10.1007/978-3-540-49612-0
[2]  
[Anonymous], OBO FLAT FILE FORMAT
[3]  
[Anonymous], 2 INT C BIOM ONT ICB
[4]  
[Anonymous], P ISWC
[5]  
[Anonymous], 2005, P 2005 ACM SIGMOD IN
[6]  
[Anonymous], [No title captured]
[7]  
[Anonymous], P ISWC
[8]  
[Anonymous], AMIA ANN S P
[9]  
[Anonymous], P AMIA ANN S
[10]  
[Anonymous], P 2007 ACM SIGMOD IN