A semantic similarity measure based on information distance for ontology alignment

被引:35
作者
Jiang, Yong [1 ]
Wang, Xinmin [1 ]
Zheng, Hai-Tao [1 ]
机构
[1] Tsinghua Univ, Grad Sch Shenzhen, Tsinghua Southampton Web Sci Lab Shenzhen, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Ontology alignment; Semantic measure; Link weight; Information distance; Normalized Google distance;
D O I
10.1016/j.ins.2014.03.021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ontology alignment is the key point to reach interoperability over ontologies. In semantic web environment, ontologies are usually distributed and heterogeneous and thus it is necessary to find the alignment between them before processing across them. Many efforts have been conducted to automate the alignment by discovering the correspondence between entities of ontologies. However, some problems are still obvious, and the most crucial one is that it is almost impossible to extract semantic meaning of a lexical label that denotes the entity by traditional methods. In this paper, ontology alignment is formalized as a problem of information distance metric. In this way, discovery of optimal alignment is cast as finding out the correspondences with minimal information distance. We demonstrate a novel measure named link weight that uses semantic characteristics of two entities and Google page count to calculate an information distance similarity between them. The experimental results show that our method is able to create alignments between different lexical entities that denotes the same ones. These results outperform the typical ontology alignment methods like PROMPT (Noy and Musen, 2000) [38], QOM (Ehrig and Staab, 2004) [12], and APFEL (Ehrig et al., 2005) [13] in terms of semantic precision and recall. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:76 / 87
页数:12
相关论文
共 44 条
  • [1] [Anonymous], 1989, Kolmogorov Complexity and Its Applications
  • [2] Information distance
    Bennett, CH
    Gacs, P
    Li, M
    Vitanyi, FMB
    Zurek, WH
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (04) : 1407 - 1423
  • [3] Discrete particle swarm optimisation for ontology alignment
    Bock, Juergen
    Hettenhausen, Jan
    [J]. INFORMATION SCIENCES, 2012, 192 : 152 - 173
  • [4] Boddy M., 1991, AAAI-91. Proceedings Ninth National Conference on Artificial Intelligence, P738
  • [5] Automatic keyword prediction using Google similarity distance
    Chen, Ping-I
    Lin, Shi-Jen
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (03) : 1928 - 1938
  • [6] Cilibrasi R., 2004, AUTOMATIC MEAN UNPUB
  • [7] The Google similarity distance
    Cilibrasi, Rudi L.
    Vitanyi, Paul M. B.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (03) : 370 - 383
  • [8] Doan A., 2004, HDB ONTOLOGIES INFOS, P385, DOI [DOI 10.1007/978-3-540-24750-0, 10.1007/978-3-540-24750-0]
  • [9] DuyHoa Ngo, 2012, Knowledge Engineering and Knowledge Management. 18th International Conference, EKAW 2012. Proceedings, P421, DOI 10.1007/978-3-642-33876-2_38
  • [10] Ehrig M, 2005, LECT NOTES COMPUT SC, V3729, P186, DOI 10.1007/11574620_16