AN ANALYSIS OF MULTIPLE SIMILARITY MEASURES FOR ONTOLOGY MAPPING PROBLEM

被引:8
作者
Ichise, Ryutaro [1 ]
机构
[1] Natl Inst Informat, Principles Informat Res Div, Chiyodo Ku, 2-1-2 Hitotsubashi, Tokyo 1018430, Japan
关键词
Semantic integration; ontology mapping; semantic web; machine learning; data mining;
D O I
10.1142/S1793351X1000095X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an analysis of similarity measures for the ontology mapping problem. To that end, 48 similarity measures such as string matching and knowledge based similarities that have been widely used in ontology mapping systems are defined. The similarity measures are investigated by discriminant analysis with a real-world data set. As a result, it was possible to identify 22 effective similarity measures for the ontology mapping problem out of 48 possible similarity measures. The identified measures have a wide variety in the type of similarity. To test whether the identified similarity measures are effective for the problem, experiments were conducted with all 48 similarity measures and the 22 identified similarity measures by using two major machine learning methods, decision tree and support vector machine. The experimental results show that the performance of the 48 cases and the 22 cases is almost the same regardless of the machine learning method. This implies that effective features for the ontology mapping problem were successfully identified.
引用
收藏
页码:103 / 122
页数:20
相关论文
共 32 条
  • [1] Aoki S., 2009, BLACK BOX DATA ANAL
  • [2] The Semantic Web - A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities
    Berners-Lee, T
    Hendler, J
    Lassila, O
    [J]. SCIENTIFIC AMERICAN, 2001, 284 (05) : 34 - +
  • [3] Caraciolo C, 2008, PROC 3 ISWC WORKSHOP, P73
  • [4] Choi N, 2006, SIGMOD REC, V35, P34, DOI 10.1145/1168092.1168097
  • [5] Cristianini N, 2000, INTRO SUPPORT VECTOR
  • [6] Doan A, 2003, VLDB J, V12, P303, DOI [10.1007/s00778-003-0104-2, 10.1007/S00778-003-0104-2]
  • [7] Duda R. O., 2001, PATTERN CLASSIFICATI, V2nd
  • [8] Eckert K, 2009, LECT NOTES COMPUT SC, V5554, P158, DOI 10.1007/978-3-642-02121-3_15
  • [9] Ehrig M, 2005, LECT NOTES COMPUT SC, V3729, P186, DOI 10.1007/11574620_16
  • [10] Euzenat J., 2004, P 3 INT WORKSH EV ON, P56