A semantic similarity measure for linked data: An information content-based approach

Cited by: 63
Authors
Meymandpour, Rouzbeh [1]
Davis, Joseph G. [1]
Affiliations
[1] Univ Sydney, Sch Informat Technol, Sydney, NSW, Australia
Keywords
Semantic Web; Linked Data; Linked Open Data; Similarity measures; Semantic similarity; Information content; Ranking; Recommender systems; Collaborative filtering; Content-based filtering; MODEL; FEATURES; WORDNET
DOI
10.1016/j.knosys.2016.07.012
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Linked Data allows structured data to be published in a standard manner so that datasets from diverse domains can be interlinked. By leveraging Semantic Web standards and technologies, a growing amount of semantic content has been published on the Web as Linked Open Data (LOD). The LOD cloud has made available a large volume of structured data in a range of domains via liberal licenses. The semantic content of LOD in conjunction with the advanced searching and querying mechanisms provided by SPARQL has opened up unprecedented opportunities not only for enhancing existing applications, but also for developing new and innovative semantic applications. However, SPARQL is inadequate to deal with functionalities such as comparing, prioritizing, and ranking search results which are fundamental to applications such as recommendation provision, matchmaking, social network analysis, visualization, and data clustering. This paper addresses this problem by developing a systematic measurement model of semantic similarity between resources in Linked Data. By drawing extensively on a feature-based definition of Linked Data, it proposes a generalized information content-based approach that improves on previous methods which are typically restricted to specific knowledge representation models and less relevant in the context of Linked Data. It is validated and evaluated for measuring item similarity in recommender systems. The experimental evaluation of the proposed measure shows that our approach can outperform comparable recommender systems that use conventional similarity measures. (C) 2016 Elsevier B.V. All rights reserved.
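The abstract describes the measure only in general terms, so the following is a minimal illustrative sketch of what a feature-based, information content-weighted similarity can look like, not the paper's actual formulation. It assumes that each resource is reduced to a set of (property, value) features, that the information content of a feature is the negative log of the share of resources carrying it, and that similarity is the information content of shared features divided by that of all features of the pair. The toy data and the names `RESOURCES`, `feature_ic`, `ic_sum`, and `similarity` are hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy data: each resource is described by a set of features,
# here (property, value) pairs such as might be read from its RDF triples.
RESOURCES = {
    "film:A": {("director", "person:X"), ("genre", "Drama"), ("country", "AU")},
    "film:B": {("director", "person:X"), ("genre", "Drama"), ("country", "US")},
    "film:C": {("director", "person:Y"), ("genre", "Comedy"), ("country", "US")},
}


def feature_ic(resources):
    """Assumed definition: IC(f) = -log2(share of resources that carry f)."""
    n = len(resources)
    counts = Counter(f for feats in resources.values() for f in feats)
    return {f: -math.log2(c / n) for f, c in counts.items()}


def ic_sum(features, ic):
    """Total information content of a set of features."""
    return sum(ic[f] for f in features)


def similarity(a, b, resources, ic):
    """Shared IC divided by the IC of all features of the pair (assumed ratio form)."""
    fa, fb = resources[a], resources[b]
    shared = ic_sum(fa & fb, ic)    # features both resources have
    distinct = ic_sum(fa ^ fb, ic)  # features only one of them has
    total = shared + distinct
    return shared / total if total > 0 else 0.0


if __name__ == "__main__":
    ic = feature_ic(RESOURCES)
    for pair in [("film:A", "film:B"), ("film:B", "film:C"), ("film:A", "film:C")]:
        print(pair, round(similarity(*pair, RESOURCES, ic), 3))
```

In this toy data, film:A and film:B share a director and a genre and therefore score higher with each other than either does with film:C. Rare features weigh more than common ones, which is the intuition behind weighting by information content instead of counting matching features.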
Pages: 276-293
Page count: 18