Detecting Identical Entities in the Semantic Web Data

Cited by: 0
Authors
Holub, Michal [1]
Proksa, Ondrej [1]
Bielikova, Maria [1]
Affiliations
[1] Slovak Univ Technol Bratislava, Inst Informat & Software Engn, Fac Informat & Informat Technol, Bratislava 84216, Slovakia
Source
SOFSEM 2015: THEORY AND PRACTICE OF COMPUTER SCIENCE | 2015 / Vol. 8939
Keywords
duplicates; identity; similarity; relationship; semantic web; owl:sameAs; Linked Data; web of data;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The large number of entities published by various sources inevitably introduces inaccuracies, mainly duplicated information. Duplicates can be found even within a single dataset. In this paper we propose a method for automatically discovering identity relationships between pairs of entities (also known as instance matching) in a dataset represented as a graph (e.g. in the Linked Data Cloud). The method can be used to clean duplicates from existing datasets, to validate existing identity relationships between entities within a dataset, or to connect different datasets using the owl:sameAs relationship. It is based on the analysis of sub-graphs formed by entities, their properties, and the existing relationships between them. It can learn a common similarity threshold for a particular dataset, so it adapts to that dataset's properties. We evaluated our method by conducting several experiments on data from the domains of public administration and digital libraries.
Pages: 519 - 530
Number of pages: 12
Related papers
50 in total
  • [41] From Overview to Facets and Pivoting for Interactive Exploration of Semantic Web Data
    Brunetti, Josep Maria
    Garcia, Roberto
    Auer, Soeren
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2013, 9 (01) : 1 - 20
  • [42] Semantic Web-based Aggregation of Indonesian Open Development Data
    Nugraha, Tubagus Andhika
    Gambetta, Windy
    2014 International Conference on Data and Software Engineering (ICODSE), 2014,
  • [43] Deriving Similarity Graphs from open Linked Data on Semantic Web
    Mi, Jinhua
    Chen, Huajun
    Lu, Bin
    Yu, Tong
    Pan, Gang
    PROCEEDINGS OF THE 2009 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2008, : 157 - 162
  • [44] Searching and browsing Linked Data with SWSE: The Semantic Web Search Engine
    Hogan, Aidan
    Harth, Andreas
    Umbrich, Juergen
    Kinsella, Sheila
    Polleres, Axel
    Decker, Stefan
    JOURNAL OF WEB SEMANTICS, 2011, 9 (04) : 365 - 401
  • [45] Positioning Library Data for the Semantic Web: Recent Developments in Resource Description
    Szeto, Kimmy
    JOURNAL OF WEB LIBRARIANSHIP, 2013, 7 (03) : 305 - 321
  • [46] A Semantic Web Approach in the implementation of a Linked Data Portal using a CMS
    Giannopoulou, Eleni
    Mitrou, Nikolas
    Chimos, Konstantinos
    Karvounidis, Theodoros
    Douligeris, Christos
    10TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY AND INTERNET-BASED SYSTEMS SITIS 2014, 2014, : 164 - 171
  • [47] Linked data: a new alphabet for the semantic web
    Guerrini, Mauro
    Possemato, Tiziana
    JLIS.IT, 2013, 4 (01) : 67 - 90
  • [48] The role of semantic web in the big data process
    Coneglian, Caio Saraiva
    Dieger, Rodrigo
    Santarem Segundo, Jose Eduardo
    Capretz, Miriam
    ENCONTROS BIBLI-REVISTA ELETRONICA DE BIBLIOTECONOMIA E CIENCIA DA INFORMACAO, 2018, 23 (53) : 138 - 147
  • [49] RDA and the Semantic Web, Linked Data Environment
    Tillett, Barbara
    JLIS.IT, 2013, 4 (01) : 139 - 145
  • [50] Semantic Web Techniques Meet Sensor Data
    Gimenez-Garcia, Jose M.
    INTELLIGENT ENVIRONMENTS 2018, 2018, 23 : 7 - 7