Detecting Identical Entities in the Semantic Web Data

被引:0
|
作者
Holub, Michal [1 ]
Proksa, Ondrej [1 ]
Bielikova, Maria [1 ]
机构
[1] Slovak Univ Technol Bratislava, Inst Informat & Software Engn, Fac Informat & Informat Technol, Bratislava 84216, Slovakia
来源
SOFSEM 2015: THEORY AND PRACTICE OF COMPUTER SCIENCE | 2015年 / 8939卷
关键词
duplicates; identity; similarity; relationship; semantic web; owl:sameAs; Linked Data; web of data;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large amount of entities published by various sources inevitably introduces inaccuracies, mainly duplicated information. These can even be found within a single dataset. In this paper we propose a method for automatic discovery of identity relationship between two entities (also known as instance matching) in a dataset represented as a graph (e.g. in the Linked Data Cloud). Our method can be used for cleaning existing datasets from duplicates, validating of existing identity relationships between entities within a dataset, or for connecting different datasets using the owl:sameAs relationship. Our method is based on the analysis of sub-graphs formed by entities, their properties and existing relationships between them. It can learn a common similarity threshold for particular dataset, so it is adaptable to its different properties. We evaluated our method by conducting several experiments on data from the domains of public administration and digital libraries.
引用
收藏
页码:519 / 530
页数:12
相关论文
共 50 条
  • [31] Proposal for Extending New Linked Data Rules for the Semantic Web
    Martinez Tomas, Rafael
    Criado Fernandez, Luis
    FOUNDATIONS ON NATURAL AND ARTIFICIAL COMPUTATION: 4TH INTERNATIONAL WORK-CONFERENCE ON THE INTERPLAY BETWEEN NATURAL AND ARTIFICIAL COMPUTATION, IWINAC 2011, PART I, 2011, 6686 : 531 - 539
  • [33] Linked Data Query Wizard: A Tabular Interface for the Semantic Web
    Hoefler, Patrick
    Granitzer, Michael
    Sabol, Vedran
    Lindstaedt, Stefanie
    SEMANTIC WEB: ESWC 2013 SATELLITE EVENTS, 2013, 7955 : 173 - 177
  • [34] Audiovisual resources in the Data Web: The construction of the Semantic Audiovisual Portal
    Coneglian, Caio Saraiva
    Simionato Arakaki, Ana Carolina
    Ventura Amorim Goncalez, Paula Regina
    Santarem Segundo, Jose Eduardo
    TRANSINFORMACAO, 2019, 31
  • [35] Detecting Similar Areas of Knowledge Using Semantic and Data Mining Technologies
    Sumba, Xavier
    Sumba, Freddy
    Tello, Andres
    Baculima, Fernando
    Espinoza, Mauricio
    Saquicela, Victor
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2016, 329 (329) : 149 - 167
  • [36] Simplifying Semantic Web application development and semantic data usage
    Rico, Mariano
    AI COMMUNICATIONS, 2010, 23 (01) : 65 - 66
  • [37] A unified approach to retrieving web documents and Semantic Web data
    Immaneni, Trivilcram
    Thirunarayan, Krishnaprasad
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2007, 4519 : 579 - +
  • [38] Linking In-Game Events and Entities to Social Data on the Web
    Sacco, Owen
    Dabrowski, Maciej
    Breslin, John G.
    2012 IEEE INTERNATIONAL GAMES INNOVATION CONFERENCE (IGIC), 2012, : 78 - +
  • [39] Semantic Enrichment of Web Data for the Provision of an Unified Data Repository of Brazilian Missing Persons
    Gomes, Jorao, Jr.
    Ferranti, Nicolas
    de Souza, Jairo Francisco
    PROCEEDINGS OF THE XV BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS, SBSI 2019: Complexity on Modern Information Systems, 2019,
  • [40] Moodle Meets Linked Data: Publishing Moodle on the Web of Data Using Semantic Links
    Mosharraf, Maedeh
    Taghiyareh, Fattaneh
    2018 4TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2018, : 6 - 11