Domain-Independent Entity Coreference for Linking Ontology Instances

被引:13
|
作者
Song, Dezhao [1 ]
Heflin, Jeff [1 ]
机构
[1] Lehigh Univ, Dept Comp Sci & Engn, 19 Mem Dr West, Bethlehem, PA 18015 USA
来源
ACM JOURNAL OF DATA AND INFORMATION QUALITY | 2013年 / 4卷 / 02期
关键词
Algorithms; Experimentation; Theory; Entity coreference; semantic web; ontology; domain-independence; discriminability;
D O I
10.1145/2435221.2435223
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of entity coreference is to determine if different mentions (e.g., person names, place names, database records, ontology instances, etc.) refer to the same real word object. Entity coreference algorithms can be used to detect duplicate database records and to determine if two Semantic Web instances represent the same underlying real word entity. The key issues in developing an entity coreference algorithm include how to locate context information and how to utilize the context appropriately. In this article, we present a novel entity coreference algorithm for ontology instances. For scalability reasons, we select a neighborhood of each instance from an RDF graph. To determine the similarity between two instances, our algorithm computes the similarity between comparable property values in the neighborhood graphs. The similarity of distinct URIs and blank nodes is computed by comparing their outgoing links. In an attempt to reduce the impact of distant nodes on the final similarity measure, we explore a distance-based discounting approach. To provide the best possible domain-independent matches, we propose an approach to compute the discriminability of triples in order to assign weights to the context information. We evaluated our algorithm using different instance categories from five datasets. Our experiments show that the best results are achieved by including both our discounting and triple discrimination approaches.
引用
收藏
页数:29
相关论文
共 18 条
  • [1] Domain-independent data cleaning via analysis of entity-relationship graph
    Kalashnikov, Dmitri V.
    Mehrotra, Sharad
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2006, 31 (02): : 716 - 767
  • [2] A Domain-Independent Ontology Learning Method Based on Transfer Learning
    Xie, Kai
    Wang, Chao
    Wang, Peng
    ELECTRONICS, 2021, 10 (16)
  • [3] Linking Heterogeneous Data in the Semantic Web Using Scalable and Domain-Independent Candidate Selection
    Song, Dezhao
    Luo, Yi
    Heflin, Jeff
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (01) : 143 - 156
  • [4] DICON: A Domain-Independent Consent Management for Personal Data Protection
    Olca, Emre
    Can, Ozgu
    IEEE ACCESS, 2022, 10 : 95479 - 95497
  • [5] SYNTACTIC CHARACTERIZATION OF A SUBSET OF DOMAIN-INDEPENDENT FORMULAS
    DEMOLOMBE, R
    JOURNAL OF THE ACM, 1992, 39 (01) : 71 - 94
  • [6] Domain-independent planning for services in uncertain and dynamic environments
    Kaldeli, Eirini
    Lazovik, Alexander
    Aiello, Marco
    ARTIFICIAL INTELLIGENCE, 2016, 236 : 30 - 64
  • [7] Carbon: Domain-Independent Automatic Web Form Filling
    Araujo, Samur
    Gao, Qi
    Leonardi, Erwin
    Houben, Geert-Jan
    WEB ENGINEERING, 2010, 6189 : 292 - 306
  • [8] Automatically Generating Data Linkages Using a Domain-Independent Candidate Selection Approach
    Song, Dezhao
    Heflin, Jeff
    SEMANTIC WEB - ISWC 2011, PT I, 2011, 7031 : 649 - 664
  • [9] Context-sensitive domain-independent algorithm composition and selection
    Johnson, Troy A.
    Eigenmann, Rudolf
    ACM SIGPLAN NOTICES, 2006, 41 (06) : 181 - 192
  • [10] Integration of domain-specific and domain-independent ontologies for colonoscopy video database annotation
    Bao, J
    Cao, Y
    Tavanapong, W
    Honavar, V
    IKE '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGNINEERING, 2004, : 82 - 88