EABlock: A Declarative Entity Alignment Block for Knowledge Graph Creation Pipelines

Cited by: 4
Authors
Jozashoori, Samaneh [1]
Sakor, Ahmad [1]
Iglesias, Enrique [2]
Vidal, Maria-Esther [1]
Affiliations
[1] Leibniz Univ Hannover, TIB Leibniz Informat Ctr Sci & Technol, Hannover, Germany
[2] Leibniz Univ Hannover, L3S Res Ctr, Hannover, Germany
Source
37th Annual ACM Symposium on Applied Computing | 2022
Keywords
Knowledge Graph Creation; Semantic Data Integration; Entity Alignment; Mapping Rules; Functional Mappings;
DOI
10.1145/3477314.3507132
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Despite encoding an enormous amount of rich and valuable data, existing data sources are mostly created independently, which poses a significant challenge to their integration. Mapping languages, e.g., RML and R2RML, facilitate the declarative specification of the process of applying meta-data and integrating data into a knowledge graph. In addition to expressing correspondences between data sources and a unified schema, mapping rules can also include knowledge extraction functions. Combining mapping rules and functions yields a powerful formalism for transparently specifying pipelines that integrate data into a knowledge graph. Surprisingly, these formalisms are not yet fully adopted, and many knowledge graphs are created by executing ad-hoc programs to pre-process and integrate data. In this paper, we present EABlock, an approach that integrates Entity Alignment (EA) into RML mapping rules. EABlock includes a block of functions that perform entity recognition on textual attributes and link the recognized entities to the corresponding resources in Wikidata, DBpedia, and domain-specific thesauri, e.g., UMLS. EABlock provides engine-agnostic and efficient techniques to evaluate the functions and transfer the mappings, which facilitates its application in any RML-compliant engine. We have empirically evaluated the performance of EABlock, and the results indicate that EABlock speeds up knowledge graph creation pipelines that require entity recognition and linking in state-of-the-art RML-compliant engines. EABlock is also publicly available as a tool through a GitHub repository and a DOI.
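The idea of calling an entity alignment function from inside a mapping rule can be illustrated with a minimal Python sketch. This is not EABlock's implementation or API: the lookup table `TOY_LINKS`, the functions `link_entity` and `apply_mapping`, and all `example.org` IRIs are hypothetical names introduced here for illustration, and a simple substring match stands in for a real entity recognition and linking service.

```python
from typing import Optional

# Hypothetical lookup table standing in for a real entity linking service.
# The label-to-IRI pairs are illustrative placeholders, not real identifiers.
TOY_LINKS = {
    "aspirin": "http://example.org/resource/Aspirin",
    "lung cancer": "http://example.org/resource/Lung_cancer",
}


def link_entity(text: str) -> Optional[str]:
    """Return the IRI of the first known label found in the text, or None."""
    normalized = text.lower()
    for label, iri in TOY_LINKS.items():
        if label in normalized:
            return iri
    return None


def apply_mapping(rows, subject_template, predicate, text_column):
    """Emit one (subject, predicate, object) triple per row whose text
    attribute can be linked; rows without a linked entity are skipped."""
    triples = []
    for row in rows:
        iri = link_entity(row[text_column])
        if iri is None:
            continue
        subject = subject_template.format(**row)
        triples.append((subject, predicate, iri))
    return triples


rows = [
    {"id": "1", "note": "Patient treated with Aspirin daily"},
    {"id": "2", "note": "No known condition"},
]
triples = apply_mapping(
    rows,
    subject_template="http://example.org/patient/{id}",
    predicate="http://example.org/vocab/mentions",
    text_column="note",
)
```

In this sketch the mapping stays declarative in spirit: the subject template, predicate, and source column are parameters, while the linking step is a function applied to a textual attribute, so no separate pre-processing program is needed.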
Pages: 1908-1916
Page count: 9