RuleHub: A Public Corpus of Rules for Knowledge Graphs

Cited by: 8
Authors
Ahmadi, Naser [1]
Thi-Thuy-Duyen Truong [1]
Le-Hong-Mai Dao [1]
Ortona, Stefano [2]
Papotti, Paolo [1]
Affiliations
[1] EURECOM, Biot, France
[2] Meltwater, London, England
Source
ACM JOURNAL OF DATA AND INFORMATION QUALITY | 2020 / Vol. 12 / Issue 04
Keywords
Rule mining; knowledge graphs; graph dependencies; DISCOVERY; WEB;
DOI
10.1145/3409384
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Entity-centric knowledge graphs (KGs) are now a popular way to collect facts about entities. KGs have rich schemas with a large number of different types and predicates to describe the entities and their relationships. On these rich schemas, logical rules are used to represent dependencies between the data elements. While rules are useful in query answering, data curation, and other tasks, they usually do not come with the KGs. Such rules have to be manually defined or discovered with the help of rule mining methods. We believe this rule-collection task should be done collectively to better capitalize on our understanding of the data and to avoid redundant work conducted on the same KGs. For this reason, we introduce RuleHub, our extensible corpus of rules for public KGs. RuleHub provides functionalities for the archival and the retrieval of rules to all users, with an extensible architecture that does not constrain the KG or the type of rules supported. We are populating the corpus with thousands of rules from the most popular KGs and report on our experiments on automatically characterizing the quality of a rule with statistical measures.
Pages: 22