RuleHub: A Public Corpus of Rules for Knowledge Graphs

被引:9
作者
Ahmadi, Naser [1 ]
Thi-Thuy-Duyen Truong [1 ]
Le-Hong-Mai Dao [1 ]
Ortona, Stefano [2 ]
Papotti, Paolo [1 ]
机构
[1] EURECOM, Biot, France
[2] Meltwater, London, England
来源
ACM JOURNAL OF DATA AND INFORMATION QUALITY | 2020年 / 12卷 / 04期
关键词
Rule mining; knowledge graphs; graph dependencies; DISCOVERY; WEB;
D O I
10.1145/3409384
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Entity-centric knowledge graphs (KGs) are now popular to collect facts about entities. KGs have rich schemas with a large number of different types and predicates to describe the entities and their relationships. On these rich schemas, logical rules are used to represent dependencies between the data elements. While rules are useful in query answering, data curation, and other tasks, they usually do not come with the KGs. Such rules have to be manually defined or discovered with the help of rule mining methods. We believe this rule-collection task should be done collectively to better capitalize our understanding of the data and to avoid redundant work conducted on the same KGs. For this reason, we introduce RuleHub, our extensible corpus of rules for public KGs. RuleHub provides functionalities for the archival and the retrieval of rules to all users, with an extensible architecture that does not constrain the KG or the type of rules supported. We are populating the corpus with thousands of rules from the most popular KGs and report on our experiments on automatically characterizing the quality of a rule with statistical measures.
引用
收藏
页数:22
相关论文
共 42 条
[11]  
Carlson A, 2010, AAAI CONF ARTIF INTE, P1306
[12]   Ontological Pathfinding: Mining First-Order Knowledge from Large Knowledge Bases [J].
Chen, Yang ;
Goldberg, Sean ;
Wang, Daisy Zhe ;
Johri, Soumitra Siddharth .
SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, :835-846
[13]   Discovering Denial Constraints [J].
Chu, Xu ;
Ilyas, Ihab F. ;
Papotti, Paolo .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (13) :1498-1509
[14]   Discovery of frequent DATALOG patterns [J].
Dehaspe, L ;
Toivonen, H .
DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 3 (01) :7-36
[15]   Knowledge Vault: A Web-Scale Approach to Probabilistic Knowledge Fusion [J].
Dong, Xin Luna ;
Gabrilovich, Evgeniy ;
Heitz, Geremy ;
Horn, Wilko ;
Lao, Ni ;
Murphy, Kevin ;
Strohmann, Thomas ;
Sun, Shaohua ;
Zhang, Wei .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :601-610
[16]  
Fabian M., 2007, 16 INT WORLD WID WEB, P697
[17]   Discovering Graph Functional Dependencies [J].
Fan, Wenfei ;
Hu, Chunming ;
Liu, Xueli ;
Lu, Ping .
SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, :427-439
[18]   Building Watson: An Overview of the DeepQA Project [J].
Ferrucci, David ;
Brown, Eric ;
Chu-Carroll, Jennifer ;
Fan, James ;
Gondek, David ;
Kalyanpur, Aditya A. ;
Lally, Adam ;
Murdock, J. William ;
Nyberg, Eric ;
Prager, John ;
Schlaefer, Nico ;
Welty, Chris .
AI MAGAZINE, 2010, 31 (03) :59-79
[19]  
Gad-Elrab MohamedH., 2016, ISWC
[20]  
Galárraga L, 2015, VLDB J, V24, P707, DOI 10.1007/s00778-015-0394-1