Anytime bottom-up rule learning for large-scale knowledge graph completion

被引：5

作者：

Meilicke, Christian ^{[1
]}

Chekol, Melisachew Wudage ^{[2
]}

Betz, Patrick ^{[1
]}

Fink, Manuel ^{[1
]}

Stuckeschmidt, Heiner ^{[1
]}

机构：

[1] Univ Mannheim, Mannheim, Germany

[2] Univ Utrecht, Utrecht, Netherlands

来源：

VLDB JOURNAL | 2024年 / 33卷 / 01期

关键词：

Knowledge graph completion; Link prediction; Rule learning;

D O I：

10.1007/s00778-023-00800-5

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Knowledge graph completion is the task of predicting correct facts that can be expressed by the vocabulary of a given knowledge graph, which are not explicitly stated in that graph. Broadly, there are two main approaches for solving the knowledge graph completion problem. Sub-symbolic approaches embed the nodes and/or edges of a given graph into a low-dimensional vector space and use a scoring function to determine the plausibility of a given fact. Symbolic approaches learn a model that remains within the primary representation of the given knowledge graph. Rule-based approaches are well-known examples. One such approach is AnyBURL. It works by sampling random paths, which are generalized into Horn rules. Previously published results show that the prediction quality of AnyBURL is close to current state of the art with the additional benefit of offering an explanation for a predicted fact. In this paper, we propose several improvements and extensions of AnyBURL. In particular, we focus on AnyBURL's capability to be successfully applied to large and very large datasets. Overall, we propose four separate extensions: (i) We add to each rule a set of pairwise inequality constraints which enforces that different variables cannot be grounded by the same entities, which results into more appropriate confidence estimations. (ii) We introduce reinforcement learning to guide path sampling in order to use available computational resources more efficiently. (iii) We propose an efficient sampling strategy to approximate the confidence of a rule instead of computing its exact value. (iv) We develop a new multithreaded AnyBURL, which incorporates all previously mentioned modifications. In an experimental study, we show that our approach outperforms both symbolic and sub-symbolic approaches in large-scale knowledge graph completion. It has a higher prediction quality and requires significantly less time and computational resources.

引用

页码：131 / 161

页数：31

共 67 条

[1] [Anonymous], 1996, P MLNET FAM WORKSH D
[2] DBpedia: A nucleus for a web of open data
Auer, Soeren
Bizer, Christian
Kobilarov, Georgi
Lehmann, Jens
Cyganiak, Richard
Ives, Zachary
[J]. SEMANTIC WEB, PROCEEDINGS, 2007, 4825 : 722 - +
[3] Graph Isomorphism in Quasipolynomial Time [Extended Abstract]
Babai, Laszlo
[J]. STOC'16: PROCEEDINGS OF THE 48TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2016, : 684 - 697
[4] Improving Visual Relationship Detection Using Semantic Modeling of Scene Descriptions
Baier, Stephan
Ma, Yunpu
Tresp, Volker
[J]. SEMANTIC WEB - ISWC 2017, PT I, 2017, 10587 : 53 - 68
[5] Balazevic I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P5185
[6] Betz P, 2022, PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, P2820
[7] Supervised Knowledge Aggregation for Knowledge Graph Completion
Betz, Patrick
Meilicke, Christian
Stuckenschmidt, Heiner
[J]. SEMANTIC WEB, ESWC 2022, 2022, 13261 : 74 - 92
[8] Bollacker K., 2008, P ACM SIGMOD INT C M, P1247, DOI DOI 10.5555/1619797.1619981
[9] Bordes A, 2013, Proceedings of NIPS, P26
[10] A semantic matching energy function for learning with multi-relational data Application to word-sense disambiguation
Bordes, Antoine
Glorot, Xavier
Weston, Jason
Bengio, Yoshua
[J]. MACHINE LEARNING, 2014, 94 (02) : 233 - 259

← 1 2 3 4 5 6 7 →