Anytime bottom-up rule learning for large-scale knowledge graph completion

被引:5
作者
Meilicke, Christian [1 ]
Chekol, Melisachew Wudage [2 ]
Betz, Patrick [1 ]
Fink, Manuel [1 ]
Stuckeschmidt, Heiner [1 ]
机构
[1] Univ Mannheim, Mannheim, Germany
[2] Univ Utrecht, Utrecht, Netherlands
关键词
Knowledge graph completion; Link prediction; Rule learning;
D O I
10.1007/s00778-023-00800-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Knowledge graph completion is the task of predicting correct facts that can be expressed by the vocabulary of a given knowledge graph, which are not explicitly stated in that graph. Broadly, there are two main approaches for solving the knowledge graph completion problem. Sub-symbolic approaches embed the nodes and/or edges of a given graph into a low-dimensional vector space and use a scoring function to determine the plausibility of a given fact. Symbolic approaches learn a model that remains within the primary representation of the given knowledge graph. Rule-based approaches are well-known examples. One such approach is AnyBURL. It works by sampling random paths, which are generalized into Horn rules. Previously published results show that the prediction quality of AnyBURL is close to current state of the art with the additional benefit of offering an explanation for a predicted fact. In this paper, we propose several improvements and extensions of AnyBURL. In particular, we focus on AnyBURL's capability to be successfully applied to large and very large datasets. Overall, we propose four separate extensions: (i) We add to each rule a set of pairwise inequality constraints which enforces that different variables cannot be grounded by the same entities, which results into more appropriate confidence estimations. (ii) We introduce reinforcement learning to guide path sampling in order to use available computational resources more efficiently. (iii) We propose an efficient sampling strategy to approximate the confidence of a rule instead of computing its exact value. (iv) We develop a new multithreaded AnyBURL, which incorporates all previously mentioned modifications. In an experimental study, we show that our approach outperforms both symbolic and sub-symbolic approaches in large-scale knowledge graph completion. It has a higher prediction quality and requires significantly less time and computational resources.
引用
收藏
页码:131 / 161
页数:31
相关论文
共 67 条
  • [1] [Anonymous], 1996, P MLNET FAM WORKSH D
  • [2] DBpedia: A nucleus for a web of open data
    Auer, Soeren
    Bizer, Christian
    Kobilarov, Georgi
    Lehmann, Jens
    Cyganiak, Richard
    Ives, Zachary
    [J]. SEMANTIC WEB, PROCEEDINGS, 2007, 4825 : 722 - +
  • [3] Graph Isomorphism in Quasipolynomial Time [Extended Abstract]
    Babai, Laszlo
    [J]. STOC'16: PROCEEDINGS OF THE 48TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2016, : 684 - 697
  • [4] Improving Visual Relationship Detection Using Semantic Modeling of Scene Descriptions
    Baier, Stephan
    Ma, Yunpu
    Tresp, Volker
    [J]. SEMANTIC WEB - ISWC 2017, PT I, 2017, 10587 : 53 - 68
  • [5] Balazevic I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P5185
  • [6] Betz P, 2022, PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, P2820
  • [7] Supervised Knowledge Aggregation for Knowledge Graph Completion
    Betz, Patrick
    Meilicke, Christian
    Stuckenschmidt, Heiner
    [J]. SEMANTIC WEB, ESWC 2022, 2022, 13261 : 74 - 92
  • [8] Bollacker K., 2008, P ACM SIGMOD INT C M, P1247, DOI DOI 10.5555/1619797.1619981
  • [9] Bordes A, 2013, Proceedings of NIPS, P26
  • [10] A semantic matching energy function for learning with multi-relational data Application to word-sense disambiguation
    Bordes, Antoine
    Glorot, Xavier
    Weston, Jason
    Bengio, Yoshua
    [J]. MACHINE LEARNING, 2014, 94 (02) : 233 - 259