Rule-based data augmentation for knowledge graph embedding

被引:3
作者
Li, Guangyao
Sun, Zequn
Qian, Lei [1 ,2 ]
Guo, Qiang [1 ,2 ]
Hu, Wei [1 ,2 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] State Key Lab Math Engn & Adv Comp, Wuxi, Peoples R China
来源
AI OPEN | 2021年 / 2卷
基金
中国国家自然科学基金;
关键词
Knowledge graph embedding; Data augmentation; Logical rules;
D O I
10.1016/j.aiopen.2021.09.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge graph (KG) embedding models suffer from the incompleteness issue of observed facts. Different from existing solutions that incorporate additional information or employ expressive and complex embedding techniques, we propose to augment KGs by iteratively mining logical rules from the observed facts and then using the rules to generate new relational triples. We incrementally train KG embeddings with the coming of new augmented triples, and leverage the embeddings to validate these new triples. To guarantee the quality of the augmented data, we filter out the noisy triples based on a propagation mechanism during the validation. The mined rules and rule groundings are human -understandable, and can make the augmentation procedure reliable. Our KG augmentation framework is applicable to any KG embedding models with no need to modify their embedding techniques. Our experiments on two popular embedding -based tasks (i.e., entity alignment and link prediction) show that the proposed framework can bring significant improvement to existing KG embedding models on most benchmark datasets.
引用
收藏
页码:186 / 196
页数:11
相关论文
共 43 条
  • [1] Balazevic I., 2019, Tensor Factorization for Knowledge Graph Completion, P5184
  • [2] Bordes A, 2013, Advances in neural information processing systems, DOI DOI 10.5555/2999792.2999923
  • [3] Open Knowledge Enrichment for Long-tail Entities
    Cao, Ermei
    Wang, Difeng
    Huang, Jiacheng
    Hu, Wei
    [J]. WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 384 - 394
  • [4] Cao YX, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P1452
  • [5] Chen MH, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3998
  • [6] Chen MH, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1511
  • [7] Dettmers T, 2018, AAAI CONF ARTIF INTE, P1811
  • [8] Ding BY, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, P110
  • [9] Galarraga L. A., 2013, WWW, P413
  • [10] Predicting Completeness in Knowledge Bases
    Galarraga, Luis
    Razniewski, Simon
    Amarilli, Antoine
    Suchanek, Fabian M.
    [J]. WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 375 - 383