Rule-based data augmentation for knowledge graph embedding

被引:3
作者
Li, Guangyao
Sun, Zequn
Qian, Lei [1 ,2 ]
Guo, Qiang [1 ,2 ]
Hu, Wei [1 ,2 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] State Key Lab Math Engn & Adv Comp, Wuxi, Peoples R China
来源
AI OPEN | 2021年 / 2卷
基金
中国国家自然科学基金;
关键词
Knowledge graph embedding; Data augmentation; Logical rules;
D O I
10.1016/j.aiopen.2021.09.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge graph (KG) embedding models suffer from the incompleteness issue of observed facts. Different from existing solutions that incorporate additional information or employ expressive and complex embedding techniques, we propose to augment KGs by iteratively mining logical rules from the observed facts and then using the rules to generate new relational triples. We incrementally train KG embeddings with the coming of new augmented triples, and leverage the embeddings to validate these new triples. To guarantee the quality of the augmented data, we filter out the noisy triples based on a propagation mechanism during the validation. The mined rules and rule groundings are human -understandable, and can make the augmentation procedure reliable. Our KG augmentation framework is applicable to any KG embedding models with no need to modify their embedding techniques. Our experiments on two popular embedding -based tasks (i.e., entity alignment and link prediction) show that the proposed framework can bring significant improvement to existing KG embedding models on most benchmark datasets.
引用
收藏
页码:186 / 196
页数:11
相关论文
共 43 条
  • [31] Trouillon T, 2016, PR MACH LEARN RES, V48
  • [32] Vashishth S., 2020, Composition-based multirelational graph convolutional networks
  • [33] Velickovic P., 2018, P 6 INT C LEARN REPR
  • [34] Wang Z, 2014, AAAI CONF ARTIF INTE, P1112
  • [35] Wang ZC, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P349
  • [36] Xie Q., 2020, ADV NEURAL INF PROCE, V33, P6256
  • [37] Xie Z., 2019, Data Noising as Smoothing in Neural Network Language Models
  • [38] Xu K, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P3156
  • [39] Yang B., 2014, 3 INT C LEARNING REP
  • [40] Iteratively Learning Embeddings and Rules for Knowledge Graph Reasoning
    Zhang, Wen
    Paudel, Bibek
    Wang, Liang
    Chen, Jiaoyan
    Zhu, Hai
    Zhang, Wei
    Bernstein, Abraham
    Chen, Huajun
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 2366 - 2377