A structure distinguishable graph attention network for knowledge base completion

被引:5
作者
Zhou, Xue [2 ]
Hui, Bei [3 ]
Zhang, Lizong [1 ,3 ]
Ji, Kexi [2 ]
机构
[1] 2006,Xiyuan Ave,West Hitech Zone, Chengdu 611731, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 611731, Peoples R China
[3] Univ Elect Sci & Technol China, Trusted Cloud Comp & Big Data Key Lab Sichuan Pro, Chengdu 611731, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Knowledge base completion; Smoothing problem; Graph attention network; Neighborhood aggregation scheme;
D O I
10.1007/s00521-021-06221-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A knowledge graph is a collection of triples, often represented in the form of "subject," "relation," "object." The task of knowledge graph completion (KGC) is to automatically predict missing links by reasoning over the information already present in the knowledge graph. Recent popularization of graph neural networks has also been spread to KGC. Typical techniques like SACN achieve dramatic achievements and beat previous state-of-the-art. However, those models still lack the ability to distinguish different local structures within a graph, which leads to the over smoothing problem. In this work, we propose SD-GAT, a graph attention network with a structure-distinguishable neighborhood aggregation scheme, which models the injective function to aggregate information from the neighborhood. The model is constituted of two modules. The encoder is a graph attention network that improved with our neighborhood aggregation scheme, which could be applied for a more distinct representation of entities and relations. The decoder is a convolutional neural network using 3 x 3 convolution filters. Our empirical research provides an effective solution to increase the discriminative power of graph attention networks, and we show significant improvement of the proposed SD-GAT compared to the state-of-the-art methods on standard FB15K-237 and WN18RR datasets.
引用
收藏
页码:16005 / 16017
页数:13
相关论文
共 41 条
  • [1] Balazevic I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P5185
  • [2] Bansal T, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P4387
  • [3] Bordes A., 2013, ADV NEURAL INFORM PR, V26, P2787, DOI DOI 10.5555/2999792.2999923
  • [4] A Private and Efficient Mechanism for Data Uploading in Smart Cyber-Physical Systems
    Cai, Zhipeng
    Zheng, Xu
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (02): : 766 - 775
  • [5] Carlson A, 2010, AAAI CONF ARTIF INTE, P1306
  • [6] Extracting Kernel Dataset from Big Sensory Data in Wireless Sensor Networks
    Cheng, Siyao
    Cai, Zhipeng
    Li, Jianzhong
    Gao, Hong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (04) : 813 - 827
  • [7] Choi E, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, P1311
  • [8] Dettmers T, 2018, AAAI CONF ARTIF INTE, P1811
  • [9] Gilmer Justin., 2017, Proceedings of the 34th International Conference on Machine Learning, V70, P1263
  • [10] APPROXIMATION CAPABILITIES OF MULTILAYER FEEDFORWARD NETWORKS
    HORNIK, K
    [J]. NEURAL NETWORKS, 1991, 4 (02) : 251 - 257