An Approach to Knowledge Base Completion by a Committee-Based Knowledge Graph Embedding

被引:6
作者
Choi, Su Jeong [1 ]
Song, Hyun-Je [2 ]
Park, Seong-Bae [3 ]
机构
[1] KT, Inst Convergence Technol, 151 Taebong Ro, Seoul 06763, South Korea
[2] Jeonbuk Natl Univ, Dept Informat Technol, 567 Baekje Daero, Jeonju Si 54896, Jeollabuk Do, South Korea
[3] Kyung Hee Univ, Dept Comp Sci & Engn, 1732 Deogyeong Daero, Yongin 17104, Gyeonggi Do, South Korea
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 08期
基金
新加坡国家研究基金会;
关键词
knowledge base completion; knowledge graph construction; knowledge graph embedding; committee machine;
D O I
10.3390/app10082651
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Knowledge bases such as Freebase, YAGO, DBPedia, and Nell contain a number of facts with various entities and relations. Since they store many facts, they are regarded as core resources for many natural language processing tasks. Nevertheless, they are not normally complete and have many missing facts. Such missing facts keep them from being used in diverse applications in spite of their usefulness. Therefore, it is significant to complete knowledge bases. Knowledge graph embedding is one of the promising approaches to completing a knowledge base and thus many variants of knowledge graph embedding have been proposed. It maps all entities and relations in knowledge base onto a low dimensional vector space. Then, candidate facts that are plausible in the space are determined as missing facts. However, any single knowledge graph embedding is insufficient to complete a knowledge base. As a solution to this problem, this paper defines knowledge base completion as a ranking task and proposes a committee-based knowledge graph embedding model for improving the performance of knowledge base completion. Since each knowledge graph embedding has its own idiosyncrasy, we make up a committee of various knowledge graph embeddings to reflect various perspectives. After ranking all candidate facts according to their plausibility computed by the committee, the top-k facts are chosen as missing facts. Our experimental results on two data sets show that the proposed model achieves higher performance than any single knowledge graph embedding and shows robust performances regardless of k. These results prove that the proposed model considers various perspectives in measuring the plausibility of candidate facts.
引用
收藏
页数:12
相关论文
共 39 条
[1]  
[Anonymous], 2011, P 28 INT C INT C MAC
[2]  
Bollacker Kurt, 2008, P 2008 ACM SIGMOD IN, P1247, DOI DOI 10.1145/1376616.1376746
[3]  
Bordes A., 2013, P ANN C NEUR INF PRO, P2787, DOI DOI 10.5555/2999792.2999923
[4]   A semantic matching energy function for learning with multi-relational data Application to word-sense disambiguation [J].
Bordes, Antoine ;
Glorot, Xavier ;
Weston, Jason ;
Bengio, Yoshua .
MACHINE LEARNING, 2014, 94 (02) :233-259
[5]  
Bordes Antoine, 2011, AAAI, P301, DOI DOI 10.1609/AAAI.V25I1.7917
[6]  
Carlson A, 2010, AAAI CONF ARTIF INTE, P1306
[7]  
Choi SJ, 2016, IEEE C EVOL COMPUTAT, P4958, DOI 10.1109/CEC.2016.7744426
[8]  
Fan M., 2014, P 28 PAC AS C LANG I, P328
[9]   Distributed representation learning for knowledge graphs with entity descriptions [J].
Fan, Miao ;
Zhou, Qiang ;
Zheng, Thomas Fang ;
Grishman, Ralph .
PATTERN RECOGNITION LETTERS, 2017, 93 :31-37
[10]  
Fellbaum C., 1998, WORDNET ELECT LEXICA