Integrating Image-Based and Knowledge-Based Representation Learning

被引：7

作者：

Xie, Ruobing ^{[1
]}

Heinrich, Stefan ^{[2
]}

Liu, Zhiyuan ^{[3
,4
]}

Weber, Cornelius ^{[2
]}

Yao, Yuan ^{[3
,4
]}

Wermter, Stefan ^{[2
]}

Sun, Maosong ^{[3
,4
]}

机构：

[1] Tencent, WeChat Search Applicat Dept, Search Prod Ctr, Shenzhen 518000, Peoples R China

[2] Univ Hamburg, Knowledge Technol Grp, Dept Informat, D-22527 Hamburg, Germany

[3] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China

[4] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China

来源：

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS | 2020年 / 12卷 / 02期

基金：

美国国家科学基金会;

关键词：

Visualization; Knowledge representation; Brain modeling; Task analysis; Head; Knowledge based systems; Computational modeling; Attention mechanisms and development; embodied cognition; generation of representation during development;

D O I：

10.1109/TCDS.2019.2906685

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A variety of brain areas is involved in language understanding and generation, accounting for the scope of language that can refer to many real-world matters. In this paper, we investigate how regularities among real-world entities impact emergent language representations. Specifically, we consider knowledge bases, which represent entities and their relations as structured triples, and image representations, which are obtained via deep convolutional networks. We combine these sources of information to learn representations of an image-based knowledge representation learning (IKRL) model. An attention mechanism lets more informative images contribute more to the image-based representations. Evaluation results show that the model outperforms all baselines on the tasks of knowledge graph (KG) completion and triple classification. In analyzing the learned models, we found that the structure-based and image-based representations integrate different aspects of the entities and the attention mechanism provides robustness during learning.

引用

页码：169 / 178

页数：10

共 57 条

[1] [Anonymous], P ADV NEUR INF PROC
[2] [Anonymous], 2015, 3 INT C LEARN REPR I
[3] [Anonymous], PROC CVPR IEEE
[4] [Anonymous], 2015, DEV ROBOTICS BABIES
[5] [Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
[6] [Anonymous], CORR
[7] [Anonymous], CORR
[8] [Anonymous], 2014, Neural Information Processing Systems
[9] VQA: Visual Question Answering
Antol, Stanislaw
Agrawal, Aishwarya
Lu, Jiasen
Mitchell, Margaret
Batra, Dhruv
Zitnick, C. Lawrence
Parikh, Devi
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
[10] In defense of abstract conceptual representations
Binder, Jeffrey R.
[J]. PSYCHONOMIC BULLETIN & REVIEW, 2016, 23 (04) : 1096 - 1108

← 1 2 3 4 5 6 →