Integrating Image-Based and Knowledge-Based Representation Learning

被引:7
作者
Xie, Ruobing [1 ]
Heinrich, Stefan [2 ]
Liu, Zhiyuan [3 ,4 ]
Weber, Cornelius [2 ]
Yao, Yuan [3 ,4 ]
Wermter, Stefan [2 ]
Sun, Maosong [3 ,4 ]
机构
[1] Tencent, WeChat Search Applicat Dept, Search Prod Ctr, Shenzhen 518000, Peoples R China
[2] Univ Hamburg, Knowledge Technol Grp, Dept Informat, D-22527 Hamburg, Germany
[3] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[4] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
基金
美国国家科学基金会;
关键词
Visualization; Knowledge representation; Brain modeling; Task analysis; Head; Knowledge based systems; Computational modeling; Attention mechanisms and development; embodied cognition; generation of representation during development;
D O I
10.1109/TCDS.2019.2906685
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A variety of brain areas is involved in language understanding and generation, accounting for the scope of language that can refer to many real-world matters. In this paper, we investigate how regularities among real-world entities impact emergent language representations. Specifically, we consider knowledge bases, which represent entities and their relations as structured triples, and image representations, which are obtained via deep convolutional networks. We combine these sources of information to learn representations of an image-based knowledge representation learning (IKRL) model. An attention mechanism lets more informative images contribute more to the image-based representations. Evaluation results show that the model outperforms all baselines on the tasks of knowledge graph (KG) completion and triple classification. In analyzing the learned models, we found that the structure-based and image-based representations integrate different aspects of the entities and the attention mechanism provides robustness during learning.
引用
收藏
页码:169 / 178
页数:10
相关论文
共 57 条
  • [1] [Anonymous], P ADV NEUR INF PROC
  • [2] [Anonymous], 2015, 3 INT C LEARN REPR I
  • [3] [Anonymous], PROC CVPR IEEE
  • [4] [Anonymous], 2015, DEV ROBOTICS BABIES
  • [5] [Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
  • [6] [Anonymous], CORR
  • [7] [Anonymous], CORR
  • [8] [Anonymous], 2014, Neural Information Processing Systems
  • [9] VQA: Visual Question Answering
    Antol, Stanislaw
    Agrawal, Aishwarya
    Lu, Jiasen
    Mitchell, Margaret
    Batra, Dhruv
    Zitnick, C. Lawrence
    Parikh, Devi
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
  • [10] In defense of abstract conceptual representations
    Binder, Jeffrey R.
    [J]. PSYCHONOMIC BULLETIN & REVIEW, 2016, 23 (04) : 1096 - 1108