A Survey on Multimodal Knowledge Graphs: Construction, Completion and Applications

被引:14
作者
Chen, Yong [1 ]
Ge, Xinkai [2 ]
Yang, Shengli [3 ]
Hu, Linmei [4 ]
Li, Jie [2 ]
Zhang, Jinwen [5 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230052, Peoples R China
[2] Beijing Univ Posts & Telecommun, Inst Comp Sci, Beijing 100876, Peoples R China
[3] China Peoples Liberat Army Natl Def Univ, Natl Secur Sch, Beijing 100091, Peoples R China
[4] Beijing Inst Technol, Sch Comp Sci, Beijing 100811, Peoples R China
[5] North Automat Control Technol Inst, Taiyuan 030006, Peoples R China
关键词
multimodal knowledge graph; knowledge graph construction; knowledge graph completion; multimodal knowledge graph application; NAMED ENTITY RECOGNITION; TEXT; ALGORITHM; LINKING; MODEL; WEB;
D O I
10.3390/math11081815
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
As an essential part of artificial intelligence, a knowledge graph describes the real-world entities, concepts and their various semantic relationships in a structured way and has been gradually popularized in a variety practical scenarios. The majority of existing knowledge graphs mainly concentrate on organizing and managing textual knowledge in a structured representation, while paying little attention to the multimodal resources (e.g., pictures and videos), which can serve as the foundation for the machine perception of a real-world data scenario. To this end, in this survey, we comprehensively review the related advances of multimodal knowledge graphs, covering multimodal knowledge graph construction, completion and typical applications. For construction, we outline the methods of named entity recognition, relation extraction and event extraction. For completion, we discuss the multimodal knowledge graph representation learning and entity linking. Finally, the mainstream applications of multimodal knowledge graphs in miscellaneous domains are summarized.
引用
收藏
页数:27
相关论文
共 192 条
  • [61] Krishnan V, 2006, COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, P1121
  • [62] Lafferty J., 2001, P 18 INT C MACH LEAR, P282, DOI DOI 10.5555/645530.655813
  • [63] Lazic N., 2015, Transactions of the Association for Computational Linguistics, V3, P503, DOI [DOI 10.1162/TACL_A_00154, 10.1162/tacl_a_00154]
  • [64] Le P, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P4081
  • [65] Semantic Relation Classification via Bidirectional LSTM Networks with Entity-Aware Attention Using Latent Entity Typing
    Lee, Joohong
    Seo, Sangwoo
    Choi, Yong Suk
    [J]. SYMMETRY-BASEL, 2019, 11 (06):
  • [66] A Survey on Deep Learning for Named Entity Recognition
    Li, Jing
    Sun, Aixin
    Han, Jianglei
    Li, Chenliang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (01) : 50 - 70
  • [67] Li ML, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P2557
  • [68] Li Q, 2013, ANN M ASS COMP LING, V1, P73, DOI DOI 10.1021/BI00231A020
  • [69] A hybrid deep-learning approach for complex biochemical named entity recognition
    Liu, Jian
    Gao, Lei
    Guo, Sujie
    Ding, Rui
    Huang, Xin
    Ye, Long
    Meng, Qinghua
    Nazari, Asef
    Thiruvady, Dhananjay
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 221
  • [70] Liu TY, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2195