A Survey on Multimodal Knowledge Graphs: Construction, Completion and Applications

被引:14
作者
Chen, Yong [1 ]
Ge, Xinkai [2 ]
Yang, Shengli [3 ]
Hu, Linmei [4 ]
Li, Jie [2 ]
Zhang, Jinwen [5 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230052, Peoples R China
[2] Beijing Univ Posts & Telecommun, Inst Comp Sci, Beijing 100876, Peoples R China
[3] China Peoples Liberat Army Natl Def Univ, Natl Secur Sch, Beijing 100091, Peoples R China
[4] Beijing Inst Technol, Sch Comp Sci, Beijing 100811, Peoples R China
[5] North Automat Control Technol Inst, Taiyuan 030006, Peoples R China
关键词
multimodal knowledge graph; knowledge graph construction; knowledge graph completion; multimodal knowledge graph application; NAMED ENTITY RECOGNITION; TEXT; ALGORITHM; LINKING; MODEL; WEB;
D O I
10.3390/math11081815
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
As an essential part of artificial intelligence, a knowledge graph describes the real-world entities, concepts and their various semantic relationships in a structured way and has been gradually popularized in a variety practical scenarios. The majority of existing knowledge graphs mainly concentrate on organizing and managing textual knowledge in a structured representation, while paying little attention to the multimodal resources (e.g., pictures and videos), which can serve as the foundation for the machine perception of a real-world data scenario. To this end, in this survey, we comprehensively review the related advances of multimodal knowledge graphs, covering multimodal knowledge graph construction, completion and typical applications. For construction, we outline the methods of named entity recognition, relation extraction and event extraction. For completion, we discuss the multimodal knowledge graph representation learning and entity linking. Finally, the mainstream applications of multimodal knowledge graphs in miscellaneous domains are summarized.
引用
收藏
页数:27
相关论文
共 192 条
  • [1] Adjali Omar, 2020, Advances in Information Retrieval, 42nd European Conference on IR Research, ECIR 2020. Proceedings. Lecture Notes in Computer Science (LNCS 12035), P463, DOI 10.1007/978-3-030-45439-5_31
  • [2] Adjali O, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P4285
  • [3] Alberts Houda, 2021, P 1 WORKSH MULT REPR, P138, DOI DOI 10.18653/V1/2021.MRL-1.13
  • [4] Aliod D.M., 2006, P AUSTR LANG TECHN W, P51
  • [5] [Anonymous], 2015, P 2015 C EMP METH NA, DOI DOI 10.18653/V1/D15-1203
  • [6] [Anonymous], 2003, P 7 INT EAMT WORKSH
  • [7] Aone C., 1998, P 7 MESS UND C P C F
  • [8] Appelt Douglas E., 1995, Proceedings of the 6th conference on Message understanding, MUC6'95, P237, DOI DOI 10.3115/1072399.1072420
  • [9] Arshad Omer, 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR). Proceedings, P337, DOI 10.1109/ICDAR.2019.00061
  • [10] Asgari-Chenaghlu M, 2020, Arxiv, DOI arXiv:2001.06888