A Survey on Multimodal Knowledge Graphs: Construction, Completion and Applications

被引:14
作者
Chen, Yong [1 ]
Ge, Xinkai [2 ]
Yang, Shengli [3 ]
Hu, Linmei [4 ]
Li, Jie [2 ]
Zhang, Jinwen [5 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230052, Peoples R China
[2] Beijing Univ Posts & Telecommun, Inst Comp Sci, Beijing 100876, Peoples R China
[3] China Peoples Liberat Army Natl Def Univ, Natl Secur Sch, Beijing 100091, Peoples R China
[4] Beijing Inst Technol, Sch Comp Sci, Beijing 100811, Peoples R China
[5] North Automat Control Technol Inst, Taiyuan 030006, Peoples R China
关键词
multimodal knowledge graph; knowledge graph construction; knowledge graph completion; multimodal knowledge graph application; NAMED ENTITY RECOGNITION; TEXT; ALGORITHM; LINKING; MODEL; WEB;
D O I
10.3390/math11081815
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
As an essential part of artificial intelligence, a knowledge graph describes the real-world entities, concepts and their various semantic relationships in a structured way and has been gradually popularized in a variety practical scenarios. The majority of existing knowledge graphs mainly concentrate on organizing and managing textual knowledge in a structured representation, while paying little attention to the multimodal resources (e.g., pictures and videos), which can serve as the foundation for the machine perception of a real-world data scenario. To this end, in this survey, we comprehensively review the related advances of multimodal knowledge graphs, covering multimodal knowledge graph construction, completion and typical applications. For construction, we outline the methods of named entity recognition, relation extraction and event extraction. For completion, we discuss the multimodal knowledge graph representation learning and entity linking. Finally, the mainstream applications of multimodal knowledge graphs in miscellaneous domains are summarized.
引用
收藏
页数:27
相关论文
共 192 条
  • [101] CM-GANs: Cross-modal Generative Adversarial Networks for Common Representation Learning
    Peng, Yuxin
    Qi, Jinwei
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (01)
  • [102] DeepWalk: Online Learning of Social Representations
    Perozzi, Bryan
    Al-Rfou, Rami
    Skiena, Steven
    [J]. PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 701 - 710
  • [103] Pershina M., 2015, P 2015 C N AM CHAPTE, DOI [10.18653/v1/N18-1167, DOI 10.18653/V1/N18-1167]
  • [104] Peters ME, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P43
  • [105] Petkova D., 2007, P 16 ACM C INF KNOWL, P731, DOI DOI 10.1145/1321440.1321542
  • [106] Pezeshkpour P, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P3208
  • [107] Pingali S, 2021, Arxiv, DOI arXiv:2107.00596
  • [108] Quinlan J. R., 1986, Machine Learning, V1, P81, DOI 10.1023/A:1022643204877
  • [109] Radford A, 2021, PR MACH LEARN RES, V139
  • [110] Radhakrishnan P., 2018, P 2018 C N AM CHAPT, V1, P1844, DOI 10.18653/v1/N18-1167