VEG-MMKG: Multimodal knowledge graph construction for vegetables based on pre-trained model extraction

被引:0
作者
Lv, Bowen [1 ,2 ,3 ,4 ]
Wu, Huarui [1 ,3 ,4 ]
Chen, Wenbai [2 ]
Chen, Cheng [1 ]
Miao, Yisheng [1 ,3 ,4 ]
Zhao, Chunjiang [1 ]
机构
[1] Natl Engn Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100192, Peoples R China
[3] Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Beijing 100097, Peoples R China
[4] Minist Agr & Rural Affairs, Key Lab Digital Village Technol, Beijing 100097, Peoples R China
关键词
Knowledge graph; Multimodal fusion; Image-text pairs; Pre-trained model;
D O I
10.1016/j.compag.2024.109398
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Knowledge graph technology is of great significance to modern agricultural information management and datadriven decision support. However, agricultural knowledge is rich in types, and agricultural knowledge graph databases built only based on text are not conducive to users' intuitive perception and comprehensive understanding of knowledge. In view of this, this paper proposes a solution to extract knowledge and construct an agricultural multimodal knowledge graph using a pre-trained language model. This paper takes two plants, cabbage and corn, as research objects. First, a text-image collaborative representation learning method with a two-stream structure is adopted to combine the image modal information of vegetables with the text modal information, and the correlation and complementarity between the two types of information are used to achieve entity alignment. In addition, in order to solve the problem of high similarity of vegetable entities in small categories, a cross-modal fine-grained contrastive learning method is introduced, and the problem of insufficient semantic association between modalities is solved by contrastive learning of vocabulary and small areas of images. Finally, a visual multimodal knowledge graph user interface is constructed using the results of image and text matching. Experimental results show that the image and text matching efficiency of the fine-tuned pretrained model on the vegetable dataset is 76.7%, and appropriate images can be matched for text entities. The constructed visual multimodal knowledge graph database allows users to query and filter knowledge according to their needs, providing assistance for subsequent research on various applications in specific fields such as multimodal agricultural intelligent question and answer, crop pest and disease identification, and agricultural product recommendations.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Pre-Trained Model-Based NFR Classification: Overcoming Limited Data Challenges
    Rahman, Kiramat
    Ghani, Anwar
    Alzahrani, Abdulrahman
    Tariq, Muhammad Usman
    Rahman, Arif Ur
    IEEE ACCESS, 2023, 11 : 81787 - 81802
  • [32] SSMFRP: Semantic Similarity Model for Relation Prediction in KBQA Based on Pre-trained Models
    Wang, Ziming
    Xu, Xirong
    Li, Xinzi
    Song, Xiaoying
    Wei, Xiaopeng
    Huang, Degen
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 294 - 306
  • [33] Millitary Knowledge Graph Construction Based on Universal Information Extraction Models
    Miao Yongfei
    Zhang Yihang
    Wang Li
    Song Xiaoxue
    Song Yuze
    Tang Zekun
    2024 10TH INTERNATIONAL CONFERENCE ON BIG DATA AND INFORMATION ANALYTICS, BIGDIA 2024, 2024, : 877 - 881
  • [34] A Method for Judicial Case Knowledge Graph Construction Based on Event Extraction
    Zhao, Bang
    Zhao, Yilong
    Mao, Ying
    PROCEEDINGS OF THE 2024 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION TECHNOLOGY, ICIIT 2024, 2024, : 62 - 69
  • [35] Research on Dual-adversarial MR Image Fusion Network Using Pre-trained Model for Feature Extraction
    Liu H.
    Li S.-S.
    Gao S.-S.
    Deng K.
    Xu G.
    Zhang C.-M.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (05): : 2134 - 2151
  • [36] Missing monitoring data reconstruction for cable-stayed bridge using knowledge transfer-based generative pre-trained model
    Zhang, Minte
    Guo, Tong
    Zhang, Guodong
    Liu, Zhongxiang
    Liu, Yajie
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2025,
  • [37] Pre-Trained Model-Based Automated Software Vulnerability Repair: How Far are We?
    Zhang, Quanjun
    Fang, Chunrong
    Yu, Bowen
    Sun, Weisong
    Zhang, Tongke
    Chen, Zhenyu
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2024, 21 (04) : 2507 - 2525
  • [38] Pre-trained Model-based Software Defect Prediction for Edge-cloud Systems
    Kwon, Sunjae
    Lee, Sungu
    Ryu, Duksan
    Baik, Jongmoon
    JOURNAL OF WEB ENGINEERING, 2023, 22 (02): : 255 - 278
  • [39] Entity Recognition for Chinese Hazardous Chemical Accident Data Based on Rules and a Pre-Trained Model
    Dai, Hui
    Zhu, Mu
    Yuan, Guan
    Niu, Yaowei
    Shi, Hongxing
    Chen, Boxuan
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [40] Migratable urban street scene sensing method based on vision language pre-trained model
    Zhang, Yan
    Zhang, Fan
    Chen, Nengcheng
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 113