VEG-MMKG: Multimodal knowledge graph construction for vegetables based on pre-trained model extraction

Citations: 0
Authors
Lv, Bowen [1, 2, 3, 4]
Wu, Huarui [1, 3, 4]
Chen, Wenbai [2]
Chen, Cheng [1]
Miao, Yisheng [1, 3, 4]
Zhao, Chunjiang [1]
Affiliations
[1] Natl Engn Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100192, Peoples R China
[3] Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Beijing 100097, Peoples R China
[4] Minist Agr & Rural Affairs, Key Lab Digital Village Technol, Beijing 100097, Peoples R China
Keywords
Knowledge graph; Multimodal fusion; Image-text pairs; Pre-trained model
DOI
10.1016/j.compag.2024.109398
Chinese Library Classification number
S [Agricultural Sciences]
Subject classification code
09
Abstract
Knowledge graph technology is of great significance to modern agricultural information management and data-driven decision support. However, agricultural knowledge takes many forms, and agricultural knowledge graph databases built from text alone do not support users' intuitive perception and comprehensive understanding of that knowledge. To address this, this paper proposes a method that uses a pre-trained language model to extract knowledge and construct an agricultural multimodal knowledge graph, taking two plants, cabbage and corn, as research objects. First, a two-stream text-image collaborative representation learning method combines the image-modality information of vegetables with the text-modality information, and the correlation and complementarity between the two are used to achieve entity alignment. Second, to handle the high similarity among vegetable entities within small categories, a cross-modal fine-grained contrastive learning method is introduced; contrastive learning between words and small image regions addresses the insufficient semantic association between modalities. Finally, a visual multimodal knowledge graph user interface is constructed from the image-text matching results. Experimental results show that the fine-tuned pre-trained model achieves an image-text matching efficiency of 76.7% on the vegetable dataset and can match appropriate images to text entities. The constructed visual multimodal knowledge graph database lets users query and filter knowledge according to their needs, supporting subsequent research on domain-specific applications such as multimodal agricultural intelligent question answering, crop pest and disease identification, and agricultural product recommendation.
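The abstract does not give the formulas behind the fine-grained matching, so the following is only an illustrative sketch of one common way to score word-to-region agreement: each word is matched to its most similar image region (and each region to its most similar word), and the two averages are combined. The function names, the toy 2-dimensional embeddings, and the temperature value are all assumptions made for illustration and are not taken from the paper.

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def fine_grained_similarity(word_vecs, region_vecs):
    """Hypothetical word/region score: best-match similarity, averaged both ways."""
    text_to_image = sum(
        max(cosine(w, r) for r in region_vecs) for w in word_vecs
    ) / len(word_vecs)
    image_to_text = sum(
        max(cosine(r, w) for w in word_vecs) for r in region_vecs
    ) / len(region_vecs)
    return 0.5 * (text_to_image + image_to_text)

def best_image(word_vecs, candidates):
    """Return the name of the candidate image that best matches a text entity."""
    return max(
        candidates,
        key=lambda name: fine_grained_similarity(word_vecs, candidates[name]),
    )

def contrastive_loss(sim_matrix, temperature=0.07):
    """Text-to-image InfoNCE loss; entry [i][j] scores text i against image j,
    and the matching pairs are assumed to lie on the diagonal."""
    total = 0.0
    for i, row in enumerate(sim_matrix):
        logits = [s / temperature for s in row]
        peak = max(logits)  # subtract the max for numerical stability
        log_sum = peak + math.log(sum(math.exp(x - peak) for x in logits))
        total += log_sum - logits[i]
    return total / len(sim_matrix)

# Toy example with made-up embeddings (assumption: 2 words, 2 regions per image).
words = [[1.0, 0.0], [0.0, 1.0]]
images = {
    "cabbage.jpg": [[1.0, 0.0], [0.0, 1.0]],
    "corn.jpg": [[1.0, 1.0], [1.0, 1.0]],
}
print(best_image(words, images))
```

In this sketch, lowering the loss pushes matched text-image pairs toward higher scores than mismatched ones, which is the general idea behind contrastive fine-tuning; the actual model, loss, and region definition used in the paper may differ.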
Pages: 13
Related papers
50 records in total
  • [41] Knowledge Graph Construction Based on a Joint Model for Equipment Maintenance
    Lou, Ping
    Yu, Dan
    Jiang, Xuemei
    Hu, Jiwei
    Zeng, Yuhang
    Fan, Chuannian
    [J]. MATHEMATICS, 2023, 11 (17)
  • [42] Research on cross-lingual multi-label patent classification based on pre-trained model
    Lu, Yonghe
    Chen, Lehua
    Tong, Xinyu
    Peng, Yongxin
    Zhu, Hou
    [J]. SCIENTOMETRICS, 2024, 129 (06) : 3067 - 3087
  • [43] PLPMpro: Enhancing promoter sequence prediction with prompt-learning based pre-trained language model
    Li, Zhongshen
    Jin, Junru
    Long, Wentao
    Wei, Leyi
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 164
  • [44] Web-FTP: A Feature Transferring-Based Pre-Trained Model for Web Attack Detection
    Guo, Zhenyu
    Shang, Qinghua
    Li, Xin
    Li, Chengyi
    Zhang, Zijian
    Zhang, Zhuo
    Hu, Jingjing
    An, Jincheng
    Huang, Chuanming
    Chen, Yang
    Cai, Yuguang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (03) : 1495 - 1507
  • [45] Methods Study of a Unified ABSA Generation Framework Based on Pre-trained Model Induced Dependency Tree
    Xu, Peiyan
    Jin, Guozhe
    Zhao, Yahui
    Cui, Rongyi
    [J]. 2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE, CCAI 2024, 2024, : 339 - 343
  • [46] Meta-ADD: A meta-learning based pre-trained model for concept drift active detection
    Yu, Hang
    Zhang, Qingyong
    Liu, Tianyu
    Lu, Jie
    Wen, Yimin
    Zhang, Guangquan
    [J]. INFORMATION SCIENCES, 2022, 608 : 996 - 1009
  • [47] The Construction of Knowledge Graphs in the Aviation Assembly Domain Based on a Joint Knowledge Extraction Model
    Liu, Peifeng
    Qian, Lu
    Zhao, Xingwei
    Tao, Bo
    [J]. IEEE ACCESS, 2023, 11 : 26483 - 26495
  • [48] Construction and Application of Feature Recommendation Model for Remote Sensing Interpretation of Rock Strata Based on Knowledge Graph
    Tao, Liufeng
    Wu, Qirui
    Tian, Miao
    Xie, Zhong
    Chen, Jianguo
    Wu, Yueyu
    Qiu, Qinjun
    [J]. REMOTE SENSING, 2025, 17 (06)
  • [49] An AI-enabled pre-trained model-based Covid detection model using chest X-ray images
    Gupta, Rajeev Kumar
    Kunhare, Nilesh
    Pathik, Nikhlesh
    Pathik, Babita
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (26) : 37351 - 37377