Visual large language model for wheat disease diagnosis in the wild

被引:1
作者
Zhang, Kunpeng [1 ,2 ]
Ma, Li [1 ]
Cui, Beibei [1 ]
Li, Xin [1 ]
Zhang, Boqiang [3 ]
Xie, Na [4 ]
机构
[1] Henan Univ Technol, Coll Elect Engn, Zhengzhou 450001, Peoples R China
[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[3] Henan Univ Technol, Coll Mech Engn, Zhengzhou 450001, Peoples R China
[4] Cent Univ Finance & Econ, Sch Management Sci & Engn, Beijing 100081, Peoples R China
关键词
Plant disease; Wheat disease diagnosis; Wheat disease classification; Large language model; Explainable AI;
D O I
10.1016/j.compag.2024.109587
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Early detection of symptoms in wheat plants is crucial for mitigating disease effects and preventing their spread. Prompt phytosanitary treatment minimizes yield losses and enhances treatment efficacy. In recent years, numerous image analysis-based methodologies for automatic disease identification have been developed, with Convolutional Neural Networks (CNNs) achieving notable success in visual classification tasks. The existing methods often lack the necessary intelligence and reasoning for real-world applications. This study introduces an advanced wheat disease diagnosis approach using a Visual Language Model (VLM), named the Wheat Disease Language Model (WDLM). The WDLM first leverages the modified Segment Anything Model (SAM) to isolate key wheat features from complex wild environments. To enhance the logical reasoning abilities, the WDLM integrates a reasoning chain to generate clear, reasoned explanations for its diagnosis. By employing dedicated prompt engineering, this study establishes the Wheat Disease Semantic Dataset (WDSD) to fine-tune the VLM. The WDSD, which includes a diverse set of wheat images from various sources, bridges the gap between advanced VLM technology and wheat pathology. Tailored with task-specific data, the WDLM demonstrates superior intelligence by providing accurate classification of wheat diseases and suggesting potential treatment options. Compared to CNN-based models, Transformer-based models, and other VLMs, the WDLM shows improved performance in various scenarios. Integrated with mobile applications, the WDLM approach is readily applicable in the field, representing a promising advancement in the intelligent diagnosis of wheat diseases.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Large language model for patent concept generation
    Ren, Runtao
    Ma, Jian
    Luo, Jianxi
    ADVANCED ENGINEERING INFORMATICS, 2025, 65
  • [22] Generating Simulated Data with a Large Language Model
    Kerley, Jeffrey
    Anderson, Derek T.
    Buck, Andrew R.
    Alvey, Brendan
    SYNTHETIC DATA FOR ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING: TOOLS, TECHNIQUES, AND APPLICATIONS II, 2024, 13035
  • [23] DrugAssist: a large language model for molecule optimization
    Ye, Geyan
    Cai, Xibao
    Lai, Houtim
    Wang, Xing
    Huang, Junhong
    Wang, Longyue
    Liu, Wei
    Zeng, Xiangxiang
    BRIEFINGS IN BIOINFORMATICS, 2025, 26 (01)
  • [24] Applying Object Detection and Large Language Model to Establish a Smart Telemedicine Diagnosis System with Chatbot: A Case Study of Pressure Injuries Diagnosis System
    Chen, Chun-Chia
    Wei, Chia-Jung
    Tseng, Tsung-Yu
    Chiu, Ming-Chuan
    Chang, Chi-Chang
    TELEMEDICINE AND E-HEALTH, 2024, 30 (06) : e1705 - e1712
  • [25] Exploring Vision Language Pretraining with Knowledge Enhancement via Large Language Model
    Tung, Chuenyuet
    Lin, Yi
    Yin, Jianing
    Ye, Qiaoyuchen
    Chen, Hao
    TRUSTWORTHY ARTIFICIAL INTELLIGENCE FOR HEALTHCARE, TAI4H 2024, 2024, 14812 : 81 - 91
  • [26] GalaxyGPT: A Hybrid Framework for Large Language Model Safety
    Zhou, Hange
    Zheng, Jiabin
    Zhang, Longtu
    IEEE ACCESS, 2024, 12 : 94436 - 94451
  • [27] Large Language Model as Unsupervised Health Information Retriever
    Jiang, Keyuan
    Mujtaba, Mohammed M.
    Bernard, Gordon R.
    CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023, 2023, 302 : 833 - 834
  • [28] Large language model empowered smart city mobility
    Chen, Yong
    Zhang, Haoyu
    Li, Chuanjia
    Chi, Ben
    Chen, Xiqun
    Wu, Jianjun
    FRONTIERS OF ENGINEERING MANAGEMENT, 2025, 12 (01) : 201 - 207
  • [29] BiomedRAG: A retrieval augmented large language model for biomedicine
    Li, Mingchen
    Kilicoglu, Halil
    Xu, Hua
    Zhang, Rui
    JOURNAL OF BIOMEDICAL INFORMATICS, 2025, 162
  • [30] A survey on large language model based autonomous agents
    Wang, Lei
    Ma, Chen
    Feng, Xueyang
    Zhang, Zeyu
    Yang, Hao
    Zhang, Jingsen
    Chen, Zhiyuan
    Tang, Jiakai
    Chen, Xu
    Lin, Yankai
    Zhao, Wayne Xin
    Wei, Zhewei
    Wen, Jirong
    FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (06)