Visual large language model for wheat disease diagnosis in the wild

被引:1
作者
Zhang, Kunpeng [1 ,2 ]
Ma, Li [1 ]
Cui, Beibei [1 ]
Li, Xin [1 ]
Zhang, Boqiang [3 ]
Xie, Na [4 ]
机构
[1] Henan Univ Technol, Coll Elect Engn, Zhengzhou 450001, Peoples R China
[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[3] Henan Univ Technol, Coll Mech Engn, Zhengzhou 450001, Peoples R China
[4] Cent Univ Finance & Econ, Sch Management Sci & Engn, Beijing 100081, Peoples R China
关键词
Plant disease; Wheat disease diagnosis; Wheat disease classification; Large language model; Explainable AI;
D O I
10.1016/j.compag.2024.109587
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Early detection of symptoms in wheat plants is crucial for mitigating disease effects and preventing their spread. Prompt phytosanitary treatment minimizes yield losses and enhances treatment efficacy. In recent years, numerous image analysis-based methodologies for automatic disease identification have been developed, with Convolutional Neural Networks (CNNs) achieving notable success in visual classification tasks. The existing methods often lack the necessary intelligence and reasoning for real-world applications. This study introduces an advanced wheat disease diagnosis approach using a Visual Language Model (VLM), named the Wheat Disease Language Model (WDLM). The WDLM first leverages the modified Segment Anything Model (SAM) to isolate key wheat features from complex wild environments. To enhance the logical reasoning abilities, the WDLM integrates a reasoning chain to generate clear, reasoned explanations for its diagnosis. By employing dedicated prompt engineering, this study establishes the Wheat Disease Semantic Dataset (WDSD) to fine-tune the VLM. The WDSD, which includes a diverse set of wheat images from various sources, bridges the gap between advanced VLM technology and wheat pathology. Tailored with task-specific data, the WDLM demonstrates superior intelligence by providing accurate classification of wheat diseases and suggesting potential treatment options. Compared to CNN-based models, Transformer-based models, and other VLMs, the WDLM shows improved performance in various scenarios. Integrated with mobile applications, the WDLM approach is readily applicable in the field, representing a promising advancement in the intelligent diagnosis of wheat diseases.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] A hybrid framework with large language models for rare disease phenotyping
    Wu, Jinge
    Dong, Hang
    Li, Zexi
    Wang, Haowei
    Li, Runci
    Patra, Arijit
    Dai, Chengliang
    Ali, Waqar
    Scordis, Phil
    Wu, Honghan
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [42] Large Language Model Inference Acceleration Based on Hybrid Model Branch Prediction
    Duan, Gaoxiang
    Chen, Jiajie
    Zhou, Yueying
    Zheng, Xiaoying
    Zhu, Yongxin
    ELECTRONICS, 2024, 13 (07)
  • [43] LLM-TSFD: An industrial time series human-in-the-loop fault diagnosis method based on a large language model
    Zhang, Qi
    Xu, Chao
    Li, Jie
    Sun, Yicheng
    Bao, Jinsong
    Zhang, Dan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [44] Multi large language model collaboration framework for few-shot link prediction in evolutionary fault diagnosis event graphs
    Wang, Tian
    Wang, Ping
    Yang, Feng
    Wang, Shuai
    Fang, Qiang
    Chi, Meng
    JOURNAL OF PROCESS CONTROL, 2025, 145
  • [45] DB-GPT: Large Language Model Meets Database
    Zhou, Xuanhe
    Sun, Zhaoyan
    Li, Guoliang
    DATA SCIENCE AND ENGINEERING, 2024, 9 (01) : 102 - 111
  • [46] LLMcap: Large Language Model for Unsupervised PCAP Failure Detection
    Tulczyjew, Lukasz
    Jarrah, Kinan
    Abondo, Charles
    Bennett, Dina
    Weill, Nathanael
    2024 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS, ICC WORKSHOPS 2024, 2024, : 1559 - 1565
  • [47] TCMChat: A generative large language model for traditional Chinese medicine
    Dai, Yizheng
    Shao, Xin
    Zhang, Jinlu
    Chen, Yulong
    Chen, Qian
    Liao, Jie
    Chi, Fei
    Zhang, Junhua
    Fan, Xiaohui
    PHARMACOLOGICAL RESEARCH, 2024, 210
  • [48] DeBERTa-GRU: Sentiment Analysis for Large Language Model
    Assiri, Adel
    Gumaei, Abdu
    Mehmood, Faisal
    Abbas, Touqeer
    Ullah, Sami
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (03): : 4219 - 4236
  • [49] Mystery Game Script Compose Based on a Large Language Model
    Li, Jiameng
    Chen, Zhen
    Lin, Weiran
    Zou, Liangjun
    Xie, Xin
    Hu, Yaodong
    Li, Dianmo
    2024 IEEE 5TH ANNUAL WORLD AI IOT CONGRESS, AIIOT 2024, 2024, : 0451 - 0455
  • [50] Ontology extension by online clustering with large language model agents
    Wu, Guanchen
    Ling, Chen
    Graetz, Ilana
    Zhao, Liang
    FRONTIERS IN BIG DATA, 2024, 7