Visual large language model for wheat disease diagnosis in the wild

被引:1
作者
Zhang, Kunpeng [1 ,2 ]
Ma, Li [1 ]
Cui, Beibei [1 ]
Li, Xin [1 ]
Zhang, Boqiang [3 ]
Xie, Na [4 ]
机构
[1] Henan Univ Technol, Coll Elect Engn, Zhengzhou 450001, Peoples R China
[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[3] Henan Univ Technol, Coll Mech Engn, Zhengzhou 450001, Peoples R China
[4] Cent Univ Finance & Econ, Sch Management Sci & Engn, Beijing 100081, Peoples R China
关键词
Plant disease; Wheat disease diagnosis; Wheat disease classification; Large language model; Explainable AI;
D O I
10.1016/j.compag.2024.109587
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Early detection of symptoms in wheat plants is crucial for mitigating disease effects and preventing their spread. Prompt phytosanitary treatment minimizes yield losses and enhances treatment efficacy. In recent years, numerous image analysis-based methodologies for automatic disease identification have been developed, with Convolutional Neural Networks (CNNs) achieving notable success in visual classification tasks. The existing methods often lack the necessary intelligence and reasoning for real-world applications. This study introduces an advanced wheat disease diagnosis approach using a Visual Language Model (VLM), named the Wheat Disease Language Model (WDLM). The WDLM first leverages the modified Segment Anything Model (SAM) to isolate key wheat features from complex wild environments. To enhance the logical reasoning abilities, the WDLM integrates a reasoning chain to generate clear, reasoned explanations for its diagnosis. By employing dedicated prompt engineering, this study establishes the Wheat Disease Semantic Dataset (WDSD) to fine-tune the VLM. The WDSD, which includes a diverse set of wheat images from various sources, bridges the gap between advanced VLM technology and wheat pathology. Tailored with task-specific data, the WDLM demonstrates superior intelligence by providing accurate classification of wheat diseases and suggesting potential treatment options. Compared to CNN-based models, Transformer-based models, and other VLMs, the WDLM shows improved performance in various scenarios. Integrated with mobile applications, the WDLM approach is readily applicable in the field, representing a promising advancement in the intelligent diagnosis of wheat diseases.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Scalable Mentoring Support with a Large Language Model Chatbot
    Soliman, Hassan
    Kravcik, Milos
    Neumann, Alexander Tobias
    Yin, Yue
    Pengel, Norbert
    Haag, Maike
    TECHNOLOGY ENHANCED LEARNING FOR INCLUSIVE AND EQUITABLE QUALITY EDUCATION, PT II, EC-TEL 2024, 2024, 15160 : 260 - 266
  • [32] Large Language Model Powered Agents for Information Retrieval
    Zhang, An
    Deng, Yang
    Lin, Yankai
    Chen, Xu
    Wen, Ji-Rong
    Chua, Tat-Seng
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2989 - 2992
  • [33] Large Language Model for Geometric Algebra: A Preliminary Attempt
    Wang, Jian
    Wang, Ziqiang
    Wang, Han
    Luo, Wen
    Yuan, Linwang
    Lu, Guonian
    Yu, Zhaoyuan
    ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT IV, 2024, 14498 : 237 - 249
  • [34] Comparison of Large Language Models in Diagnosis and Management of Challenging Clinical Cases
    Shanmugam, Sujeeth Krishna
    Browning, David J.
    CLINICAL OPHTHALMOLOGY, 2024, 18 : 3239 - 3247
  • [35] A survey on large language model based autonomous agents
    WANG Lei
    MA Chen
    FENG Xueyang
    ZHANG Zeyu
    YANG Hao
    ZHANG Jingsen
    CHEN Zhiyuan
    TANG Jiakai
    CHEN Xu
    LIN Yankai
    ZHAO Wayne Xin
    WEI Zhewei
    WEN Jirong
    Frontiers of Computer Science, 2024, 18 (06)
  • [36] A survey on large language model based autonomous agents
    Lei Wang
    Chen Ma
    Xueyang Feng
    Zeyu Zhang
    Hao Yang
    Jingsen Zhang
    Zhiyuan Chen
    Jiakai Tang
    Xu Chen
    Yankai Lin
    Wayne Xin Zhao
    Zhewei Wei
    Jirong Wen
    Frontiers of Computer Science, 2024, 18
  • [37] Kgent: Kernel Extensions Large Language Model Agent
    Zheng, Yusheng
    Yang, Yiwei
    Chen, Maolin
    Quinn, Andrew
    PROCEEDINGS OF THE ACM SIGCOMM 2024 WORKSHOP ON EBPF AND KERNEL EXTENSIONS, EBPF 2024, 2024, : 33 - 39
  • [38] Evaluating a Large Language Model on Searching for GUI Layouts
    Brie P.
    Burny N.
    Sluyters A.
    Vanderdonckt J.
    Proceedings of the ACM on Human-Computer Interaction, 2023, 7 (EICS)
  • [39] Explainable Fault Diagnosis of Control Systems Using Large Language Models
    Ojuolape, Adewumi Emmanuel
    Hu, Shanfeng
    2024 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS, CCTA 2024, 2024, : 491 - 498
  • [40] Research on Psychological Test based on Large Language Model
    Liu, Zhengzheng
    Li, Xinying
    Kang, Yunfeng
    2024 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, ARTIFICIAL INTELLIGENCE AND INTELLIGENT CONTROL, RAIIC 2024, 2024, : 503 - 510