Crop Disease Diagnosis with Deep Learning-Based Image Captioning and Object Detection

Cited by: 10
Authors
Lee, Dong In [1]
Lee, Ji Hwan [2]
Jang, Seung Ho [3]
Oh, Se Jong [4]
Doo, Ill Chul [4]
Affiliations
[1] Hankuk Univ Foreign Studies, Comp & Elect Syst Engn, Yongin 17035, South Korea
[2] Hankuk Univ Foreign Studies, Artificial Intelligence Convergence, Yongin 17035, South Korea
[3] Hankuk Univ Foreign Studies, Stat, Yongin 17035, South Korea
[4] Hankuk Univ Foreign Studies, Artificial Intelligence Educ, Yongin 17035, South Korea
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 05
Funding
National Research Foundation, Singapore;
Keywords
crop diseases diagnosis; farm-tech; deep learning; Inceptionv3; transformer; image captioning; YOLOv5; object detection;
DOI
10.3390/app13053148
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The number of people participating in urban farming, and the size of its market, have been increasing recently. However, technologies that assist novice farmers remain limited. Several deep learning-based crop disease diagnosis solutions have been studied previously, but these techniques focus only on CNN-based disease detection and do not explain how disease symptoms vary with severity. To prevent diseases from spreading through crops, it is important to identify the characteristics of these symptoms in advance and respond as soon as possible. We therefore propose an improved crop disease diagnosis solution that gives practical help to novice farmers. The proposed solution combines two representative deep learning-based methods: Image Captioning and Object Detection. The Image Captioning model names the disease accurately and describes its prominent symptoms in detail, according to severity, by generating diagnostic sentences that are grammatically correct and semantically comprehensible. Meanwhile, the Object Detection model detects the infected area, helping farmers recognize which part is damaged and assuring them of the accuracy of the diagnostic sentence generated by the Image Captioning model. The Image Captioning model employs InceptionV3 as the encoder and a Transformer as the decoder, while the Object Detection model employs YOLOv5. The average BLEU score of the Image Captioning model is 64.96%, which can be considered high sentence-generation performance, whereas the mAP50 of the Object Detection model is 0.382, which requires further improvement. These results indicate that the proposed solution provides precise and detailed information about crop diseases, thereby increasing the overall reliability of the diagnosis.
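The abstract reports two metrics, BLEU for the generated diagnostic sentences and mAP50 for the detected regions. The following is a minimal sketch of the quantities behind them, not the authors' evaluation code: `sentence_bleu` is an unsmoothed, single-reference BLEU with uniform weights up to 4-grams, and `iou` is the box overlap that mAP50 thresholds at 0.5 to decide whether a detection counts as correct. The function names, the `(x1, y1, x2, y2)` box format, and the example sentences are assumptions made for illustration.

```python
from collections import Counter
import math


def ngrams(tokens, n):
    """Return a Counter of all n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate, reference, max_n=4):
    """Single-reference sentence BLEU with uniform weights and brevity penalty.

    No smoothing is applied: the score is 0.0 if any n-gram order has no match.
    """
    if not candidate:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand = ngrams(candidate, n)
        ref = ngrams(reference, n)
        total = sum(cand.values())
        # Clipped counts: a candidate n-gram is credited at most as often
        # as it occurs in the reference.
        matched = sum(min(count, ref[gram]) for gram, count in cand.items())
        if total == 0 or matched == 0:
            return 0.0
        log_precisions.append(math.log(matched / total))
    # Brevity penalty punishes candidates shorter than the reference.
    if len(candidate) > len(reference):
        brevity = 1.0
    else:
        brevity = math.exp(1 - len(reference) / len(candidate))
    return brevity * math.exp(sum(log_precisions) / max_n)


def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    # Hypothetical caption pair, invented for this example.
    reference = "brown spots with yellow halos appear on the leaf".split()
    candidate = "brown spots with yellow halos appear on the leaves".split()
    print(f"BLEU: {sentence_bleu(candidate, reference):.3f}")

    # A predicted box is a true positive at mAP50 only if IoU >= 0.5.
    overlap = iou((0, 0, 2, 2), (1, 1, 3, 3))
    print(f"IoU: {overlap:.3f}, counts at mAP50: {overlap >= 0.5}")
```

mAP50 itself averages, over classes, the area under the precision-recall curve built from detections matched at this 0.5 IoU threshold; the sketch shows only the matching criterion. An "average BLEU score" can be formed in more than one way (for example over sentences or over n-gram orders), and the record does not say which the authors used.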
Pages: 19