Crop Disease Diagnosis with Deep Learning-Based Image Captioning and Object Detection

被引:11
作者
Lee, Dong In [1 ]
Lee, Ji Hwan [2 ]
Jang, Seung Ho [3 ]
Oh, Se Jong [4 ]
Doo, Ill Chul [4 ]
机构
[1] Hankuk Univ Foreign Studies, Comp & Elect Syst Engn, Yongin 17035, South Korea
[2] Hankuk Univ Foreign Studies, Artificial Intelligence Convergence, Yongin 17035, South Korea
[3] Hankuk Univ Foreign Studies, Stat, Yongin 17035, South Korea
[4] Hankuk Univ Foreign Studies, Artificial Intelligence Educ, Yongin 17035, South Korea
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 05期
基金
新加坡国家研究基金会;
关键词
crop diseases diagnosis; farm-tech; deep learning; Inceptionv3; transformer; image captioning; YOLOv5; object detection;
D O I
10.3390/app13053148
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The number of people participating in urban farming and its market size have been increasing recently. However, the technologies that assist the novice farmers are still limited. There are several previously researched deep learning-based crop disease diagnosis solutions. However, these techniques only focus on CNN-based disease detection and do not explain the characteristics of disease symptoms based on severity. In order to prevent the spread of diseases in crops, it is important to identify the characteristics of these disease symptoms in advance and cope with them as soon as possible. Therefore, we propose an improved crop disease diagnosis solution which can give practical help to novice farmers. The proposed solution consists of two representative deep learning-based methods: Image Captioning and Object Detection. The Image Captioning model describes prominent symptoms of the disease, according to severity in detail, by generating diagnostic sentences which are grammatically correct and semantically comprehensible, along with presenting the accurate name of it. Meanwhile, the Object Detection model detects the infected area to help farmers recognize which part is damaged and assure them of the accuracy of the diagnosis sentence generated by the Image Captioning model. The Image Captioning model in the proposed solution employs the InceptionV3 model as an encoder and the Transformer model as a decoder, while the Object Detection model of the proposed solution employs the YOLOv5 model. The average BLEU score of the Image Captioning model is 64.96%, which can be considered to have high performance of sentence generation and, meanwhile, the mAP50 for the Object Detection model is 0.382, which requires further improvement. Those results indicate that the proposed solution allows the precise and elaborate information of the crop diseases, thereby increasing the overall reliability of the diagnosis.
引用
收藏
页数:19
相关论文
共 50 条
[21]   Deep Learning-Based Medical Object Detection: A Survey [J].
Saraei, Mohammadreza ;
Lalinia, Mehrshad ;
Lee, Eung-Joo .
IEEE ACCESS, 2025, 13 :53019-53038
[22]   Precision Agriculture and Object Detection: Deep Learning Models for Crop Disease Management [J].
Huerta-Mora, Eduardo A. ;
Rodriguez-Mata, Abraham E. ;
Medrano-Hermosillo, Jesus A. ;
Amabilis-Sosa, Leonel E. ;
Gonzalez-Huitron, Victor A. ;
Rangel, Hector Rodriguez .
INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2025, 16 (02) :84-97
[23]   A survey of deep learning-based object detection: Application and open issues [J].
Abdullah, Shaymaa Tarkan ;
AL-Nuaimi, Bashar Talib ;
Abed, Hazim Noman .
INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2022, 13 (02) :1495-1504
[24]   A Proposal to Ensure Social Distancing with Deep Learning-based Object Detection [J].
Mercaldo, Francesco ;
Martinelli, Fabio ;
Santone, Antonella .
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[25]   Deep Learning-Based Approach for Object Detection in Robot Football Competition [J].
Wang, Zhaoyan .
2022 INTERNATIONAL CONFERENCE ON FRONTIERS OF ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, FAIML, 2022, :179-185
[26]   Real-Time Deep Learning-Based Object Detection Framework [J].
Tarimo, William ;
Sabra, Moustafa M. ;
Hendre, Shonan .
2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, :1829-1836
[27]   Prevention of smombie accidents using deep learning-based object detection [J].
Kim, Hyun-Seok ;
Kim, Geon-Hwan ;
Cho, You-Ze .
ICT EXPRESS, 2022, 8 (04) :618-625
[28]   Deep learning-based object detection for dynamic construction site management [J].
Xu, Jiayi ;
Pan, Wei .
AUTOMATION IN CONSTRUCTION, 2024, 165
[29]   Deep learning-based object detection in augmented reality: A systematic review [J].
Ghasemi, Yalda ;
Jeong, Heejin ;
Choi, Sung Ho ;
Park, Kyeong-Beom ;
Lee, Jae Yeol .
COMPUTERS IN INDUSTRY, 2022, 139
[30]   Road object detection: a comparative study of deep learning-based algorithms [J].
Bharat Mahaur ;
Navjot Singh ;
K. K. Mishra .
Multimedia Tools and Applications, 2022, 81 :14247-14282