Crop Disease Diagnosis with Deep Learning-Based Image Captioning and Object Detection

被引:11
作者
Lee, Dong In [1 ]
Lee, Ji Hwan [2 ]
Jang, Seung Ho [3 ]
Oh, Se Jong [4 ]
Doo, Ill Chul [4 ]
机构
[1] Hankuk Univ Foreign Studies, Comp & Elect Syst Engn, Yongin 17035, South Korea
[2] Hankuk Univ Foreign Studies, Artificial Intelligence Convergence, Yongin 17035, South Korea
[3] Hankuk Univ Foreign Studies, Stat, Yongin 17035, South Korea
[4] Hankuk Univ Foreign Studies, Artificial Intelligence Educ, Yongin 17035, South Korea
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 05期
基金
新加坡国家研究基金会;
关键词
crop diseases diagnosis; farm-tech; deep learning; Inceptionv3; transformer; image captioning; YOLOv5; object detection;
D O I
10.3390/app13053148
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The number of people participating in urban farming and its market size have been increasing recently. However, the technologies that assist the novice farmers are still limited. There are several previously researched deep learning-based crop disease diagnosis solutions. However, these techniques only focus on CNN-based disease detection and do not explain the characteristics of disease symptoms based on severity. In order to prevent the spread of diseases in crops, it is important to identify the characteristics of these disease symptoms in advance and cope with them as soon as possible. Therefore, we propose an improved crop disease diagnosis solution which can give practical help to novice farmers. The proposed solution consists of two representative deep learning-based methods: Image Captioning and Object Detection. The Image Captioning model describes prominent symptoms of the disease, according to severity in detail, by generating diagnostic sentences which are grammatically correct and semantically comprehensible, along with presenting the accurate name of it. Meanwhile, the Object Detection model detects the infected area to help farmers recognize which part is damaged and assure them of the accuracy of the diagnosis sentence generated by the Image Captioning model. The Image Captioning model in the proposed solution employs the InceptionV3 model as an encoder and the Transformer model as a decoder, while the Object Detection model of the proposed solution employs the YOLOv5 model. The average BLEU score of the Image Captioning model is 64.96%, which can be considered to have high performance of sentence generation and, meanwhile, the mAP50 for the Object Detection model is 0.382, which requires further improvement. Those results indicate that the proposed solution allows the precise and elaborate information of the crop diseases, thereby increasing the overall reliability of the diagnosis.
引用
收藏
页数:19
相关论文
共 50 条
[41]   Application of Deep Learning-Based Object Detection Techniques in Fish Aquaculture: A Review [J].
Liu, Hanchi ;
Ma, Xin ;
Yu, Yining ;
Wang, Liang ;
Hao, Lin .
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (04)
[42]   TOSS: Deep Learning-Based Track Object Detection Using Smart Sensor [J].
Rajeswari, D. ;
Rajendran, Srinivasan ;
Arivarasi, A. ;
Govindasamy, Alagiri ;
Ahilan, A. .
IEEE SENSORS JOURNAL, 2024, 24 (22) :37678-37686
[43]   Introduction to Computer Vision and Real Time Deep Learning-based Object Detection [J].
Shanahan, James G. ;
Dai, Liang .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :3523-3524
[44]   A Deep Learning-based Radar and Camera Sensor Fusion Architecture for Object Detection [J].
Nobis, Felix ;
Geisslinger, Maximilian ;
Weber, Markus ;
Betz, Johannes ;
Lienkamp, Markus .
2019 SYMPOSIUM ON SENSOR DATA FUSION: TRENDS, SOLUTIONS, APPLICATIONS (SDF 2019), 2019,
[45]   A Deep Learning-Based Hybrid Framework for Object Detection and Recognition in Autonomous Driving [J].
Li, Yanfen ;
Wang, Hanxiang ;
Dang, L. Minh ;
Nguyen, Tan N. ;
Han, Dongil ;
Lee, Ahyun ;
Jang, Insung ;
Moon, Hyeonjoon .
IEEE ACCESS, 2020, 8 :194228-194239
[46]   A deep learning-based attack on text CAPTCHAs by using object detection techniques [J].
Nian, Jiawei ;
Wang, Ping ;
Gao, Haichang ;
Guo, Xiaoyan .
IET INFORMATION SECURITY, 2022, 16 (02) :97-110
[47]   Deep Learning-Based Object Detection Techniques for Remote Sensing Images: A Survey [J].
Li, Zheng ;
Wang, Yongcheng ;
Zhang, Ning ;
Zhang, Yuxi ;
Zhao, Zhikang ;
Xu, Dongdong ;
Ben, Guangli ;
Gao, Yunxiao .
REMOTE SENSING, 2022, 14 (10)
[48]   A review on anchor assignment and sampling heuristics in deep learning-based object detection [J].
Vo, Xuan-Thuy ;
Jo, Kang-Hyun .
NEUROCOMPUTING, 2022, 506 :96-116
[49]   Introduction to Computer Vision and Real Time Deep Learning-based Object Detection [J].
Shanahan, James G. .
CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, :3515-3516
[50]   Deep Learning-Based Real-time Object Detection in Inland Navigation [J].
Hammedi, Wided ;
Ramirez-Martinez, Metzli ;
Brunet, Philippe ;
Senouci, Sidi Mohammed ;
Messous, Mohamed Ayoub .
2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,