A Mobile-Based Deep Learning Model for Cassava Disease Diagnosis

被引:130
作者
Ramcharan, Amanda [1 ]
McCloskey, Peter [1 ]
Baranowski, Kelsee [1 ]
Mbilinyi, Neema [2 ]
Mrisho, Latifa [2 ]
Ndalahwa, Mathias [2 ]
Legg, James [2 ]
Hughes, David P. [1 ,3 ,4 ]
机构
[1] Penn State Univ, Coll Agr Sci, Dept Entomol, State Coll, PA 16801 USA
[2] Int Inst Trop Agr, Dar El Salaam, Tanzania
[3] Penn State Univ, Eberly Coll Sci, Dept Biol, State Coll, PA 16801 USA
[4] Penn State Univ, Huck Inst Life Sci, Ctr Infect Dis Dynam, State Coll, PA 16801 USA
关键词
cassava disease detection; deep learning; convolutional neural networks; mobile plant disease diagnostics; object detection;
D O I
10.3389/fpls.2019.00272
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Convolutional neural network (CNN) models have the potential to improve plant disease phenotyping where the standard approach is visual diagnostics requiring specialized training. In scenarios where a CNN is deployed on mobile devices, models are presented with new challenges due to lighting and orientation. It is essential for model assessment to be conducted in real world conditions if such models are to be reliably integrated with computer vision products for plant disease phenotyping. We train a CNN object detection model to identify foliar symptoms of diseases in cassava (Manihot esculenta Crantz). We then deploy the model in a mobile app and test its performance on mobile images and video of 720 diseased leaflets in an agricultural field in Tanzania. Within each disease category we test two levels of severity of symptoms-mild and pronounced, to assess the model performance for early detection of symptoms. In both severities we see a decrease in performance for real world images and video as measured with the F-1 score. The F-1 score dropped by 32% for pronounced symptoms in real world images (the closest data to the training data) due to a decrease in model recall. If the potential of mobile CNN models are to be realized our data suggest it is crucial to consider tuning recall in order to achieve the desired performance in real world settings. In addition, the varied performance related to different input data (image or video) is an important consideration for design in real world applications.
引用
收藏
页数:8
相关论文
共 22 条
[1]  
[Anonymous], 2016, CoRR abs/1512.00567, DOI DOI 10.1109/CVPR.2016.308
[2]  
[Anonymous], 2017, TENSORFLOW DETECTION
[3]  
[Anonymous], 2016, P BRIT MACH VIS C YO
[4]   A theory of learning from different domains [J].
Ben-David, Shai ;
Blitzer, John ;
Crammer, Koby ;
Kulesza, Alex ;
Pereira, Fernando ;
Vaughan, Jennifer Wortman .
MACHINE LEARNING, 2010, 79 (1-2) :151-175
[5]   Plant Disease Severity Estimated Visually, by Digital Photography and Image Analysis, and by Hyperspectral Imaging [J].
Bock, C. H. ;
Poole, G. H. ;
Parker, P. E. ;
Gottwald, T. R. .
CRITICAL REVIEWS IN PLANT SCIENCES, 2010, 29 (02) :59-107
[6]   Dermatologist-level classification of skin cancer with deep neural networks [J].
Esteva, Andre ;
Kuprel, Brett ;
Novoa, Roberto A. ;
Ko, Justin ;
Swetter, Susan M. ;
Blau, Helen M. ;
Thrun, Sebastian .
NATURE, 2017, 542 (7639) :115-+
[7]   Science, technology and the future of small autonomous drones [J].
Floreano, Dario ;
Wood, Robert J. .
NATURE, 2015, 521 (7553) :460-466
[8]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]  
Howard AG, 2017, ARXIV