Detection and analysis of wheat spikes using Convolutional Neural Networks

被引:195
作者
Hasan, Md Mehedi [1 ]
Chopin, Joshua P. [1 ]
Laga, Hamid [2 ]
Miklavcic, Stanley J. [1 ]
机构
[1] Univ South Australia, Phen & Bioinformat Res Ctr, Adelaide, SA 5095, Australia
[2] Murdoch Univ, Sch Engn & Informat Technol, Perth, WA 6150, Australia
基金
澳大利亚研究理事会;
关键词
Plant phenotyping; Spike detection; Deep learning; Field imaging; Statistical analysis; HIGH-THROUGHPUT; SYSTEM;
D O I
10.1186/s13007-018-0366-8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundField phenotyping by remote sensing has received increased interest in recent years with the possibility of achieving high-throughput analysis of crop fields. Along with the various technological developments, the application of machine learning methods for image analysis has enhanced the potential for quantitative assessment of a multitude of crop traits. For wheat breeding purposes, assessing the production of wheat spikes, as the grain-bearing organ, is a useful proxy measure of grain production. Thus, being able to detect and characterize spikes from images of wheat fields is an essential component in a wheat breeding pipeline for the selection of high yielding varieties. ResultsWe have applied a deep learning approach to accurately detect, count and analyze wheat spikes for yield estimation. We have tested the approach on a set of images of wheat field trial comprising 10 varieties subjected to three fertilizer treatments. The images have been captured over one season, using high definition RGB cameras mounted on a land-based imaging platform, and viewing the wheat plots from an oblique angle. A subset of in-field images has been accurately labeled by manually annotating all the spike regions. This annotated dataset, called SPIKE, is then used to train four region-based Convolutional Neural Networks (R-CNN) which take, as input, images of wheat plots, and accurately detect and count spike regions in each plot. The CNNs also output the spike density and a classification probability for each plot. Using the same R-CNN architecture, four different models were generated based on four different datasets of training and testing images captured at various growth stages. Despite the challenging field imaging conditions, e.g., variable illumination conditions, high spike occlusion, and complex background, the four R-CNN models achieve an average detection accuracy ranging from 88 to 94% across different sets of test images. The most robust R-CNN model, which achieved the highest accuracy, is then selected to study the variation in spike production over 10 wheat varieties and three treatments. The SPIKE dataset and the trained CNN are the main contributions of this paper. ConclusionWith the availability of good training datasets such us the SPIKE dataset proposed in this article, deep learning techniques can achieve high accuracy in detecting and counting spikes from complex wheat field images. The proposed robust R-CNN model, which has been trained on spike images captured during different growth stages, is optimized for application to a wider variety of field scenarios. It accurately quantifies the differences in yield produced by the 10 varieties we have studied, and their respective responses to fertilizer treatment. We have also observed that the other R-CNN models exhibit more specialized performances. The data set and the R-CNN model, which we make publicly available, have the potential to greatly benefit plant breeders by facilitating the high throughput selection of high yielding varieties.
引用
收藏
页数:13
相关论文
共 34 条
[1]   Automatic Counting of Wheat Spikes from Wheat Growth Images [J].
Alharbi, Najmah ;
Zhou, Ji ;
Wang, Wenjia .
PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM 2018), 2018, :346-355
[2]  
Azzari G, 2015, P AGU FALL M
[3]   LeafNet: A computer vision system for automatic plant species identification [J].
Barre, Pierre ;
Stoever, Ben C. ;
Mueller, Kai F. ;
Steinhage, Volker .
ECOLOGICAL INFORMATICS, 2017, 40 :50-56
[4]  
[毕昆 Bi Kun], 2011, [中国农学通报, Chinese Agricultural Science Bulletin], V27, P464
[5]  
Bi Kun Bi Kun, 2010, Transactions of the Chinese Society of Agricultural Engineering, V26, P212
[6]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[7]   Wheat ear counting in-field conditions: high throughput and low-cost approach using RGB images [J].
Fernandez-Gallego, Jose A. ;
Kefauver, Shawn C. ;
Aparicio Gutierrez, Nieves ;
Teresa Nieto-Taladriz, Maria ;
Luis Araus, Jose .
PLANT METHODS, 2018, 14
[8]  
Girshick R., 2014, P IEEE C COMP VIS PA, DOI [10.1109/CVPR.2014.81, DOI 10.1109/CVPR.2014.81, 10.1109/cvpr.2014.81]
[9]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[10]   High Throughput Field Phenotyping of Wheat Plant Height and Growth Rate in Field Plot Trials Using UAV Based Remote Sensing [J].
Holman, Fenner H. ;
Riche, Andrew B. ;
Michalski, Adam ;
Castle, March ;
Wooster, Martin J. ;
Hawkesford, Malcolm J. .
REMOTE SENSING, 2016, 8 (12)