End-to-end deep learning for directly estimating grape yield from ground-based imagery

被引:21
作者
Olenskyj, Alexander G. [1 ]
Sams, Brent S. [2 ]
Fei, Zhenghao [1 ]
Singh, Vishal [3 ]
Raja, Pranav, V [1 ]
Bornhorst, Gail M. [1 ,4 ]
Earles, J. Mason [1 ,5 ]
机构
[1] Univ Calif Davis, Dept Biol & Agr Engn, Davis, CA 95616 USA
[2] E&J Gallo Winery, Dept Winegrowing Res, Modesto, CA 95354 USA
[3] Univ Calif Davis, Dept Mech & Aerosp Engn, Davis, CA 95616 USA
[4] Riddet Inst, Palmerston North, New Zealand
[5] Univ Calif Davis, Dept Viticulture & Enol, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
Deep learning; Deep regression; Yield estimation; Proximal sensing; Vineyard variability; NEURAL-NETWORKS; FRUIT; STAGE; PREDICTION; SYSTEMS;
D O I
10.1016/j.compag.2022.107081
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Yield estimation prior to harvest is a powerful tool in vineyard management, as it allows growers to fine-tune management practices to optimize yield and quality. However, yield estimation is currently performed using manual sampling, which is time-consuming and imprecise. This study demonstrates the applicability of nondestructive proximal imaging combined with deep learning for yield estimation in vineyards. Continuous image data collection using a vehicle-mounted sensing kit combined with collection of ground truth yield data at harvest using a commercial yield monitor allowed for the generation of a large dataset of 23,581 yield points and 107,933 images. Moreover, this study was conducted in a commercial vineyard which was mechanically managed, representing a challenging environment for image analysis but a common set of conditions in the California Central Valley. Three model architectures were tested: object detection, CNN regression, and transformer models. The object detection model was trained on hand-labeled images to localize grape bunches, and detections were either counted or their pixel count was summed to obtain a metric which was correlated to grape yield. Conversely, regression models were trained end-to-end to directly predict grape yield from image data without the need for hand labeling. Results demonstrated that both a transformer model as well as the object detection model with pixel area processing performed comparably, with a mean absolute percent error of 18% and 18.5%, respectively on a representative holdout dataset. Saliency mapping was used to demonstrate the attention of the CNN regression model was localized near the predicted location of grape bunches, as well as on the top of the grapevine canopy. Overall, the study demonstrated the applicability of proximal imaging and deep learning for prediction of grapevine yield on a large scale. Additionally, the end-to-end modeling approach was able to perform comparably to the object detection approach while eliminating the need for hand-labeling.
引用
收藏
页数:14
相关论文
共 51 条
  • [1] Bargoti S., 2015, IEEE INT C INTELL RO, P1
  • [2] Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards
    Bargoti, Suchet
    Underwood, James P.
    [J]. JOURNAL OF FIELD ROBOTICS, 2017, 34 (06) : 1039 - 1060
  • [3] A General and Adaptive Robust Loss Function
    Barron, Jonathan T.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4326 - 4334
  • [4] Bramley RGV, 2004, AUST J GRAPE WINE R, V10, P32, DOI 10.1111/j.1755-0238.2004.tb00006.x
  • [5] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [6] de la Fuente M, 2015, J INT SCI VIGNE VIN, V49, P27
  • [7] Convolutional neural network: a review of models, methodologies and applications to object detection
    Dhillon, Anamika
    Verma, Gyanendra K.
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, 2020, 9 (02) : 85 - 112
  • [8] A Low-Cost and Unsupervised Image Recognition Methodology for Yield Estimation in a Vineyard
    Di Gennaro, Salvatore Filippo
    Toscano, Piero
    Cinat, Paolo
    Berton, Andrea
    Matese, Alessandro
    [J]. FRONTIERS IN PLANT SCIENCE, 2019, 10
  • [9] Grapevine Yield and Leaf Area Estimation Using Supervised Classification Methodology on RGB Images Taken under Field Conditions
    Diago, Maria-Paz
    Correa, Christian
    Millan, Borja
    Barreiro, Pilar
    Valero, Constantino
    Tardaguila, Javier
    [J]. SENSORS, 2012, 12 (12): : 16988 - 17006
  • [10] Dosovitskiy A., 2021, IMAGE IS WORTH 1616