End-to-end deep learning for directly estimating grape yield from ground-based imagery

Cited by: 21

Authors:

Olenskyj, Alexander G. ^{[1]}

Sams, Brent S. ^{[2]}

Fei, Zhenghao ^{[1]}

Singh, Vishal ^{[3]}

Raja, Pranav V. ^{[1]}

Bornhorst, Gail M. ^{[1,4]}

Earles, J. Mason ^{[1,5]}

Affiliations:

[1] Univ Calif Davis, Dept Biol & Agr Engn, Davis, CA 95616 USA

[2] E&J Gallo Winery, Dept Winegrowing Res, Modesto, CA 95354 USA

[3] Univ Calif Davis, Dept Mech & Aerosp Engn, Davis, CA 95616 USA

[4] Riddet Inst, Palmerston North, New Zealand

[5] Univ Calif Davis, Dept Viticulture & Enol, Davis, CA 95616 USA

Source:

COMPUTERS AND ELECTRONICS IN AGRICULTURE | 2022 / Vol. 198

Funding:

National Science Foundation (USA);

Keywords:

Deep learning; Deep regression; Yield estimation; Proximal sensing; Vineyard variability; NEURAL-NETWORKS; FRUIT; STAGE; PREDICTION; SYSTEMS;

DOI:

10.1016/j.compag.2022.107081

Chinese Library Classification:

S [Agricultural Sciences];

Discipline classification code:

09;

Abstract:

Yield estimation prior to harvest is a powerful tool in vineyard management, as it allows growers to fine-tune management practices to optimize yield and quality. However, yield estimation is currently performed using manual sampling, which is time-consuming and imprecise. This study demonstrates the applicability of nondestructive proximal imaging combined with deep learning for yield estimation in vineyards. Continuous image data collection using a vehicle-mounted sensing kit, combined with collection of ground truth yield data at harvest using a commercial yield monitor, allowed for the generation of a large dataset of 23,581 yield points and 107,933 images. Moreover, this study was conducted in a mechanically managed commercial vineyard, representing a challenging environment for image analysis but a common set of conditions in the California Central Valley. Three model architectures were tested: object detection, CNN regression, and transformer models. The object detection model was trained on hand-labeled images to localize grape bunches, and detections were either counted or their pixel count was summed to obtain a metric that was correlated to grape yield. Conversely, regression models were trained end-to-end to directly predict grape yield from image data without the need for hand labeling. Results demonstrated that a transformer model and the object detection model with pixel area processing performed comparably, with mean absolute percent errors of 18% and 18.5%, respectively, on a representative holdout dataset. Saliency mapping was used to demonstrate that the attention of the CNN regression model was localized near the predicted location of grape bunches, as well as on the top of the grapevine canopy. Overall, the study demonstrated the applicability of proximal imaging and deep learning for prediction of grapevine yield on a large scale. Additionally, the end-to-end modeling approach was able to perform comparably to the object detection approach while eliminating the need for hand labeling.
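
The abstract's detection-based pipeline (sum the pixel area of the detections, relate that metric to yield, report mean absolute percent error) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the bounding-box format, the linear fit, the function names `box_area_metric` and `mape`, and all numbers are hypothetical, and the fit is evaluated on the same toy data rather than on a holdout set.

```python
import numpy as np


def box_area_metric(boxes):
    """Sum the pixel areas of boxes given as (x_min, y_min, x_max, y_max).

    Assumption: detections are axis-aligned bounding boxes; the paper may
    compute its pixel metric differently.
    """
    boxes = np.asarray(boxes, dtype=float).reshape(-1, 4)
    widths = np.clip(boxes[:, 2] - boxes[:, 0], 0, None)
    heights = np.clip(boxes[:, 3] - boxes[:, 1], 0, None)
    return float(np.sum(widths * heights))


def mape(y_true, y_pred):
    """Mean absolute percent error, in percent (y_true must be nonzero)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.mean(np.abs((y_true - y_pred) / y_true)) * 100.0)


# Hypothetical detections for four images (not data from the study).
detections_per_image = [
    [(10, 10, 30, 40), (50, 20, 80, 60)],
    [(5, 5, 25, 25)],
    [],
    [(0, 0, 40, 40), (60, 10, 90, 50), (100, 30, 120, 70)],
]
# Hypothetical measured yields for the same four images.
measured_yield = np.array([2.1, 0.9, 0.4, 3.8])

# One area metric per image, then an assumed linear relation to yield.
areas = np.array([box_area_metric(b) for b in detections_per_image])
slope, intercept = np.polyfit(areas, measured_yield, 1)
predicted_yield = slope * areas + intercept

print(f"MAPE on the toy data: {mape(measured_yield, predicted_yield):.1f}%")
```

Counting detections instead of summing areas would only change the metric function (for example, `len(boxes)` per image); the fitting and error steps stay the same.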

Pages: 14
