An In-Field Dynamic Vision-Based Analysis for Vineyard Yield Estimation

被引:1
作者
Ahmedt-Aristizabal, David [1 ]
Smith, Daniel [2 ]
Khokher, Muhammad Rizwan [1 ]
Li, Xun [1 ]
Smith, Adam L. [3 ]
Petersson, Lars [1 ]
Rolland, Vivien [3 ]
Edwards, Everard J. [3 ]
机构
[1] Commonwealth Sci & Ind Res Org CSIRO, Imaging & Comp Vis Grp, Data61, Canberra, ACT 2601, Australia
[2] Commonwealth Sci & Ind Res Org CSIRO, Spatio Temporal Analyt Team, Data61, Canberra, ACT 2601, Australia
[3] Commonwealth Sci & Ind Res Org CSIRO, Dept Agr & Food, Canberra, ACT 2601, Australia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Yield estimation; Feature extraction; Transformers; Deep learning; Semantic segmentation; Object recognition; Precision agriculture; Crop yield; Weight measurement; Precision viticulture; bunch detection and segmentation with transformers; multi-bunch tracking and counting; density-based berry counting; weight regression; grapevine yield estimation; GRAPE BUNCH DETECTION; BERRIES; IMAGES;
D O I
10.1109/ACCESS.2024.3431244
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurately predicting grape yield in vineyards is essential for strategic decision-making in the wine industry. Current methods are labour-intensive, costly, and lack spatial coverage, reducing accuracy and cost-efficiency. Efforts to automate and enhance yield estimation focus on scaling fruit weight assessments. Machine learning, particularly deep learning, shows promise in improving accuracy through automatic feature extraction and hierarchical representation. However, most of these methods have been developed for analyses at a particular time point and solutions able to consider temporal information captured across sequential frames are currently poorly developed. This paper addresses this gap by introducing a system for yield estimation, utilising publicly available data repositories, such as Embrapa WGISD, alongside an in-house dataset collected by a Blackmagic camera at the pre-harvest stage. We introduce a system that utilises bunch weight regression to estimate grape yield. Bunch weight estimates are obtained by summing samples randomly drawn from the grape bunch weight distribution through empirical calibration. Grapevine bunches are identified and segmented using Mask R-CNN with Swin Transformer, and a SiamFC-based tracking mechanism is employed to estimate the number of unique bunches per panel or row. The number of berries for each tracked bunch is determined using a density approach known as multitask point supervision. In our experiments, we demonstrate the effectiveness of the proposed system for yield estimation, achieving harvested weight errors of less than 5% in two of the three vineyard panels. Larger harvest weight errors (around 15%) were observed due to inaccuracies in tracking certain bunches caused by dense concentration of bunches in one panel. However, these errors should be contrasted with the current practice error of up to 30%, highlighting the potential of machine vision for hands-off yield estimation at scale.
引用
收藏
页码:102146 / 102166
页数:21
相关论文
共 115 条
  • [31] Assessment of cluster yield components by image analysis
    Diago, Maria P.
    Tardaguila, Javier
    Aleixos, Nuria
    Millan, Borja
    Prats-Montalban, Jose M.
    Cubero, Sergio
    Blasco, Jose
    [J]. JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE, 2015, 95 (06) : 1274 - 1282
  • [32] Dosovitskiy A., 2021, P INT C REPR LEARN, P213
  • [33] Instance Segmentation and Berry Counting of Table Grape before Thinning Based on AS-SwinT
    Du, Wensheng
    Liu, Ping
    [J]. PLANT PHENOMICS, 2023, 5
  • [34] Technologies and Innovative Methods for Precision Viticulture: A Comprehensive Review
    Ferro, Massimo Vincenzo
    Catania, Pietro
    [J]. HORTICULTURAE, 2023, 9 (03)
  • [35] Vineyard Yield Estimation Based on the Analysis of High Resolution Images Obtained with Artificial Illumination at Night
    Font, Davinia
    Tresanchez, Marcel
    Martinez, Dani
    Moreno, Javier
    Clotet, Eduard
    Palacin, Jordi
    [J]. SENSORS, 2015, 15 (04) : 8284 - 8301
  • [36] Gao GS, 2020, Arxiv, DOI arXiv:2003.12783
  • [37] In-Field Automatic Detection of Grape Bunches under a Totally Uncontrolled Environment
    Ghiani, Luca
    Sassu, Alberto
    Palumbo, Francesca
    Mercenaro, Luca
    Gambella, Filippo
    [J]. SENSORS, 2021, 21 (11)
  • [38] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [39] Grimm J., 2018, arXiv
  • [40] An adaptable approach to automated visual detection of plant organs with applications in grapevine breeding
    Grimm, Jonatan
    Herzog, Katja
    Rist, Florian
    Kicherer, Anna
    Toepfer, Reinhard
    Steinhage, Volker
    [J]. BIOSYSTEMS ENGINEERING, 2019, 183 : 170 - 183