Automatic Bunch Detection in White Grape Varieties Using YOLOv3, YOLOv4, and YOLOv5 Deep Learning Algorithms

被引:160
作者
Sozzi, Marco [1 ]
Cantalamessa, Silvia [2 ]
Cogato, Alessia [3 ]
Kayad, Ahmed [1 ]
Marinello, Francesco [1 ]
机构
[1] Univ Padua, Dept Land Environm Agr & Forestry, I-35020 Legnaro, Italy
[2] Univ Padua, Dept Agron Food Nat Resources Anim & Environm, I-35020 Legnaro, Italy
[3] Univ Udine, Dept Agr Food Environm & Anim Sci, I-33100 Udine, Italy
来源
AGRONOMY-BASEL | 2022年 / 12卷 / 02期
关键词
viticulture; early yield estimation; real-time detection; cluster detection; smart farming; crop load mapping; yield estimation; precision viticulture; agricultural robot; YIELD PREDICTION; FRUIT DETECTION; VINEYARDS; SENSORS;
D O I
10.3390/agronomy12020319
中图分类号
S3 [农学(农艺学)];
学科分类号
0901 ;
摘要
Over the last few years, several Convolutional Neural Networks for object detection have been proposed, characterised by different accuracy and speed. In viticulture, yield estimation and prediction is used for efficient crop management, taking advantage of precision viticulture techniques. Convolutional Neural Networks for object detection represent an alternative methodology for grape yield estimation, which usually relies on manual harvesting of sample plants. In this paper, six versions of the You Only Look Once (YOLO) object detection algorithm (YOLOv3, YOLOv3-tiny, YOLOv4, YOLOv4-tiny, YOLOv5x, and YOLOv5s) were evaluated for real-time bunch detection and counting in grapes. White grape varieties were chosen for this study, as the identification of white berries on a leaf background is trickier than red berries. YOLO models were trained using a heterogeneous dataset populated by images retrieved from open datasets and acquired on the field in several illumination conditions, background, and growth stages. Results have shown that YOLOv5x and YOLOv4 achieved an F1-score of 0.76 and 0.77, respectively, with a detection speed of 31 and 32 FPS. Differently, YOLO5s and YOLOv4-tiny achieved an F1-score of 0.76 and 0.69, respectively, with a detection speed of 61 and 196 FPS. The final YOLOv5x model for bunch number, obtained considering bunch occlusion, was able to estimate the number of bunches per plant with an average error of 13.3% per vine. The best combination of accuracy and speed was achieved by YOLOv4-tiny, which should be considered for real-time grape yield estimation, while YOLOv3 was affected by a False Positive-False Negative compensation, which decreased the RMSE.
引用
收藏
页数:17
相关论文
共 63 条
[1]  
Abdulsalam M, 2020, MED C CONTR AUTOMAT, P1087, DOI [10.1109/MED48518.2020.9183325, 10.1109/med48518.2020.9183325]
[2]   Grape Bunch Detection at Different Growth Stages Using Deep Learning Quantized Models [J].
Aguiar, Andre Silva ;
Magalhaes, Sandro Augusto ;
dos Santos, Filipe Neves ;
Castro, Luis ;
Pinho, Tatiana ;
Valente, Joao ;
Martins, Rui ;
Boaventura-Cunha, Jose .
AGRONOMY-BASEL, 2021, 11 (09)
[3]  
[Anonymous], 2020, CoRR abs/2004.10934
[4]  
[Anonymous], 2020, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2018.2844175
[5]   Automated early yield prediction in vineyards from on-the-go image acquisition [J].
Aquino, Arturo ;
Millan, Borja ;
Diago, Maria-Paz ;
Tardaguila, Javier .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2018, 144 :26-36
[6]   Review. Precision Viticulture. Research topics, challenges and opportunities in site-specific vineyard management [J].
Arno, J. ;
Martinez-Casasnovas, J. A. ;
Ribes-Dasi, M. ;
Rosell, J. R. .
SPANISH JOURNAL OF AGRICULTURAL RESEARCH, 2009, 7 (04) :779-790
[7]   On-the-go sensing of grape berry anthocyanins during commercial harvest: development and prospects [J].
Bramley, R. G. V. ;
Le Moigne, M. ;
Evain, S. ;
Ouzman, J. ;
Florin, L. ;
Fadaili, E. M. ;
Hinze, C. J. ;
Cerovic, Z. G. .
AUSTRALIAN JOURNAL OF GRAPE AND WINE RESEARCH, 2011, 17 (03) :316-326
[8]   Single-Shot Convolution Neural Networks for Real-Time Fruit Detection Within the Tree [J].
Bresilla, Kushtrim ;
Perulli, Giulio Demetrio ;
Boini, Alexandra ;
Morandi, Brunella ;
Grappadelli, Luca Corelli ;
Manfrini, Luigi .
FRONTIERS IN PLANT SCIENCE, 2019, 10
[9]  
Chetlur Sharan, 2014, cudnn: Efficient primitives for deep learning
[10]   Evaluating the Spectral and Physiological Responses of Grapevines (Vitis vinifera L.) to Heat and Water Stresses under Different Vineyard Cooling and Irrigation Strategies [J].
Cogato, Alessia ;
Wu, Lihua ;
Jewan, Shaikh Yassir Yousouf ;
Meggio, Franco ;
Marinello, Francesco ;
Sozzi, Marco ;
Pagay, Vinay .
AGRONOMY-BASEL, 2021, 11 (10)