Automatic Bunch Detection in White Grape Varieties Using YOLOv3, YOLOv4, and YOLOv5 Deep Learning Algorithms

被引:160
作者
Sozzi, Marco [1 ]
Cantalamessa, Silvia [2 ]
Cogato, Alessia [3 ]
Kayad, Ahmed [1 ]
Marinello, Francesco [1 ]
机构
[1] Univ Padua, Dept Land Environm Agr & Forestry, I-35020 Legnaro, Italy
[2] Univ Padua, Dept Agron Food Nat Resources Anim & Environm, I-35020 Legnaro, Italy
[3] Univ Udine, Dept Agr Food Environm & Anim Sci, I-33100 Udine, Italy
来源
AGRONOMY-BASEL | 2022年 / 12卷 / 02期
关键词
viticulture; early yield estimation; real-time detection; cluster detection; smart farming; crop load mapping; yield estimation; precision viticulture; agricultural robot; YIELD PREDICTION; FRUIT DETECTION; VINEYARDS; SENSORS;
D O I
10.3390/agronomy12020319
中图分类号
S3 [农学(农艺学)];
学科分类号
0901 ;
摘要
Over the last few years, several Convolutional Neural Networks for object detection have been proposed, characterised by different accuracy and speed. In viticulture, yield estimation and prediction is used for efficient crop management, taking advantage of precision viticulture techniques. Convolutional Neural Networks for object detection represent an alternative methodology for grape yield estimation, which usually relies on manual harvesting of sample plants. In this paper, six versions of the You Only Look Once (YOLO) object detection algorithm (YOLOv3, YOLOv3-tiny, YOLOv4, YOLOv4-tiny, YOLOv5x, and YOLOv5s) were evaluated for real-time bunch detection and counting in grapes. White grape varieties were chosen for this study, as the identification of white berries on a leaf background is trickier than red berries. YOLO models were trained using a heterogeneous dataset populated by images retrieved from open datasets and acquired on the field in several illumination conditions, background, and growth stages. Results have shown that YOLOv5x and YOLOv4 achieved an F1-score of 0.76 and 0.77, respectively, with a detection speed of 31 and 32 FPS. Differently, YOLO5s and YOLOv4-tiny achieved an F1-score of 0.76 and 0.69, respectively, with a detection speed of 61 and 196 FPS. The final YOLOv5x model for bunch number, obtained considering bunch occlusion, was able to estimate the number of bunches per plant with an average error of 13.3% per vine. The best combination of accuracy and speed was achieved by YOLOv4-tiny, which should be considered for real-time grape yield estimation, while YOLOv3 was affected by a False Positive-False Negative compensation, which decreased the RMSE.
引用
收藏
页数:17
相关论文
共 63 条
[51]  
Sozzi M., 2019, PRECISION AGR 19, V19, P501, DOI DOI 10.3920/978-90-8686-888-9_62
[52]   Economic Comparison of Satellite, Plane and UAV-Acquired NDVI Images for Site-Specific Nitrogen Application: Observations from Italy [J].
Sozzi, Marco ;
Kayad, Ahmed ;
Gobbo, Stefano ;
Cogato, Alessia ;
Sartori, Luigi ;
Marinello, Francesco .
AGRONOMY-BASEL, 2021, 11 (11)
[53]   Considerations on spatial crop load mapping [J].
Taylor, J. A. ;
Dresser, J. L. ;
Hickey, C. C. ;
Nuske, S. T. ;
Bates, T. R. .
AUSTRALIAN JOURNAL OF GRAPE AND WINE RESEARCH, 2019, 25 (02) :144-155
[54]  
Taylor JA, 2016, J INT SCI VIGNE VIN, V50, P57
[55]   Apple detection during different growth stages in orchards using the improved YOLO-V3 model [J].
Tian, Yunong ;
Yang, Guodong ;
Wang, Zhe ;
Wang, Hao ;
Li, En ;
Liang, Zize .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2019, 157 :417-426
[56]   Deep Learning for Computer Vision: A Brief Review [J].
Voulodimos, Athanasios ;
Doulamis, Nikolaos ;
Doulamis, Anastasios ;
Protopapadakis, Eftychios .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
[57]   CSPNet: A New Backbone that can Enhance Learning Capability of CNN [J].
Wang, Chien-Yao ;
Liao, Hong-Yuan Mark ;
Wu, Yueh-Hua ;
Chen, Ping-Yang ;
Hsieh, Jun-Wei ;
Yeh, I-Hau .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :1571-1580
[58]   An improved tiny-yolov3 pedestrian detection algorithm [J].
Zhang, Yi ;
Shen, Yongliang ;
Zhang, Jun .
OPTIK, 2019, 183 :17-23
[59]   Faster-YOLO: An accurate and faster object detection method [J].
Yin, Yunhua ;
Li, Huifang ;
Fu, Wei .
DIGITAL SIGNAL PROCESSING, 2020, 102
[60]   Object Detection With Deep Learning: A Review [J].
Zhao, Zhong-Qiu ;
Zheng, Peng ;
Xu, Shou-Tao ;
Wu, Xindong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (11) :3212-3232