A Seamless Deep Learning Approach for Apple Detection, Depth Estimation, and Tracking Using YOLO Models Enhanced by Multi-Head Attention Mechanism

被引:10
作者
Sekharamantry, Praveen Kumar [1 ,2 ]
Melgani, Farid [1 ]
Malacarne, Jonni [1 ]
Ricci, Riccardo [1 ]
de Almeida Silva, Rodrigo [3 ]
Marcato Jr, Jose [3 ]
机构
[1] Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
[2] GITAM Deemed Univ, GITAM Sch Technol, Dept Comp Sci & Engn, Visakhapatnam, India
[3] Univ Fed Mato Grosso do Sul, Fac Engn Architecture & Urbanism & Geog, BR-79070900 Campo Grande, Brazil
关键词
apple detection; depth estimation; multi-head attention mechanism; ByteTrack; RECOGNITION; VISION;
D O I
10.3390/computers13030083
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Considering precision agriculture, recent technological developments have sparked the emergence of several new tools that can help to automate the agricultural process. For instance, accurately detecting and counting apples in orchards is essential for maximizing harvests and ensuring effective resource management. However, there are several intrinsic difficulties with traditional techniques for identifying and counting apples in orchards. To identify, recognize, and detect apples, apple target detection algorithms, such as YOLOv7, have shown a great deal of reflection and accuracy. But occlusions, electrical wiring, branches, and overlapping pose severe issues for precisely detecting apples. Thus, to overcome these issues and accurately recognize apples and find the depth of apples from drone-based videos in complicated backdrops, our proposed model combines a multi-head attention system with the YOLOv7 object identification framework. Furthermore, we provide the ByteTrack method for apple counting in real time, which guarantees effective monitoring of apples. To verify the efficacy of our suggested model, a thorough comparison assessment is performed with several current apple detection and counting techniques. The outcomes adequately proved the effectiveness of our strategy, which continuously surpassed competing methods to achieve exceptional accuracies of 0.92, 0.96, and 0.95 with respect to precision, recall, and F1 score, and a low MAPE of 0.027, respectively.
引用
收藏
页数:25
相关论文
共 56 条
[1]   Recognition and Counting of Apples in a Dynamic State Using a 3D Camera and Deep Learning Algorithms for Robotic Harvesting Systems [J].
Abeyrathna, R. M. Rasika D. ;
Nakaguchi, Victor Massaki ;
Minn, Arkar ;
Ahamed, Tofael .
SENSORS, 2023, 23 (08)
[2]   Intelligent System for Estimation of the Spatial Position of Apples Based on YOLOv3 and Real Sense Depth Camera D415 [J].
Andriyanov, Nikita ;
Khasanshin, Ilshat ;
Utkin, Daniil ;
Gataullin, Timur ;
Ignar, Stefan ;
Shumaev, Vyacheslav ;
Soloviev, Vladimir .
SYMMETRY-BASEL, 2022, 14 (01)
[3]   Digital Transformation of Agriculture through the Use of an Interoperable Platform [J].
Antonio Lopez-Morales, Juan ;
Antonio Martinez, Juan ;
Skarmeta, Antonio F. .
SENSORS, 2020, 20 (04)
[4]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[5]   ATSS Deep Learning-Based Approach to Detect Apple Fruits [J].
Biffi, Leonardo Josoe ;
Mitishita, Edson ;
Liesenberg, Veraldo ;
dos Santos, Anderson Aparecido ;
Goncalves, Diogo Nunes ;
Estrabis, Nayara Vasconcelos ;
Silva, Jonathan de Andrade ;
Osco, Lucas Prado ;
Ramos, Ana Paula Marques ;
Centeno, Jorge Antonio Silva ;
Schimalski, Marcos Benedito ;
Rufato, Leo ;
Neto, Silvio Luis Rafaeli ;
Marcato Junior, Jose ;
Goncalves, Wesley Nunes .
REMOTE SENSING, 2021, 13 (01) :1-23
[6]  
Brock H., 2018, P 11 INT C LANGUAGE, P7
[7]  
Bulanon D., 2001, P 2001 ASAE ANN M, DOI [10.13031/2013.3672, DOI 10.13031/2013.3672]
[8]  
Bulanon D. M., 2010, Agricultural Engineering International: CIGR Journal, V12, P203
[9]   EFFECTIVE FEATURE FUSION NETWORK IN BIFPN FOR SMALL OBJECT DETECTION [J].
Chen, Jun ;
Mai, HongSheng ;
Luo, Linbo ;
Chen, Xiaoqiang ;
Wu, Kangle .
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, :699-703
[10]   An Efficient Method for Monitoring Birds Based on Object Detection and Multi-Object Tracking Networks [J].
Chen, Xian ;
Pu, Hongli ;
He, Yihui ;
Lai, Mengzhen ;
Zhang, Daike ;
Chen, Junyang ;
Pu, Haibo .
ANIMALS, 2023, 13 (10)