DPD-YOLO: dense pineapple fruit target detection algorithm in complex environments based on YOLOv8 combined with attention mechanism

被引:0
作者
Lin, Cong [1 ]
Jiang, Wencheng [1 ]
Zhao, Weiye [1 ]
Zou, Lilan [1 ]
Xue, Zhong [2 ]
机构
[1] Guangdong Ocean Univ, Sch Elect & Informat Engn, Zhanjiang, Peoples R China
[2] Chinese Acad Trop Agr Sci, South Subtrop Crops Res Inst, Zhanjiang, Peoples R China
来源
FRONTIERS IN PLANT SCIENCE | 2025年 / 16卷
关键词
pineapple detection; UAV; BiFPN; YOLOv8; coordinate attention;
D O I
10.3389/fpls.2025.1523552
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
With the development of deep learning technology and the widespread application of drones in the agricultural sector, the use of computer vision technology for target detection of pineapples has gradually been recognized as one of the key methods for estimating pineapple yield. When images of pineapple fields are captured by drones, the fruits are often obscured by the pineapple leaf crowns due to their appearance and planting characteristics. Additionally, the background in pineapple fields is relatively complex, and current mainstream target detection algorithms are known to perform poorly in detecting small targets under occlusion conditions in such complex backgrounds. To address these issues, an improved YOLOv8 target detection algorithm, named DPD-YOLO (Dense-Pineapple-Detection YOU Only Look Once), has been proposed for the detection of pineapples in complex environments. The DPD-YOLO model is based on YOLOv8 and introduces the attention mechanism (Coordinate Attention) to enhance the network's ability to extract features of pineapples in complex backgrounds. Furthermore, the small target detection layer has been fused with BiFPN (Bi-directional Feature Pyramid Network) to strengthen the integration of multi-scale features and enrich the extraction of semantic features. At the same time, the original YOLOv8 detection head has been replaced by the RT-DETR detection head, which incorporates Cross-Attention and Self-Attention mechanisms that improve the model's detection accuracy. Additionally, Focaler-IoU has been employed to improve CIoU, allowing the network to focus more on small targets. Finally, high-resolution images of the pineapple fields were captured using drones to create a dataset, and extensive experiments were conducted. The results indicate that, compared to existing mainstream target detection models, the proposed DPD-YOLO demonstrated superior detection performance for pineapples in situations where the background is complex and the targets are occluded. The mAP@0.5 reached 62.0%, representing an improvement of 6.6% over the original YOLOv8 algorithm, Precision increased by 2.7%, Recall improved by 13%, and F1-score rose by 10.3%.
引用
收藏
页数:16
相关论文
共 35 条
[1]   The effect of data augmentation and network simplification on the image-based detection of broccoli heads with Mask R-CNN [J].
Blok, Pieter M. ;
van Evert, Frits K. ;
Tielen, Antonius P. M. ;
van Henten, Eldert J. ;
Kootstra, Gert .
JOURNAL OF FIELD ROBOTICS, 2021, 38 (01) :85-104
[2]   Texture-based fruit detection [J].
Chaivivatrakul, Supawadee ;
Dailey, Matthew N. .
PRECISION AGRICULTURE, 2014, 15 (06) :662-683
[3]  
Chowdhury G. G., 2010, Introduction to modern information retrieval, DOI [10.1080/00048623.2010.10721488, DOI 10.1080/00048623.2010.10721488]
[4]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[5]   YOLOv5-T: A precise real-time detection method for maize tassels based on UAV low altitude remote sensing images [J].
Gao, Rui ;
Jin, Yishu ;
Tian, Xin ;
Ma, Zheng ;
Liu, Siqi ;
Su, Zhongbin .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 221
[6]   Attention mechanisms in computer vision: A survey [J].
Guo, Meng-Hao ;
Xu, Tian-Xing ;
Liu, Jiang-Jiang ;
Liu, Zheng-Ning ;
Jiang, Peng-Tao ;
Mu, Tai-Jiang ;
Zhang, Song-Hai ;
Martin, Ralph R. ;
Cheng, Ming-Ming ;
Hu, Shi-Min .
COMPUTATIONAL VISUAL MEDIA, 2022, 8 (03) :331-368
[7]  
He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[8]   Support vector machines [J].
Hearst, MA .
IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1998, 13 (04) :18-21
[9]   Coordinate Attention for Efficient Mobile Network Design [J].
Hou, Qibin ;
Zhou, Daquan ;
Feng, Jiashi .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13708-13717
[10]  
Jumaah H. J., 2024, J. Optics Photonics Res, P1, DOI [10.47852/bonviewJOPR42022920, DOI 10.47852/BONVIEWJOPR42022920]