Study on fusion clustering and improved YOLOv5 algorithm based on multiple occlusion of Camellia oleifera fruit

被引:48
作者
Chen, Shang [1 ,2 ]
Zou, Xiangjun [2 ,3 ]
Zhou, Xinzhao [2 ]
Xiang, Yang [1 ]
Wu, Mingliang [1 ]
机构
[1] Hunan Agr Univ, Coll Mech & Elect Engn, Changsha 410128, Peoples R China
[2] Foshan ZhongKe Innovat Res Inst Intelligent Agr &, Foshan 528200, Peoples R China
[3] Foshan Sinotech Ind Technol Res Inst, Foshan, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural Network; Multi-Class Target Detection; Coordinate Attention; Partial Occlusion;
D O I
10.1016/j.compag.2023.107706
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
The detection method of Camellia oleifera fruits is not only an essential part of the Camellia oleifera fruit detection and locating process, but also a key foundation of the vision robot harvesting. Due to the occlusions of branches and leaves, as well as the overlap of fruits, it is a daunting task to improve the precision of Camellia oleifera fruit detection in a natural environment. In this paper, we propose a YOLO-COF lightweight multiclass occlusion target detection method for Camellia oleifera fruit, introducing K-means++ clustering algorithm under the framework of YOLOv5s to automatically filter and select the target dataset; incorporating Coordinate Attention to enhance the feature extraction ability of occluded target and improve the generalizability and robustness of the model. The study assesses four categories of datasets: complete fruit occlusion NO, fruit occluded by leaf OL, fruit occluded by branches OB, and fruit occluded by other fruits OF, combining the new detection model and comparing with five Faster R-CNN, YOLOv4-MobileNetV3, YOLOv4-CSPDarknet, YOLOv5s, and YOLOv5x models. The experimental results indicate that the AP values of YOLO-COF for the detection of NO, OL, OB, and OF are 0.97, 0.94, 0.93, and 0.92, respectively. In comparison to other neural network models, YOLO-COF is well-balanced in terms of precision, speed, and size. Its mAP value is 94.10%, its frame rate is 74.8FPS, and its model size is 27.1 MB. The proposed model has the potential to facilitate the operational decision-making of vision robot during the process of Camellia oleifera fruit harvesting.
引用
收藏
页数:13
相关论文
共 37 条
[1]   Integration of visible branch sections and cherry clusters for detecting cherry tree branches in dense foliage canopies [J].
Amatya, Suraj ;
Karkee, Manoj .
BIOSYSTEMS ENGINEERING, 2016, 149 :72-81
[2]   A deep learning approach for anthracnose infected trees classification in walnut orchards [J].
Anagnostis, A. ;
Tagarakis, A. C. ;
Asiminari, G. ;
Papageorgiou, E. ;
Kateris, D. ;
Moshou, D. ;
Bochtis, D. .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 182
[3]  
Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
[4]   A Multi-Objective Particle Swarm Optimization for Trajectory Planning of Fruit Picking Manipulator [J].
Cao, Xiaoman ;
Yan, Hansheng ;
Huang, Zhengyan ;
Ai, Si ;
Xu, Yongjun ;
Fu, Renxuan ;
Zou, Xiangjun .
AGRONOMY-BASEL, 2021, 11 (11)
[5]   Three-dimensional perception of orchard banana central stock enhanced by adaptive multi-vision technology [J].
Chen, Mingyou ;
Tang, Yunchao ;
Zou, Xiangjun ;
Huang, Kuangyu ;
Huang, Zhaofeng ;
Zhou, Hao ;
Wang, Chenglin ;
Lian, Guoping .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 174
[6]   Kiwifruit detection in field images using Faster R-CNN with ZFNet [J].
Fu, Longsheng ;
Feng, Yali ;
Majeed, Yaqoob ;
Zhang, Xin ;
Zhang, Jing ;
Karkee, Manoj ;
Zhang, Qin .
IFAC PAPERSONLINE, 2018, 51 (17) :45-50
[7]   Multi-class fruit-on-plant detection for apple in SNAP system using Faster R-CNN [J].
Gao, Fangfang ;
Fu, Longsheng ;
Zhang, Xin ;
Majeed, Yaqoob ;
Li, Rui ;
Karkee, Manoj ;
Zhang, Qin .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 176
[8]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[9]   Coordinate Attention for Efficient Mobile Network Design [J].
Hou, Qibin ;
Zhou, Daquan ;
Feng, Jiashi .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13708-13717
[10]   Real-time detection of underwater river crab based on multi-scale pyramid fusion image enhancement and MobileCenterNet model [J].
Ji, Wei ;
Peng, Jiaqi ;
Xu, Bo ;
Zhang, Tong .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 204