A lightweight waxberry fruit detection model based on YOLOv5

被引:2
作者
Yang, Chengyu [1 ]
Liu, Jun [1 ]
He, Jianting [2 ]
机构
[1] Shanghai Dianji Univ, Sch Elect Engn, Shanghai, Peoples R China
[2] Cixi Shanglin Lake Waxberry Plantat, Cixi, Zhejiang, Peoples R China
关键词
agricultural engineering; agricultural products; complex networks; computer vision; feature extraction; image recognition;
D O I
10.1049/ipr2.13064
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to solve the safety and efficiency problems in the picking process of Waxberry, the slow speed and low precision of high-density Waxberry target detection under a complex background were studied. A lightweight Waxberry target detection algorithm based on YOLOv5 is studied. In this study, C3-Faster1 and C3-Faster2 modules are proposed, which are located in the backbone and neck of the network: C3-Faster1 can improve the model speed with a simple structure; C3-Faster2 integrates the context attention mechanism and transform module based on C3-Faster1 to make the network pay attention to the information of Waxberry image context and expand the channel receptive field. A new pyramid module, SPPFCSPC, is proposed to expand the sensing field and improve the accuracy of boundary detection. It also combines the Coordinate Attention (CA) and Dyhead dynamic detection head to suppress useless information and enhance the detection ability of small targets. Compared to YOLOv4, YOLOv7, and YOLOv8, mean accuracy percentage (mAP) improved by 5.7%, 9.4%, 8.3%. Compared to the base YOLOv5 model, mAP has improved from 86.5% to 91.9%, running on 2 GB Jeston nano, and the improved model is 5.03 frames per second (FPS) faster than YOLOv5. Experiments show that the designed module is more effective in Waxberry detection tasks. Waxberry is a berry plant, a type of agricultural by-product, whose harvesting problems have been affected. We improve the target detection model and innovate a new convolutional module to achieve an average accuracy of 91.6%, and the network speed we tested in Jeston nano reaches 11.86 FPS, which is in line with the development of subsequent picking robots. image
引用
收藏
页码:1796 / 1808
页数:13
相关论文
共 48 条
  • [1] Detection of tomato plant phenotyping traits using YOLOv5-based single stage detectors
    Cardellicchio, Angelo
    Solimani, Firozeh
    Dimauro, Giovanni
    Petrozza, Angelo
    Summerer, Stephan
    Cellini, Francesco
    Reno, Vito
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 207
  • [2] Chen, 2022, ARXIV
  • [3] Chollet F., XCEPTION DEEP LEARNI
  • [4] Dynamic Head: Unifying Object Detection Heads with Attentions
    Dai, Xiyang
    Chen, Yinpeng
    Xiao, Bin
    Chen, Dongdong
    Liu, Mengchen
    Yuan, Lu
    Zhang, Lei
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7369 - 7378
  • [5] Design and Implementation of a Flexible Neuromorphic Computing System for Affective Communication via Memristive Circuits
    Dong, Zhekang
    Ji, Xiaoyue
    Lai, Chun Sing
    Qi, Donglian
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2023, 61 (01) : 74 - 80
  • [6] Multiple Memristor Circuit Parametric Fault Diagnosis Using Feedback-Control Doublet Generator
    Dong, Zhekang
    Chaoyong, L.
    Donglian, Q.
    Lu, Li
    Duan, Shukai
    [J]. IEEE ACCESS, 2016, 4 : 2604 - 2614
  • [7] Fan Q., 2019, PROC CVPR IEEE
  • [8] A Dynamic Detection Method for Phenotyping Pods in a Soybean Population Based on an Improved YOLO-v5 Network
    Fu, Xiaoming
    Li, Aokang
    Meng, Zhijun
    Yin, Xiaohui
    Zhang, Chi
    Zhang, Wei
    Qi, Liqiang
    [J]. AGRONOMY-BASEL, 2022, 12 (12):
  • [9] A novel apple fruit detection and counting methodology based on deep learning and trunk tracking in modern orchard
    Gao, Fangfang
    Fang, Wentai
    Sun, Xiaoming
    Wu, Zhenchao
    Zhao, Guanao
    Li, Guo
    Li, Rui
    Fu, Longsheng
    Zhang, Qin
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 197
  • [10] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587