LES-YOLO: A lightweight pinecone detection algorithm based on improved YOLOv4-Tiny network

被引:39
作者
Cui, Mingdi [1 ]
Lou, Yunyi [1 ]
Ge, Yilin [1 ]
Wang, Keqi [1 ]
机构
[1] Northeast Forestry Univ, Coll Mech & Elect Engn, Harbin 150040, Peoples R China
关键词
Pinecone detection; YOLOv4-Tiny; Lightweight Enhance ShuffleNet; Depthwise separable convolution; Channel compression; CONVOLUTIONAL NETWORKS;
D O I
10.1016/j.compag.2023.107613
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Pinecones are an important forest product that are often harvested manually. Automated pinecone detection and harvesting might increase the yield and resolve the problem of labor shortage. The quick and accurate detection of pinecones is one of the most critical technologies for ground picking equipment and very significant for reducing the leakage rate and increasing the yield. Currently, most solutions use deep learning to achieve pinecone recognition. However, deep learning has poor detection effect for small targets, large parameters, and slow operation speed; thus, there is scope for improving the pinecone recognition task. This study proposes a lightweight pinecone detection algorithm based on an improved You Only Look Once version 4 tiny (YOLOv4Tiny) algorithm. A lightweight Enhance ShuffleNet is used as a backbone to extract the pinecone features. A squeeze-and-excitation feature pyramid network is used to fuse multi-scale information, retaining only a 26 x 26 detection head for pinecone prediction. In addition, focal loss and alpha-complete intersection over union loss are used to reduce undetected and false detection rates. Data augmentation is used to improve the generalization of the algorithm, and channel compression reduces the number of model parameters. The average precision of the improved network reaches 95.33%, which is an increase of 3.56% compared with the original YOLOv4-Tiny network. Additionally, the parameters, floating point operations, and detection time of one picture are compressed to 12.22%, 17.35%, and 67.41% of the original network, respectively. Comparative experiments were conducted using the proposed method and common target- detection algorithms. The findings show that the average precision of the proposed method is higher than the corresponding values of the YOLOv4, YOLOv5x, YOLOv5s, and YOLOX-Tiny algorithms (95.17%, 95.27%, 92.10%, and 90.47%, respectively), with parameters decreasing to 1.13%, 0.83%, 10.20%, and 14.34%, respectively. The proposed algorithm detects 135 pictures per second to achieve real-time effects. Further, it can distinguish pinecones quickly and accurately in a near-color background and provide a more precise and lightweight solution for pickup and acquisition equipment. In addition, the design process of the proposed algorithm could be a reference for neural network design for agricultural applications.
引用
收藏
页数:13
相关论文
共 38 条
[1]  
[Anonymous], ADV NEURAL INFORM PR
[2]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[3]  
Chen PG, 2024, Arxiv, DOI arXiv:2001.04086
[4]  
Howard AG, 2017, Arxiv, DOI arXiv:1704.04861
[5]  
Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, 10.48550/arXiv.2107.08430, DOI 10.48550/ARXIV.2107.08430]
[6]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[7]   Region-Based Convolutional Networks for Accurate Object Detection and Segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (01) :142-158
[8]   GhostNet: More Features from Cheap Operations [J].
Han, Kai ;
Wang, Yunhe ;
Tian, Qi ;
Guo, Jianyuan ;
Xu, Chunjing ;
Xu, Chang .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1577-1586
[9]  
Hannan M.W., 2009, AGR ENG INT CIGR E J
[10]   Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) :1904-1916