Improved YOLOv7 based apple target detection in complex environment

Cited by: 0
Authors
Mo, Henghui [1]
Wei, Linjing [1]
Affiliations
[1] College of Information Science and Technology, Gansu Agricultural University, Lanzhou
Source
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science) | 2024, Vol. 58, No. 12
Keywords
activation function; apple target detection; attention mechanism; Grad-CAM; small target detection; YOLOv7
DOI
10.3785/j.issn.1008-973X.2024.12.004
Abstract
Robotic harvesters struggle to identify apples under complex natural conditions such as unstable lighting, high fruit diversity, and severe leaf occlusion, which hinder the capture of key features and reduce harvesting efficiency and accuracy. An improved apple detection algorithm based on the YOLOv7 model was proposed for such complex scenarios. Contrast-limited adaptive histogram equalization (CLAHE) was employed to enhance the contrast of apple images, reducing background interference and sharpening target contours. A multi-scale hybrid adaptive attention mechanism was introduced: features were decomposed and reconstructed, and spatial and channel attention were jointly integrated to optimize multi-layer feature modeling over various distances, thereby improving the model's ability to extract apple features and resist background noise. Full-dimensional dynamic convolution was implemented to refine feature selection through a fine-grained attention mechanism. The number of detection heads was increased to improve the detection of small targets. The Meta-ACON activation function was used to optimize attention allocation during feature extraction. Experimental results showed that the improved YOLOv7 model achieved an average precision of 85.7% and an average recall of 87.0%. Compared with Faster R-CNN, SSD, YOLOv5, and the original YOLOv7, the average detection precision improved by 15.2, 7.5, 4.5, and 2.5 percentage points, and the average recall improved by 13.7, 6.5, 3.6, and 1.3 percentage points, respectively. The improved model thus outperformed all four baselines on both metrics and can provide technical support for apple growth monitoring and mechanical harvesting research. © 2024 Zhejiang University. All rights reserved.
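The abstract reports the improved model's scores and its gains over each baseline in percentage points, but not the baseline scores themselves. The sketch below recovers the implied baseline values by simple subtraction. It assumes the "average accuracy" figure of 85.7% is the same metric as the "average detection precision" used in the comparison; the names `IMPROVED`, `GAINS`, and `implied_baselines` are illustrative and do not come from the paper.

```python
# Figures quoted in the abstract (scores in percent, gains in percentage points).
# Assumption: the 85.7% "average accuracy" is the average detection precision.
IMPROVED = {"precision": 85.7, "recall": 87.0}
GAINS = {
    "Faster R-CNN": {"precision": 15.2, "recall": 13.7},
    "SSD": {"precision": 7.5, "recall": 6.5},
    "YOLOv5": {"precision": 4.5, "recall": 3.6},
    "YOLOv7": {"precision": 2.5, "recall": 1.3},
}


def implied_baselines(improved, gains):
    """Subtract each percentage-point gain from the improved model's score."""
    return {
        model: {
            metric: round(improved[metric] - delta, 1)
            for metric, delta in deltas.items()
        }
        for model, deltas in gains.items()
    }


baselines = implied_baselines(IMPROVED, GAINS)
for model, scores in baselines.items():
    print(f"{model}: precision {scores['precision']}%, recall {scores['recall']}%")
```

These are derived values, not figures reported in the abstract. Under the stated assumption, the original YOLOv7 would sit at roughly 83.2% precision and 85.7% recall.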
Pages: 2447-2458
Page count: 11
Related papers
26 in total
  • [1] HUO Xuexi, LIU Tianjun, LIU Jundi, et al., 2020 China apple industry development report: simplified version [J], Chinese Fruits and Vegetables, 42, 2, pp. 1-6, (2022)
  • [2] Editorial Board of China Fruit Industry Information, Distribution and change of major fruit production in provinces (autonomous regions and municipalities) of China in 2021, China Fruit Industry Information, 38, 11, (2023)
  • [3] MEN Xiaopeng, XU Xuefeng, WANG Yubin, et al., Current status, problems and development strategies of apple production in China [J], Rural Practical Technology, 1, pp. 25-27, (2022)
  • [4] ZHAO Ying, XIAO Hongru, MEI Song, et al., Current situation and development strategy of mechanized production of orchards in China [J], Journal of China Agricultural University, 22, 6, pp. 116-127, (2017)
  • [5] LU Mengmeng, JIANG Shan, ZHANG Guohao, et al., Research progress on chemical flower and fruit thinning technology of apples [J], Chinese Fruit Tree, 210, 4, pp. 4-7, (2021)
  • [6] LI Chengpeng, SHANG Shuqi, WANG Dongwei, et al., Design and test of vibrating high-acid apple picking machine [J], Agricultural Mechanization Research, 46, 4, pp. 106-113, (2024)
  • [7] BU L, HU G, CHEN C, et al., Experimental and simulation analysis of optimum picking patterns for robotic apple harvesting, Scientia Horticulturae, 261, (2020)
  • [8] HE K, ZHANG X, REN S, et al., Deep residual learning for image recognition [C], IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, (2016)
  • [9] LI J, LIANG X, SHEN S M, et al., Scale-aware fast R-CNN for pedestrian detection [J], IEEE Transactions on Multimedia, 20, 4, pp. 985-996, (2017)
  • [10] REN S, HE K, GIRSHICK R, et al., Faster R-CNN: towards real-time object detection with region proposal networks [J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 6, pp. 1137-1149, (2016)