Fast Recognition of Greenhouse Tomato Targets Based on Attention Mechanism and Improved YOLO

被引:0
|
作者
Zhang J. [1 ]
Bi Z. [1 ]
Yan Y. [1 ]
Wang P. [1 ]
Hou C. [2 ]
Lu S. [2 ]
机构
[1] Mechanical and Electrical Engineering College, Beijing Information Science and Technology University, Beijing
[2] Chinese Academy of Agricultural Mechanization Sciences Croup Co., Ltd., Beijing
关键词
attention mechanism; greenhouse tomato; loss function; objection detection; YOLO;
D O I
10.6041/j.issn.1000-1298.2023.05.024
中图分类号
学科分类号
摘要
In order to realize the rapid and accurate recognition of greenhouse tomato fruit by agricultural picking robot in the complicated environment of greenhouse, a fast target detection method for greenhouse tomato fruit based on attention mechanism and improved YOLO v5s was proposed. According to the characteristics of small size and fast speed of YOLO v5s (You only look once v5s ) model, the convolutional block attention module (CBAM) was added into the backbone network. By concatenating spatial attention module and channel attention module, the problem of color similarity between green tomato fruit and its background was solved. More attention was paid to the target features of green tomato fruit to improve the recognition accuracy. Replacing GIoU Loss with CIoU Loss as the new loss function of the algorithm contributed to improve the positioning accuracy while improving the bounding box regression rate. The test results showed that the recognition accuracy of the CB - YOLO network model for red tomato fruit detecting precision and green tomato fruit detecting precision and mean average precision in greenhouse environment was 99. 88%, 98. 18% and 99. 53%, respectively. Compared with Faster R - CNN network model, YOLO v4 - tiny network model and YOLO v5 network model, the detection accuracy and the mean average precision were improved. The CB -YOLO model was deployed to Android system of mobile phones after being tested by different mobile phones, which verified the stability of the performance detection of the deployment model under actual working condition. It will provide technical support for target detection and harvesting based on robotic mobile edge computing in facility environments. © 2023 Chinese Society of Agricultural Machinery. All rights reserved.
引用
收藏
页码:236 / 243
页数:7
相关论文
共 22 条
  • [1] HUO Jianyong, Status and safety prevention of tomato industry in China, Vegetable, 6, pp. 1-4, (2016)
  • [2] LI Tianhua, SUN Meng, DING Xiaoming, Et al., Tomato recognition method at the ripening stage based on YOLO v4 and HSV, Transactions of the CSAE, 37, 21, pp. 183-190, (2021)
  • [3] LIANG Xifeng, ZHANG Yan, Recognition method of picking point for tomato cluster [J], Journal of Chinese Agricultural Mechanization, 37, 11, pp. 131-134, (2016)
  • [4] FENG Qingchun, CHENG Wei, YANG Qinghua, Et al., Identification and localization of overlapping tomatoes based on linear structured light vision system, Journal of China Agricultural University, 20, 4, pp. 100-106, (2015)
  • [5] MA Cuihua, ZHANG Xueping, LI Yutao, Et al., Identification of immature tomatoes base on salient region detection and improved Hough transform method [J], Transactions of the CSAE, 32, 14, pp. 219-226, (2016)
  • [6] ZHOU Yuncheng, XU Tongyu, ZHENG Wei, Et al., Classification and recognition approaches of tomato main organs based on DCNN[J], Transactions of the CSAE, 33, 15, pp. 219-226, (2017)
  • [7] LIU Z, WU J, FU L, Et al., Improved kiwifruit detection using pre-trained VGG16 with RGB and NIR information fusion[J], IEEE Access, 8, pp. 2327-2336, (2020)
  • [8] WANG Wenjie, GONG Liang, WANG Tao, Et al., Tomato fruit recognition based on multi-source fusion image segmentation algorithm in open environment, Transactions of the Chinese Society for Agricultural Machinery, 52, 9, pp. 156-164, (2021)
  • [9] GIRSHICK R, DONAHUE J, DARRELL T, Et al., Rich feature hierarchies for accurate object detection and semantic segmentation[C], IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580-587, (2014)
  • [10] LI J, LIANG X, SHEN S M, Et al., Scale-aware Fast R-CNN for pedestrian detection [C], IEEE Transactions on Multimedia, (2015)