Method for locating picking points of grape clusters using multi-object recognition

被引：5

作者：

Zhou X. ^{[1
,2
]}

Wu F. ^{[3
,4
]}

Zou X. ^{[2
,4
]}

Meng H. ^{[1
]}

Zhang Y. ^{[2
,5
]}

Luo X. ^{[1
,4
]}

机构：

[1] College of Mechanical and Electrical Engineering, Shihezi University, Shihezi

[2] Foshan-Zhongke Innovation Research Institute of Intelligent Agriculture, Foshan

[3] Guangzhou College of Commerce, Guangzhou

[4] College of Engineering, South China Agricultural University, Guangzhou

[5] Zhongkai University of Agriculture and Engineering, Guangzhou

来源：

Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering | 2023年 / 39卷 / 22期

关键词：

grape peduncle; key structure constraint and fusion; machine vision; multi-object recognition; picking; picking point; robot; YOLACT++;

D O I：

10.11975/j.issn.1002-6819.202309105

中图分类号：

学科分类号：

摘要：

Grape-picking robots can be an effective solution to deal with the contradiction between manual labor efficiency and the limited harvesting period, with the rapid development of machine vision and artificial intelligence. The varying sizes and shapes of grape key structures have limited the working space of the robot at the grape harvesting stage. Improper positioning of picking points can also lead to collisions between the robot's end and the grapes, even the damage and dropping. In addition, it is necessary to consider such collisions between the robot's end and the fruit branches. The reason is that these collisions can result in failed picking, damage to the branches, and the risk of fungal infection in the fruit trees. In this study, the localization algorithm was proposed for the picking points of grape key structures using deep learning and multi-object recognition. Picking point localization was enhanced to reduce the grape damage and the failure rate during harvesting. Firstly, the G-YOLACT++ model incorporated the SimAM attention module and Mish activation function to optimize the YOLACT++ model. Then the key grape structures were detected, such as grape-bearing branches, grape peduncles, and grape clusters. As such, these grape structures in the multi-adjacent clusters were segmented into multiple masks within the field of view. The membership of grape key structures was determined within the same cluster using their intersection and relative positions. The same string of grapes was then merged to select the Region of Interest (ROI) area with the low collision for grape pedicles. The range of re-selection was also designed to locate the picking point. The experimental results demonstrated that the incorporation of the SimAM attention mechanism into the YOLACT++ model resulted in an improved mean average precision (mAP) for the mask. The Mish activation function was selected to replace the ReLU in the backbone network. After that, the mAP values of the mask and bounding box increased by 0.3 and 2.23 percentage points, respectively. Both modifications were greatly contributed to the enhancement of the performance. The average mAP values of the bounding box and mask in G-YOLACT++ were improved by 0.83 and 0.88 percentage points, respectively, compared with the YOLACT++. By contrast, the mAP values of the improved model for the bounding box and mask increased by 2.36 and 2.13 percentage points, respectively, compared with the original. Furthermore, the sizes of all the improved models remained unchanged, while there was a relatively slight improvement in the inference speed. Therefore, there was a positive effect of improvement on the performance of the models. The correctness rates of the single and multiple fruit samples were 88% and 90%, respectively, for the key structure-dependent judgment and fusion. The correctness rate was 92.3% for the removal of grape clusters with the incomplete recognition of key structures. Compared to the two positioning methods that use the center of the bounding rectangle enclosing the grape peduncles in ROI and the centroid of the grape peduncles identified by the model as the picking points, the success rates of the picking point localization method in this study were improved by 10.95 and 81.75 percentage points, respectively. These results demonstrated the research could be a viable support to the optimization of grape picking robots and lays the foundation for low-damage harvesting of clustered fruits in unstructured environments. © 2023 Chinese Society of Agricultural Engineering. All rights reserved.

引用

页码：166 / 177

页数：11

共 44 条

[1] SUN Jun, GONG Dongjian, YAO Kunshan, Et al., Real-time semantic segmentation method for field grapes based on channel feature pyramid, Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 38, 17, pp. 150-157, (2022)
[2] WANG J, ZHANG Z, LUO L, Et al., DualSeg: Fusing transformer and CNN structure for image segmentation in complex vineyard environment, Computers and Electronics in Agriculture, 206, (2023)
[3] LO PIANO S, PARENTI A, GUERRINI L., Uncertainty appraisal provides useful information for the management of a manual grape harvest, Biosystems Engineering, 219, pp. 259-267, (2022)
[4] ZHENG Yongjun, JIANG Shijie, CHEN Bingtai, Et al., Review on technology and equipment of mechanization in hilly orchard review on technology and equipment of mechanization in hilly orchard, Transactions of the Chinese Society for Agricultural Machinery, 51, 11, pp. 1-20, (2020)
[5] ZHOU X., ZOU X, TANG W, Et al., Unstructured road extraction and roadside fruit recognition in grape orchards based on a synchronous detection algorithm, Frontiers in Plant Science, 14, (2023)
[6] BARBOLE D K, JADHAV P M, PATIL S B., A review on fruit detection and segmentation techniques in agricultural field, Second International Conference on Image Processing and Capsule Networks, pp. 269-288, (2022)
[7] LIN Junqiang, WANG Hongjun, ZOU Xiangjun, Et al., Obstacle avoidance path planning and simulation of mobile picking robot based on DPPO, Journal of System Simulation, 35, 8, pp. 1692-1704, (2023)
[8] ZHAO Xiong, CAO Gonghao, ZHANG Pengfei, Et al., Dynamic analysis and lightweight design of 3-DOF apple picking manipulator, Transactions of the Chinese Society for Agricultural Machinery, 54, 7, pp. 88-98, (2023)
[9] CHENG Jiabing, ZOU Xiangjun, CHEN Mingyou, Et al., General 3D perception framework for multiple types complex fruit objects, Automation & Information Engineering, 42, 3, pp. 15-20, (2021)
[10] ZHENG Taixiong, JIANG Mingzhe, FENG Mingchi, Vision based target recognition and location for picking robot: A review, Chinese Journal of Scientific Instrument, 42, 9, pp. 28-51, (2021)

← 1 2 3 4 5 →