Multi-Object Recognition and 6-DoF Pose Estimation Based on Synthetic Datasets

被引：0

作者：

Hu G. ^{[1
]}

Ou M. ^{[1
]}

Li Z. ^{[1
]}

机构：

[1] School of Mechanical and Automotive Engineering, South China University of Technology, Guangdong, Guangzhou

来源：

Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science) | 2024年 / 52卷 / 04期

关键词：

6-DoF pose estimation; object recognition; position measurement; RGB-D image; robot automatic sorting;

D O I：

10.12141/j.issn.1000-565X.230327

中图分类号：

学科分类号：

摘要：

Multi-object recognition and 6-DoF (degree of freedom) pose estimation are the key to achieve automatic sorting of robots in the state of unordered stacking of materials. In recent years, methods based on deep neural networks have received much attention in the multi-object recognition and 6-DoF pose estimation fields. Such methods rely on a large number of training samples, however, the collection and labeling of samples is time-consuming and laborious, which limits its application. In addition, when the imaging conditions are poor and the targets are occluded by each other, the existing pose estimation methods cannot guarantee the reliability of the results, resulting in grasping failures. To this end, this paper presented a method for target recognition, segmentation and pose estimation based on synthetic data samples. Firstly, multi-view RGB-D synthetic images of virtual scenes were generated using 3D graphics programming tools based on the 3D geometric models of the target objects, and then style transfer and noise enhancement was performed, respectively, on the generated RGB images and the depth images to improve their realism, so that they are suited for the detection in real scenes. Next, the YOLOv7-mask instance segmentation model was trained with synthetic datasets and tested by real data. The results demonstrate the effectiveness of the proposed method. Secondly, the ES6D model was utilized to estimate target poses based on the segmentation results, and an online posture evaluation method was proposed to automatically filter out severely distorted estimation results. Finally, a pose estimation correction strategy based on active vision technique was proposed to guide the robot arm to move to a new viewpoint for re-detection, which can effectively solve the problem of pose estimation deviation caused by occlusion. The above methods have been verified on a self-built 6-DoF industrial robot vision sorting system. The experimental results show that the proposed algorithm can well meet the requirements of recognition and 6-DoF posture estimation of common workpieces in complex environments. © 2024 South China University of Technology. All rights reserved.

引用

页码：42 / 50

页数：8

共 17 条

[1] ZHAI Jing-mei, HUANG Le, Review of unordered picking technology for robots ［J］, Packaging Engineering, 43, 8, pp. 66-75, (2022)
[2] WANG Gao, CHEN Xiaohong, LIU Ning, A robot grasping policy based on viewpoint selection experience enhancement algorithm, Journal of South China University of Technology （Natural Science Edition）, 50, 9, pp. 126-137, (2022)
[3] HINTERSTOISSER S，, LEPETIT V， ILIC S， et al．Model based training， detection and pose estimation of texture-less 3D objects in heavily cluttered scenes ［C］, Proceedings of the 11th Asian Conference on Computer Vision, pp. 548-562, (2013)
[4] DROST B，, ULRICH M，, NAVAB N, Model globally， match locally： efficient and robust 3D object recognition ［C］, Proceedings of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 998-1005, (2010)
[5] PENG S，, LIU Y，, HUANG Q, PVNet： pixel-wise voting network for 6DoF pose estimation ［C］, Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4561-4570, (2019)
[6] PVN3D：a deep point-wise 3D keypoints voting network for 6DoF pose estimation ［C］, Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11632-11641, (2020)
[7] XIANG Y，, SCHMIDT T，, NARAYANAN V, PoseCNN： a convolutional neural network for 6D object pose estimation in cluttered scenes
[8] WANG C，, XU D，, ZHU Y, DenseFusion： 6D object pose estimation by iterative dense fusion ［C］, Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3343-3352, (2019)
[9] ES6D：a computation efficient and symmetry-aware 6D pose regression framework ［C］, Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6718-6727, (2022)
[10] HAGELSKJAER F，, BUCH A G．, Bridging the reality gap for pose estimation networks using sensor-based domain randomization ［C］, Proceedings of 2021 IEEE/ CVF International Conference on Computer Vision Workshops, pp. 935-944, (2021)

← 1 2 →