Unreal mask: one-shot multi-object class-based pose estimation for robotic manipulation using keypoints with a synthetic dataset

被引:0
作者
S. H. Zabihifar
A. N. Semochkin
E. V. Seliverstova
A. R. Efimov
机构
[1] PJSC Sber,Sber Robotics Laboratory
[2] National Research Technology University “MISiS”,undefined
来源
Neural Computing and Applications | 2021年 / 33卷
关键词
Pose estimation; Synthetic dataset; Object grasping; Deep learning; Robotic manipulation; Sim to real;
D O I
暂无
中图分类号
学科分类号
摘要
Object pose estimation is a prerequisite for many robotic applications. Preparing dataset for network training is a challenging part of the pose estimation approaches, and in most of them, the network can detect just the trained objects. Synthetic data are used to train deep neural networks in robotic manipulation as a promising method for obtaining a huge amount of prelabeled training data, which are generated safely. We are to investigate the reality gap in the pose estimation of intra-category objects from a single RGB-D image using keypoints. The proposed approach in this paper provides a fast and simple procedure for training a deep neural network to identify the object and its keypoints based on synthetic dataset and autolabeling program. To our knowledge, this is the first deep network trained only on synthetic data that can find keypoints of intra-category objects for pose estimation purposes. The speed of training and the simplicity of this method make it very easy to add a new class of objects to the system which is the main advantage of this approach. Using this approach, we demonstrate a near-real-time system estimating object poses with sufficient accuracy for real-world semantic grasping and manipulating of intra-category objects in clutter by a real robot.
引用
收藏
页码:12283 / 12300
页数:17
相关论文
共 15 条
[1]  
Wong JM(2017)SegICP: integrated deep semantic segmentation and pose estimation IEEE Int Conf Intell Robot Syst 2017 5784-5789
[2]  
Li Y(2019)DeepIM: deep iterative matching for 6D pose estimation Int J Comput Vis 128 657-427
[3]  
Wang G(2017)Self-supervised visual descriptor learning for dense correspondence IEEE Robot Autom Lett 2 420-724
[4]  
Ji X(2015)Deep learning for detecting robotic grasps Int J Rob Res 34 705-397
[5]  
Xiang Y(2020)Mask R-CNN IEEE Trans Pattern Anal Mach Intell 42 386-undefined
[6]  
Fox D(undefined)undefined undefined undefined undefined-undefined
[7]  
Schmidt T(undefined)undefined undefined undefined undefined-undefined
[8]  
Newcombe R(undefined)undefined undefined undefined undefined-undefined
[9]  
Fox D(undefined)undefined undefined undefined undefined-undefined
[10]  
Lenz I(undefined)undefined undefined undefined undefined-undefined