i2c-net: Using Instance-Level Neural Networks for Monocular Category-Level 6D Pose Estimation

被引:14
作者
Remus, Alberto [1 ]
D'Avella, Salvatore [1 ]
Di Felice, Francesco [1 ]
Tripicchio, Paolo [1 ]
Avizzano, Carlo Alberto [1 ]
机构
[1] Scuola Super Santana, Mech Intelligence Inst, Dept Excellence Robot & AI, I-56127 PI Pisa, Italy
关键词
Three-dimensional displays; Pose estimation; Solid modeling; Robots; Grasping; Training; Image reconstruction; Perception for grasping and manipulation; deep learning for visual perception; RGB-D perception;
D O I
10.1109/LRA.2023.3240362
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Object detection and pose estimation are strict requirements for many robotic grasping and manipulation applications to endow robots with the ability to grasp objects with different properties in cluttered scenes and with various lighting conditions. This work proposes the framework i2c-net to extract the 6D pose of multiple objects belonging to different categories, starting from an instance-level pose estimation network and relying only on RGB images. The network is trained on a custom-made synthetic photo-realistic dataset, generated from some base CAD models, opportunely deformed, and enriched with real textures for domain randomization purposes. At inference time, the instance-level network is employed in combination with a 3D mesh reconstruction module, achieving category-level capabilities. Depth information is used for post-processing as a correction. Tests conducted on real objects of the YCB-V and NOCS-REAL datasets outline the high accuracy of the proposed approach.
引用
收藏
页码:1515 / 1522
页数:8
相关论文
共 32 条
[1]   Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations [J].
Ahmadyan, Adel ;
Zhang, Liangkai ;
Ablavatski, Artsiom ;
Wei, Jianing ;
Grundmann, Matthias .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :7818-7827
[2]   Progress and prospects of the human-robot collaboration [J].
Ajoudani, Arash ;
Zanchettin, Andrea Maria ;
Ivaldi, Serena ;
Albu-Schaeffer, Alin ;
Kosuge, Kazuhiro ;
Khatib, Oussama .
AUTONOMOUS ROBOTS, 2018, 42 (05) :957-975
[3]   A METHOD FOR REGISTRATION OF 3-D SHAPES [J].
BESL, PJ ;
MCKAY, ND .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (02) :239-256
[4]   FS-Net: Fast Shape-based Network for Category-Level 6D Object Pose Estimation with Decoupled Rotation Mechanism [J].
Chen, Wei ;
Jia, Xi ;
Chang, Hyung Jin ;
Duan, Jinming ;
Shen, Linlin ;
Leonardis, Ales .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :1581-1590
[5]   ROS-Industrial based robotic cell for Industry 4.0: Eye-in-hand stereo camera and visual servoing for flexible, fast, and accurate picking and hooking in the line [J].
D'Avella, Salvatore ;
Avizzano, Carlo Alberto ;
Tripiccho, Paolo .
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2023, 80
[6]   A study on picking objects in cluttered environments: Exploiting depth features for a custom low-cost universal jamming gripper [J].
D'Avella, Salvatore ;
Tripicchio, Paolo ;
Avizzano, Carlo Alberto .
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2020, 63
[7]  
Denninger M, 2019, Arxiv, DOI arXiv:1911.01911
[8]   SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation [J].
Di, Yan ;
Manhardt, Fabian ;
Wang, Gu ;
Ji, Xiangyang ;
Navab, Nassir ;
Tombari, Federico .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :12376-12385
[9]   Vision-based robotic grasping from object localization, object pose estimation to grasp estimation for parallel grippers: a review [J].
Du, Guoguang ;
Wang, Kai ;
Lian, Shiguo ;
Zhao, Kaiyong .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (03) :1677-1734
[10]  
Fan J. Z., 2023, ACM COMPUT SURV, V55, P1