Synthetic Depth Image-Based Category-Level Object Pose Estimation With Effective Pose Decoupling and Shape Optimization

被引:4
作者
Yu, Sheng [1 ]
Zhai, Di-Hua [1 ]
Xia, Yuanqing [1 ,2 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Zhongyuan Univ Technol, Sch Automat, Zhengzhou 450007, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
Pose estimation; Three-dimensional displays; Point cloud compression; Solid modeling; Shape; Feature extraction; Computational modeling; 3-D reconstruction; object detection; point sampling; pose estimation; shape optimization;
D O I
10.1109/TIM.2024.3427799
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Category-level object pose estimation is a crucial task in the field of computer vision and finds numerous applications. However, the presence of unknown objects, significant shape, and scale variations within the same category pose challenges in this task. To address these challenges and achieve efficient and accurate category-level object pose estimation, we present EffectPose in this article. We first observe that objects of the same category often possess similar key regions, such as handles on cups. These key regions can establish correspondences for spatial poses, enabling pose estimation. To facilitate this, we employ a segmentation network to divide point clouds into multiple parts and map them to a shared latent space. Subsequently, by considering the correspondences between predicted implicit models and real point clouds for various key regions, we accomplish pose estimation. Since real object point clouds are typically dense and contain outliers, we propose a novel point cloud sampling network that can accurately select representative points for efficient correspondence construction. Furthermore, we decouple the scale and pose of objects based on the SIM(3) invariant descriptor and propose an online pose optimization method using this descriptor. This method enables online prediction and optimization of poses. Finally, to enhance pose estimation accuracy, we introduce a distance-weighted pose optimization method for pose refinement and adjustment. Experimental results demonstrate that our proposed method achieves efficient pose estimation and generalization by utilizing only synthetic depth images and a minimal number of network parameters, surpassing the performance of most existing methods.
引用
收藏
页数:18
相关论文
共 50 条
[21]   Self-Supervised Category-Level 6D Object Pose Estimation With Optical Flow Consistency [J].
Zaccaria, Michela ;
Manhardt, Fabian ;
Di, Yan ;
Tombari, Federico ;
Aleotti, Jacopo ;
Giorgini, Mikhail .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (05) :2510-2517
[22]   i2c-net: Using Instance-Level Neural Networks for Monocular Category-Level 6D Pose Estimation [J].
Remus, Alberto ;
D'Avella, Salvatore ;
Di Felice, Francesco ;
Tripicchio, Paolo ;
Avizzano, Carlo Alberto .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (03) :1515-1522
[23]   Image-based aircraft pose estimation using moment invariants [J].
Breuers, MG .
AUTOMATIC TARGET RECOGNITION IX, 1999, 3718 :294-304
[24]   CSS-Net: Domain Generalization in Category-level Pose Estimation via Corresponding Structural Superpoints [J].
Jin, Li ;
Song, Xibin ;
Li, Jia ;
Tu, Changhe ;
Qin, Xueying .
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,
[25]   Human pose estimation method based on single depth image [J].
Wu, Qingqiang ;
Xu, Guanghua ;
Li, Min ;
Chen, Longting ;
Zhang, Xin ;
Xie, Jun .
IET COMPUTER VISION, 2018, 12 (06) :919-924
[26]   Category Level Object Pose Estimation via Global High-Order Pooling [J].
Jiang, Changhong ;
Mu, Xiaoqiao ;
Zhang, Bingbing ;
Xie, Mujun ;
Liang, Chao .
ELECTRONICS, 2024, 13 (09)
[27]   Refined Prior Guided Category-Level 6D Pose Estimation and Its Application on Robotic Grasping [J].
Sun, Huimin ;
Zhang, Yilin ;
Sun, Honglin ;
Hashimoto, Kenji .
APPLIED SCIENCES-BASEL, 2024, 14 (17)
[28]   Incorporating structure from motion uncertainty into image-based pose estimation [J].
Ludington, Ben T. ;
Brown, Andrew P. ;
Sheffler, Michael J. ;
Taylor, Clark N. ;
Berardi, Stephen .
GEOSPATIAL INFORMATICS, FUSION, AND MOTION VIDEO ANALYTICS V, 2015, 9473
[29]   A Transformer-Based Network for Full Object Pose Estimation with Depth Refinement [J].
Abdulsalam, Mahmoud ;
Ahiska, Kenan ;
Aouf, Nabil .
ADVANCED INTELLIGENT SYSTEMS, 2024, 6 (10)
[30]   Resolving Symmetry Ambiguity in Correspondence-Based Methods for Instance-Level Object Pose Estimation [J].
Lin, Yongliang ;
Su, Yongzhi ;
Inuganti, Sandeep ;
Di, Yan ;
Ajilforoushan, Naeem ;
Yang, Hanqing ;
Zhang, Yu ;
Rambach, Jason .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 :1700-1711