Synthetic Depth Image-Based Category-Level Object Pose Estimation With Effective Pose Decoupling and Shape Optimization

被引:4
作者
Yu, Sheng [1 ]
Zhai, Di-Hua [1 ]
Xia, Yuanqing [1 ,2 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Zhongyuan Univ Technol, Sch Automat, Zhengzhou 450007, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
Pose estimation; Three-dimensional displays; Point cloud compression; Solid modeling; Shape; Feature extraction; Computational modeling; 3-D reconstruction; object detection; point sampling; pose estimation; shape optimization;
D O I
10.1109/TIM.2024.3427799
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Category-level object pose estimation is a crucial task in the field of computer vision and finds numerous applications. However, the presence of unknown objects, significant shape, and scale variations within the same category pose challenges in this task. To address these challenges and achieve efficient and accurate category-level object pose estimation, we present EffectPose in this article. We first observe that objects of the same category often possess similar key regions, such as handles on cups. These key regions can establish correspondences for spatial poses, enabling pose estimation. To facilitate this, we employ a segmentation network to divide point clouds into multiple parts and map them to a shared latent space. Subsequently, by considering the correspondences between predicted implicit models and real point clouds for various key regions, we accomplish pose estimation. Since real object point clouds are typically dense and contain outliers, we propose a novel point cloud sampling network that can accurately select representative points for efficient correspondence construction. Furthermore, we decouple the scale and pose of objects based on the SIM(3) invariant descriptor and propose an online pose optimization method using this descriptor. This method enables online prediction and optimization of poses. Finally, to enhance pose estimation accuracy, we introduce a distance-weighted pose optimization method for pose refinement and adjustment. Experimental results demonstrate that our proposed method achieves efficient pose estimation and generalization by utilizing only synthetic depth images and a minimal number of network parameters, surpassing the performance of most existing methods.
引用
收藏
页数:18
相关论文
共 50 条
[31]   CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer [J].
Yu, Sheng ;
Zhai, Di-Hua ;
Xia, Yuanqing ;
Li, Dong ;
Zhao, Shiqi .
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 :1665-1680
[32]   RGB-D Image-based Pose Estimation with Monte Carlo Localization [J].
Li, Ming ;
Qin, Hao ;
Huang, May ;
Cao, Jian ;
Zhang, Xing .
2017 3RD INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2017, :109-114
[33]   Image-Based Tactile Deformation Simulation and Pose Estimation for Robot Skill Learning [J].
Fu, Chenfeng ;
Li, Longnan ;
Gao, Yuan ;
Wan, Weiwei ;
Harada, Kensuke ;
Lu, Zhenyu ;
Yang, Chenguang .
APPLIED SCIENCES-BASEL, 2025, 15 (03)
[34]   MR-CapsNet: A Deep Learning Algorithm for Image-Based Head Pose Estimation on CapsNet [J].
Fang, Hao ;
Liu, Jun-Qing ;
Xie, Kai ;
Wu, Peng ;
Zhang, Xin-Yu ;
Wen, Chang ;
He, Jian-Biao .
IEEE ACCESS, 2021, 9 :141245-141257
[35]   Appearance-based object pose estimation and misestimation detection by shape fitness [J].
Nishikawa, Ryo ;
Noguchi, Haruka ;
Yamazaki, Taro ;
Nakamura, Akio .
Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2013, 79 (11) :1050-1057
[36]   Lie-X: Depth Image Based Articulated Object Pose Estimation, Tracking, and Action Recognition on Lie Groups [J].
Xu, Chi ;
Govindarajan, Lakshmi Narasimhan ;
Zhang, Yu ;
Stewart, James ;
Bichler, Zoe ;
Jesuthasan, Suresh ;
Claridge-Chang, Adam ;
Mathuru, Ajay Sriram ;
Tang, Wenlong ;
Zhu, Peixin ;
Cheng, Li .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 123 (03) :454-478
[37]   Lie-X: Depth Image Based Articulated Object Pose Estimation, Tracking, and Action Recognition on Lie Groups [J].
Chi Xu ;
Lakshmi Narasimhan Govindarajan ;
Yu Zhang ;
Li Cheng .
International Journal of Computer Vision, 2017, 123 :454-478
[38]   Image-based aircraft pose estimation: a comparison of simulations and real-world data [J].
Breuers, MG ;
de Reus, N .
AUTOMATIC TARGET RECOGNITION XI, 2001, 4379 :472-479
[39]   Depth Based Object Detection from Partial Pose Estimation of Symmetric Objects [J].
Barnea, Ehud ;
Ben-Shahar, Ohad .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :377-390
[40]   RGB-D-based categorical object pose and shape estimation: Methods, datasets, and evaluation [J].
Bruns, Leonard ;
Jensfelt, Patric .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 168