Synthetic Depth Image-Based Category-Level Object Pose Estimation With Effective Pose Decoupling and Shape Optimization

被引:0
作者
Yu, Sheng [1 ]
Zhai, Di-Hua [1 ]
Xia, Yuanqing [1 ,2 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Zhongyuan Univ Technol, Sch Automat, Zhengzhou 450007, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
Pose estimation; Three-dimensional displays; Point cloud compression; Solid modeling; Shape; Feature extraction; Computational modeling; 3-D reconstruction; object detection; point sampling; pose estimation; shape optimization;
D O I
10.1109/TIM.2024.3427799
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Category-level object pose estimation is a crucial task in the field of computer vision and finds numerous applications. However, the presence of unknown objects, significant shape, and scale variations within the same category pose challenges in this task. To address these challenges and achieve efficient and accurate category-level object pose estimation, we present EffectPose in this article. We first observe that objects of the same category often possess similar key regions, such as handles on cups. These key regions can establish correspondences for spatial poses, enabling pose estimation. To facilitate this, we employ a segmentation network to divide point clouds into multiple parts and map them to a shared latent space. Subsequently, by considering the correspondences between predicted implicit models and real point clouds for various key regions, we accomplish pose estimation. Since real object point clouds are typically dense and contain outliers, we propose a novel point cloud sampling network that can accurately select representative points for efficient correspondence construction. Furthermore, we decouple the scale and pose of objects based on the SIM(3) invariant descriptor and propose an online pose optimization method using this descriptor. This method enables online prediction and optimization of poses. Finally, to enhance pose estimation accuracy, we introduce a distance-weighted pose optimization method for pose refinement and adjustment. Experimental results demonstrate that our proposed method achieves efficient pose estimation and generalization by utilizing only synthetic depth images and a minimal number of network parameters, surpassing the performance of most existing methods.
引用
收藏
页数:18
相关论文
共 46 条
  • [21] Refined Prior Guided Category-Level 6D Pose Estimation and Its Application on Robotic Grasping
    Sun, Huimin
    Zhang, Yilin
    Sun, Honglin
    Hashimoto, Kenji
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [22] Incorporating structure from motion uncertainty into image-based pose estimation
    Ludington, Ben T.
    Brown, Andrew P.
    Sheffler, Michael J.
    Taylor, Clark N.
    Berardi, Stephen
    GEOSPATIAL INFORMATICS, FUSION, AND MOTION VIDEO ANALYTICS V, 2015, 9473
  • [23] A Transformer-Based Network for Full Object Pose Estimation with Depth Refinement
    Abdulsalam, Mahmoud
    Ahiska, Kenan
    Aouf, Nabil
    ADVANCED INTELLIGENT SYSTEMS, 2024, 6 (10)
  • [24] Resolving Symmetry Ambiguity in Correspondence-Based Methods for Instance-Level Object Pose Estimation
    Lin, Yongliang
    Su, Yongzhi
    Inuganti, Sandeep
    Di, Yan
    Ajilforoushan, Naeem
    Yang, Hanqing
    Zhang, Yu
    Rambach, Jason
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1700 - 1711
  • [25] CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    Li, Dong
    Zhao, Shiqi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1665 - 1680
  • [26] RGB-D Image-based Pose Estimation with Monte Carlo Localization
    Li, Ming
    Qin, Hao
    Huang, May
    Cao, Jian
    Zhang, Xing
    2017 3RD INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2017, : 109 - 114
  • [27] Image-Based Tactile Deformation Simulation and Pose Estimation for Robot Skill Learning
    Fu, Chenfeng
    Li, Longnan
    Gao, Yuan
    Wan, Weiwei
    Harada, Kensuke
    Lu, Zhenyu
    Yang, Chenguang
    APPLIED SCIENCES-BASEL, 2025, 15 (03):
  • [28] MR-CapsNet: A Deep Learning Algorithm for Image-Based Head Pose Estimation on CapsNet
    Fang, Hao
    Liu, Jun-Qing
    Xie, Kai
    Wu, Peng
    Zhang, Xin-Yu
    Wen, Chang
    He, Jian-Biao
    IEEE ACCESS, 2021, 9 : 141245 - 141257
  • [29] Appearance-based object pose estimation and misestimation detection by shape fitness
    Nishikawa, Ryo
    Noguchi, Haruka
    Yamazaki, Taro
    Nakamura, Akio
    Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2013, 79 (11): : 1050 - 1057
  • [30] Lie-X: Depth Image Based Articulated Object Pose Estimation, Tracking, and Action Recognition on Lie Groups
    Xu, Chi
    Govindarajan, Lakshmi Narasimhan
    Zhang, Yu
    Stewart, James
    Bichler, Zoe
    Jesuthasan, Suresh
    Claridge-Chang, Adam
    Mathuru, Ajay Sriram
    Tang, Wenlong
    Zhu, Peixin
    Cheng, Li
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 123 (03) : 454 - 478