Synthetic Depth Image-Based Category-Level Object Pose Estimation With Effective Pose Decoupling and Shape Optimization

被引:4
作者
Yu, Sheng [1 ]
Zhai, Di-Hua [1 ]
Xia, Yuanqing [1 ,2 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Zhongyuan Univ Technol, Sch Automat, Zhengzhou 450007, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
Pose estimation; Three-dimensional displays; Point cloud compression; Solid modeling; Shape; Feature extraction; Computational modeling; 3-D reconstruction; object detection; point sampling; pose estimation; shape optimization;
D O I
10.1109/TIM.2024.3427799
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Category-level object pose estimation is a crucial task in the field of computer vision and finds numerous applications. However, the presence of unknown objects, significant shape, and scale variations within the same category pose challenges in this task. To address these challenges and achieve efficient and accurate category-level object pose estimation, we present EffectPose in this article. We first observe that objects of the same category often possess similar key regions, such as handles on cups. These key regions can establish correspondences for spatial poses, enabling pose estimation. To facilitate this, we employ a segmentation network to divide point clouds into multiple parts and map them to a shared latent space. Subsequently, by considering the correspondences between predicted implicit models and real point clouds for various key regions, we accomplish pose estimation. Since real object point clouds are typically dense and contain outliers, we propose a novel point cloud sampling network that can accurately select representative points for efficient correspondence construction. Furthermore, we decouple the scale and pose of objects based on the SIM(3) invariant descriptor and propose an online pose optimization method using this descriptor. This method enables online prediction and optimization of poses. Finally, to enhance pose estimation accuracy, we introduce a distance-weighted pose optimization method for pose refinement and adjustment. Experimental results demonstrate that our proposed method achieves efficient pose estimation and generalization by utilizing only synthetic depth images and a minimal number of network parameters, surpassing the performance of most existing methods.
引用
收藏
页数:18
相关论文
共 50 条
[41]   I-HS4PCS: Object 6D Pose Estimation Method Based on Harris3D-ikdTree Optimization [J].
Zhang, Ziang ;
Li, Hongsheng .
IEEE ACCESS, 2024, 12 :138018-138026
[42]   Physics-based scene-level reasoning for object pose estimation in clutter [J].
Mitash, Chaitanya ;
Boularias, Abdeslam ;
Bekris, Kostas .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2022, 41 (06) :615-636
[43]   An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation [J].
Li, Jing ;
Wang, Jiang ;
Ullah, Farhan .
IEEE ACCESS, 2020, 8 :42458-42468
[44]   Single 2D Image-based 3D Face Reconstruction and Its Application in Pose Estimation [J].
Gong, Xun ;
Wang, Guoyin ;
Xiong, Lili .
FUNDAMENTA INFORMATICAE, 2009, 94 (02) :179-195
[45]   An image appearance based optimization scheme for monocular 6D pose estimation of SOR cabins [J].
Yin, Xuyue ;
Fan, Xiumin ;
Yang, Xu ;
Qiu, Shiguang .
OPTIK, 2019, 199
[46]   Depth-Based 3D Face Reconstruction and Pose Estimation Using Shape-Preserving Domain Adaptation [J].
Zhong Y. ;
Pei Y. ;
Li P. ;
Guo Y. ;
Ma G. ;
Liu M. ;
Bai W. ;
Wu W. ;
Zha H. .
Pei, Yuru (peiyuru@cis.pku.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc. (03) :6-15
[47]   Image-based occupancy positioning system using pose-estimation model for demand-oriented ventilation [J].
Wang, Huan ;
Wang, Guijin ;
Li, Xianting .
JOURNAL OF BUILDING ENGINEERING, 2021, 39
[48]   Confidence Indicators Based Pose Estimation for High-Quality 3D Reconstruction Using Depth Image [J].
Srinivasan, Ranga Ramanujam ;
Xia, Zhengyu ;
Kim, Joohee ;
Park, Young Soo .
2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
[49]   LA-Net: An End-to-End Category-Level Object Attitude Estimation Network Based on Multi-Scale Feature Fusion and an Attention Mechanism [J].
Wang, Jing ;
Liu, Guohan ;
Guo, Cheng ;
Ma, Qianglong ;
Song, Wanying .
ELECTRONICS, 2024, 13 (14)
[50]   Unreal mask: one-shot multi-object class-based pose estimation for robotic manipulation using keypoints with a synthetic dataset [J].
S. H. Zabihifar ;
A. N. Semochkin ;
E. V. Seliverstova ;
A. R. Efimov .
Neural Computing and Applications, 2021, 33 :12283-12300