HS-Pose: Hybrid Scope Feature Extraction for Category-level Object Pose Estimation

被引:22
作者
Zheng, Linfang [1 ,4 ]
Wang, Chen [1 ,2 ]
Sun, Yinghan [1 ]
Dasgupta, Esha [4 ]
Chen, Hua [1 ]
Leonardis, Ales [4 ]
Zhang, Wei [1 ,3 ]
Chang, Hyung Jin [4 ]
机构
[1] Southern Univ Sci & Technol, Dept Mech & Energy Engn, Shenzhen, Peoples R China
[2] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
[4] Univ Birmingham, Sch Comp Sci, Birmingham, England
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01646
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus on the problem of category-level object pose estimation, which is challenging due to the large intra-category shape variation. 3D graph convolution (3D-GC) based methods have been widely used to extract local geometric features, but they have limitations for complex shaped objects and are sensitive to noise. Moreover, the scale and translation invariant properties of 3D-GC restrict the perception of an object's size and translation information. In this paper, we propose a simple network structure, the HS-layer, which extends 3D-GC to extract hybrid scope latent features from point cloud data for category-level object pose estimation tasks. The proposed HS-layer: 1) is able to perceive local-global geometric structure and global information, 2) is robust to noise, and 3) can encode size and translation information. Our experiments show that the simple replacement of the 3D-GC layer with the proposed HS-layer on the baseline method (GPV-Pose) achieves a significant improvement, with the performance increased by 14.5% on 5 degrees 2cm metric and 10.3% on IoU(75). Our method outperforms the state-of-the-art methods by a large margin (8.3% on 5 degrees 2cm, 6.9% on IoU(75)) on REAL275 dataset and runs in real-time (50 FPS)(1).
引用
收藏
页码:17163 / 17173
页数:11
相关论文
共 56 条
  • [31] A Low-Temperature Solution-Process High-k Dielectric for High-Performance Flexible Organic Field-Effect Transistors
    Mu, Qi
    Chen, Zheng
    Duan, Shuming
    Zhang, Xiaotao
    Ren, Xiaochen
    Hu, Wenping
    [J]. FRONTIERS IN MATERIALS, 2020, 7
  • [32] Nguyen Van Nguyen, 2022, TEMPLATES 3D OBJECT, P2
  • [33] Pix2Pose: Pixel-Wise Coordinate Regression of Objects for 6D Pose Estimation
    Park, Kiru
    Patten, Timothy
    Vincze, Markus
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7667 - 7676
  • [34] Peng Sida, 2019, IEEE C COMP VIS PATT, P2
  • [35] BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects without Using Depth
    Rad, Mahdi
    Lepetit, Vincent
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3848 - 3856
  • [36] Sahin Caner, 2018, Category-level 6d object pose recovery in depth images, P2, Patent No. 08201812
  • [37] OSOP: A Multi-Stage One Shot Object Pose Estimation Framework
    Shugurov, Ivan
    Li, Fu
    Busam, Benjamin
    Ilic, Slobodan
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6825 - 6834
  • [38] Su Y., 2022, ZEBRAPOSE COARSE FIN
  • [39] Deep Multi-State Object Pose Estimation for Augmented Reality Assembly
    Su, Yongzhi
    Rambach, Jason
    Minaskan, Nareg
    Lesur, Paul
    Pagani, Alain
    Stricker, Didier
    [J]. ADJUNCT PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR-ADJUNCT 2019), 2019, : 222 - 227
  • [40] OnePose: One-Shot Object Pose Estimation without CAD Models
    Sun, Jiaming
    Wang, Zihao
    Zhang, Siyu
    He, Xingyi
    Zhao, Hongcheng
    Zhang, Guofeng
    Zhou, Xiaowei
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6815 - 6824