Certifiable Object Pose Estimation: Foundations, Learning Models, and Self-Training

被引:3
作者
Talak, Rajat [1 ]
Peng, Lisa R. [1 ,2 ]
Carlone, Luca [1 ]
机构
[1] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA
[2] Ample, San Francisco, CA 94107 USA
基金
美国国家科学基金会;
关键词
Certifiable models; computer vision; 3D robot vision; object pose estimation; safe perception; self-supervised learning; PREDICTION;
D O I
10.1109/TRO.2023.3271568
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this article, we consider a certifiable object pose estimation problem, where-given a partial point cloud of an object-the goal is to not only estimate the object pose, but also provide a certificate of correctness for the resulting estimate. Our first contribution is a general theory of certification for end-to-end perception models. In particular, we introduce the notion of ?-correctness, which bounds the distance between an estimate and the ground truth. We then show that ?-correctness can be assessed by implementing two certificates: 1) a certificate of observable correctness, which asserts if the model output is consistent with the input data and prior information; and 2) a certificate of nondegeneracy, which asserts whether the input data are sufficient to compute a unique estimate. Our second contribution is to apply this theory and design a new learning-based certifiable pose estimator. In particular, we propose C-3PO, a semantic-keypoint-based pose estimation model, augmented with the two certificates, to solve the certifiable pose estimation problem. C-3PO also includes a keypoint corrector, implemented as a differentiable optimization layer, that can correct large detection errors (e.g., due to the sim-to-real gap). Our third contribution is a novel self-supervised training approach that uses our certificate of observable correctness to provide the supervisory signal to C-3PO during training. In it, the model trains only on the observably correct input-output pairs produced in each batch and at each iteration. As training progresses, we see that the observably correct input-output pairs grow, eventually reaching near 100% in many cases. We conduct extensive experiments to evaluate the performance of the corrector, the certification, and the proposed self-supervised training using the ShapeNet and YCB datasets. The experiments show that 1) standard semantic-keypoint-based methods (which constitute the backbone of C-3PO) outperform more recent alternatives in challenging problem instances; 2) C-3PO further improves performance and significantly outperforms all the baselines; and 3) C-3PO's certificates are able to discern correct pose estimates.(1)
引用
收藏
页码:2805 / 2824
页数:20
相关论文
共 35 条
  • [21] Estimation of local scour around monopile foundations for offshore structures using machine learning models
    Guan, Da-Wei
    Xie, Yu-Xuan
    Chiew, Yee-Meng
    Ding, Fan
    Ferradosa, Tiago Fazeres
    Hong, Jianhao
    OCEAN ENGINEERING, 2024, 296
  • [22] Adversarial imitation learning-based network for category-level 6D object pose estimation
    Sun, Shantong
    Bao, Xu
    Kaushik, Aryan
    MACHINE VISION AND APPLICATIONS, 2024, 35 (05)
  • [23] Switchable-Encoder-Based Self-Supervised Learning Framework for Monocular Depth and Pose Estimation
    Kim, Junoh
    Gao, Rui
    Park, Jisun
    Yoon, Jinsoo
    Cho, Kyungeun
    REMOTE SENSING, 2023, 15 (24)
  • [24] Multi-View 3D Human Pose Estimation with Self-Supervised Learning
    Chang, Inho
    Park, Min-Gyu
    Kim, Jaewoo
    Yoon, Ju Hong
    3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (IEEE ICAIIC 2021), 2021, : 255 - 257
  • [25] Fixed-Wing UAV Pose Estimation Using a Self-Organizing Map and Deep Learning
    Santos, Nuno Pessanha
    ROBOTICS, 2024, 13 (08)
  • [26] SynPo-Net-Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training
    Su, Yongzhi
    Rambach, Jason
    Pagani, Alain
    Stricker, Didier
    SENSORS, 2021, 21 (01) : 1 - 16
  • [27] Detecting Object Surface Keypoints From a Single RGB Image via Deep Learning Network for 6-DoF Pose Estimation
    Aing, Lee
    Lie, Wen-Nung
    IEEE ACCESS, 2021, 9 : 77729 - 77741
  • [28] Category-Level 6D Object Pose and Size Estimation Using Self-supervised Deep Prior Deformation Networks
    Lin, Jiehong
    Wei, Zewei
    Ding, Changxing
    Jia, Kui
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 19 - 34
  • [29] Deep Learning-Based Vehicle Orientation Estimation with Analysis of Training Models on Virtual-Worlds
    Park, Jongkuk
    Yoon, Yookhyun
    Park, Jahnghyon
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2019, : 292 - 298
  • [30] Faster Training of Edge-attention Aided 6D Pose Estimation Model using Transfer Learning and Small Customized Dataset
    Anju, S.
    Mishra, Deepak
    Rani, Sheeba J.
    2021 SIXTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2021, : 62 - 67