Certifiable Object Pose Estimation: Foundations, Learning Models, and Self-Training

被引:3
作者
Talak, Rajat [1 ]
Peng, Lisa R. [1 ,2 ]
Carlone, Luca [1 ]
机构
[1] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA
[2] Ample, San Francisco, CA 94107 USA
基金
美国国家科学基金会;
关键词
Certifiable models; computer vision; 3D robot vision; object pose estimation; safe perception; self-supervised learning; PREDICTION;
D O I
10.1109/TRO.2023.3271568
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this article, we consider a certifiable object pose estimation problem, where-given a partial point cloud of an object-the goal is to not only estimate the object pose, but also provide a certificate of correctness for the resulting estimate. Our first contribution is a general theory of certification for end-to-end perception models. In particular, we introduce the notion of ?-correctness, which bounds the distance between an estimate and the ground truth. We then show that ?-correctness can be assessed by implementing two certificates: 1) a certificate of observable correctness, which asserts if the model output is consistent with the input data and prior information; and 2) a certificate of nondegeneracy, which asserts whether the input data are sufficient to compute a unique estimate. Our second contribution is to apply this theory and design a new learning-based certifiable pose estimator. In particular, we propose C-3PO, a semantic-keypoint-based pose estimation model, augmented with the two certificates, to solve the certifiable pose estimation problem. C-3PO also includes a keypoint corrector, implemented as a differentiable optimization layer, that can correct large detection errors (e.g., due to the sim-to-real gap). Our third contribution is a novel self-supervised training approach that uses our certificate of observable correctness to provide the supervisory signal to C-3PO during training. In it, the model trains only on the observably correct input-output pairs produced in each batch and at each iteration. As training progresses, we see that the observably correct input-output pairs grow, eventually reaching near 100% in many cases. We conduct extensive experiments to evaluate the performance of the corrector, the certification, and the proposed self-supervised training using the ShapeNet and YCB datasets. The experiments show that 1) standard semantic-keypoint-based methods (which constitute the backbone of C-3PO) outperform more recent alternatives in challenging problem instances; 2) C-3PO further improves performance and significantly outperforms all the baselines; and 3) C-3PO's certificates are able to discern correct pose estimates.(1)
引用
收藏
页码:2805 / 2824
页数:20
相关论文
共 35 条
  • [1] Bridging the Domain Gap in Satellite Pose Estimation: A Self-Training Approach Based on Geometrical Constraints
    Wang, Zi
    Chen, Minglin
    Guo, Yulan
    Li, Zhang
    Yu, Qifeng
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (03) : 2500 - 2514
  • [2] SELF-SUPERVISED LEARNING FOR HUMAN POSE ESTIMATION IN SPORTS
    Ludwig, Katja
    Scherer, Sebastian
    Einfalt, Moritz
    Lienhart, Rainer
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [3] A Study on Systematic Improvement of Transformer Models for Object Pose Estimation
    Lee, Jungwoo
    Suh, Jinho
    SENSORS, 2025, 25 (04)
  • [4] ASBERT: ASR-SPECIFIC SELF-SUPERVISED LEARNING WITH SELF-TRAINING
    Kim, Hyung Yong
    Kim, Byeong-Yeol
    Yoo, Seung Woo
    Lim, Youshin
    Lim, Yunkyu
    Lee, Hanbin
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 9 - 14
  • [5] OTOT: An Online Training and Offline Testing System for 6D Object Pose Estimation
    Yuan, Yilin
    Jiang, Qian
    Mu, Quan
    Jia, Wenchao
    Fu, Boya
    He, Renzhi
    Wen, Jian
    Liu, Fei
    Mao, Qin
    Zhou, Mingliang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2025, 39 (02)
  • [6] Self-Generated Dataset for Category and Pose Estimation of Deformable Object
    Hou, Yew Cheong
    Sahari, Khairul Salleh Mohamed
    ICAROB 2018: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS, 2018, : 232 - 235
  • [7] Self-Generated Dataset for Category and Pose Estimation of Deformable Object
    Hou, Yew Cheong
    Sahari, Khairul Salleh Mohamed
    JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2019, 5 (04): : 217 - 222
  • [8] SS-Pose: Self-Supervised 6-D Object Pose Representation Learning Without Rendering
    Mu, Fengjun
    Huang, Rui
    Zhang, Jingting
    Zou, Chaobin
    Shi, Kecheng
    Sun, Shixiang
    Zhan, Huayi
    Zhao, Pengbo
    Qiu, Jing
    Cheng, Hong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (12) : 13665 - 13675
  • [9] Robotic Grasp Detection Based on Category-Level Object Pose Estimation With Self-Supervised Learning
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (01) : 625 - 635
  • [10] RemixIT: Continual Self-Training of Speech Enhancement Models via Bootstrapped Remixing
    Tzinis, Efthymios
    Adi, Yossi
    Ithapu, Vamsi K.
    Xu, Buye
    Smaragdis, Paris
    Kumar, Anurag
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1329 - 1341