Challenges for Monocular 6-D Object Pose Estimation in Robotics

被引:3
|
作者
Thalhammer, Stefan [1 ]
Bauer, Dominik [2 ]
Hoenig, Peter [3 ]
Weibel, Jean-Baptiste [3 ]
Garcia-Rodriguez, Jose [4 ]
Vincze, Markus [3 ]
机构
[1] UAS Tech Vienna, Ind Engn Dept, A-1200 Vienna, Austria
[2] Columbia Univ, Columbia Artificial Intelligence & Robot Lab, New York, NY 10027 USA
[3] TU Wien, Automat & Control Inst, A-1040 Vienna, Austria
[4] Univ Alicante, Dept Comp Technol, Alicante 03690, Spain
基金
奥地利科学基金会; 欧盟地平线“2020”;
关键词
Robots; Pose estimation; Training; Standards; Sensors; Surveys; Reviews; 6-D object pose estimation; monocular; open challenges; perception for manipulation; scene understanding; LARGE-SCALE BENCHMARK; 6D POSE; DATASET;
D O I
10.1109/TRO.2024.3433870
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Object pose estimation is a core perception task that enables, for example, object manipulation and scene understanding. The widely available, inexpensive, and high-resolution RGB sensors and CNNs that allow for fast inference make monocular approaches especially well-suited for robotics applications. We observe that previous surveys establish the state of the art for varying modalities, single- and multiview settings, and datasets and metrics that consider a multitude of applications. We argue, however, that those works' broad scope hinders the identification of open challenges that are specific to monocular approaches and the derivation of promising future challenges for their application in robotics. By providing a unified view on recent publications from both robotics and computer vision, we find that occlusion handling, pose representations, and formalizing and improving category-level pose estimation are still fundamental challenges that are highly relevant for robotics. Moreover, to further improve robotic performance, large object sets, novel objects, refractive materials, and uncertainty estimates are central and largely unsolved open challenges. In order to address them, ontological reasoning, deformability handling, scene-level reasoning, realistic datasets, and the ecological footprint of algorithms need to be improved.
引用
收藏
页码:4065 / 4084
页数:20
相关论文
共 50 条
  • [1] MODELING AND INTERPRETING 6-D OBJECT POSE ESTIMATION
    Soler, Diego
    Hirata, Roberto, Jr.
    Espadoto, Mateus
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2325 - 2329
  • [2] ContourPose: Monocular 6-D Pose Estimation Method for Reflective Textureless Metal Parts
    He, Zaixing
    Li, Quanzhi
    Zhao, Xinyue
    Wang, Jin
    Shen, Huarong
    Zhang, Shuyou
    Tan, Jianrong
    IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (05) : 4037 - 4050
  • [3] 6-D Object Pose Estimation Using Multiscale Point Cloud Transformer
    Zhou, Guangliang
    Wang, Deming
    Yan, Yi
    Liu, Chengju
    Chen, Qijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [4] 6-D Object Pose Estimation Using Multiscale Point Cloud Transformer
    Zhou, Guangliang
    Wang, Deming
    Yan, Yi
    Liu, Chengju
    Chen, Qijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [5] 6-D pose estimation method for large gear structure assembly using monocular vision
    Zhou, Kuai
    Huang, Xiang
    Li, Shuanggao
    Li, Hangyu
    Kong, Shengjie
    MEASUREMENT, 2021, 183
  • [6] WireframePose: Monocular 6-D Pose Estimation of Metal Parts Based on Wireframe Extraction and Matching
    Liu, Ze'An
    Wang, Xuanyin
    Pu, Bin
    Tang, Jixiang
    Sun, Jiaqi
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [7] Sparse Template-Based 6-D Pose Estimation of Metal Parts Using a Monocular Camera
    He, Zaixing
    Jiang, Zhiwei
    Zhao, Xinyue
    Zhang, Shuyou
    Wu, Chenrui
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (01) : 390 - 401
  • [8] Efficient MSPSO Sampling for Object Detection and 6-D Pose Estimation in 3-D Scenes
    Xing, Xuejun
    Guo, Jianwei
    Nan, Liangliang
    Gu, Qingyi
    Zhang, Xiaopeng
    Yan, Dong-Ming
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (10) : 10281 - 10291
  • [9] 6-D Object Pose Estimation Based on Point Pair Matching for Robotic Grasp Detection
    Yu, Sheng
    Zhai, Di-Hua
    Zhan, Yufeng
    Wang, Wencai
    Guan, Yuyin
    Xia, Yuanqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [10] Category-Level 6-D Object Pose Estimation With Shape Deformation for Robotic Grasp Detection
    Yu, Sheng
    Zhai, Di-Hua
    Guan, Yuyin
    Xia, Yuanqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1857 - 1871