Challenges for Monocular 6-D Object Pose Estimation in Robotics

被引:3
|
作者
Thalhammer, Stefan [1 ]
Bauer, Dominik [2 ]
Hoenig, Peter [3 ]
Weibel, Jean-Baptiste [3 ]
Garcia-Rodriguez, Jose [4 ]
Vincze, Markus [3 ]
机构
[1] UAS Tech Vienna, Ind Engn Dept, A-1200 Vienna, Austria
[2] Columbia Univ, Columbia Artificial Intelligence & Robot Lab, New York, NY 10027 USA
[3] TU Wien, Automat & Control Inst, A-1040 Vienna, Austria
[4] Univ Alicante, Dept Comp Technol, Alicante 03690, Spain
基金
奥地利科学基金会; 欧盟地平线“2020”;
关键词
Robots; Pose estimation; Training; Standards; Sensors; Surveys; Reviews; 6-D object pose estimation; monocular; open challenges; perception for manipulation; scene understanding; LARGE-SCALE BENCHMARK; 6D POSE; DATASET;
D O I
10.1109/TRO.2024.3433870
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Object pose estimation is a core perception task that enables, for example, object manipulation and scene understanding. The widely available, inexpensive, and high-resolution RGB sensors and CNNs that allow for fast inference make monocular approaches especially well-suited for robotics applications. We observe that previous surveys establish the state of the art for varying modalities, single- and multiview settings, and datasets and metrics that consider a multitude of applications. We argue, however, that those works' broad scope hinders the identification of open challenges that are specific to monocular approaches and the derivation of promising future challenges for their application in robotics. By providing a unified view on recent publications from both robotics and computer vision, we find that occlusion handling, pose representations, and formalizing and improving category-level pose estimation are still fundamental challenges that are highly relevant for robotics. Moreover, to further improve robotic performance, large object sets, novel objects, refractive materials, and uncertainty estimates are central and largely unsolved open challenges. In order to address them, ontological reasoning, deformability handling, scene-level reasoning, realistic datasets, and the ecological footprint of algorithms need to be improved.
引用
收藏
页码:4065 / 4084
页数:20
相关论文
共 50 条
  • [11] Category-Level 6-D Object Pose Estimation With Shape Deformation for Robotic Grasp Detection
    Yu, Sheng
    Zhai, Di-Hua
    Guan, Yuyin
    Xia, Yuanqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1857 - 1871
  • [12] Active 6-D position-pose estimation of a spatial circle using monocular eye-in-hand system
    Ma, Xin
    Feng, Junbing
    Li, Yibin
    Tan, Jindong
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2018, 15 (01):
  • [13] Robust 6-D Pose Estimation of the UAV Based on Hybrid Features
    Li, Chujun
    Wu, Yiyang
    Sheng, Zhuge
    Yang, Xia
    Lin, Bin
    Xu, Xiangpeng
    Zhang, Xiaohu
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [14] MH6D: Multi-Hypothesis Consistency Learning for Category-Level 6-D Object Pose Estimation
    Liu, Jian
    Sun, Wei
    Liu, Chongpei
    Yang, Hui
    Zhang, Xing
    Mian, Ajmal
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 14
  • [15] Optimal Model-Based 6-D Object Pose Estimation With Structured-Light Depth Sensors
    Landau, Michael J.
    Beling, Peter A.
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2017, 3 (01) : 58 - 73
  • [16] YOLO-6D-Pose: Enhancing YOLO for Single-Stage Monocular Multi-Object 6D Pose Estimation
    Maji, Debapriya
    Nagori, Soyeb
    Mathew, Manu
    Poddar, Deepak
    2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1616 - 1625
  • [17] 6D Object Pose Estimation from Approximate 3D Models for Orbital Robotics
    Ulmer, Maximilian
    Durner, Maximilian
    Sundermeyer, Martin
    Stoiber, Manuel
    Triebel, Rudolph
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 10749 - 10756
  • [18] Occlusion-Aware Self-Supervised Monocular 6D Object Pose Estimation
    Wang, Gu
    Manhardt, Fabian
    Liu, Xingyu
    Ji, Xiangyang
    Tombari, Federico
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1788 - 1803
  • [19] Prior Geometry Guided Direct Regression Network for Monocular 6D Object Pose Estimation
    Liu, Chongpei
    Sun, Wei
    Zhang, Keyi
    Liu, Jian
    Zhang, Xing
    Fan, Shimeng
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6241 - 6246
  • [20] W6DNet: Weakly Supervised Domain Adaptation for Monocular Vehicle 6-D Pose Estimation With 3-D Priors and Synthetic Data
    Lyu, Yangxintong
    Royen, Remco
    Munteanu, Adrian
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 13