Challenges for Monocular 6-D Object Pose Estimation in Robotics

被引：3

作者：

Thalhammer, Stefan ^{[1
]}

Bauer, Dominik ^{[2
]}

Hoenig, Peter ^{[3
]}

Weibel, Jean-Baptiste ^{[3
]}

Garcia-Rodriguez, Jose ^{[4
]}

Vincze, Markus ^{[3
]}

机构：

[1] UAS Tech Vienna, Ind Engn Dept, A-1200 Vienna, Austria

[2] Columbia Univ, Columbia Artificial Intelligence & Robot Lab, New York, NY 10027 USA

[3] TU Wien, Automat & Control Inst, A-1040 Vienna, Austria

[4] Univ Alicante, Dept Comp Technol, Alicante 03690, Spain

来源：

IEEE TRANSACTIONS ON ROBOTICS | 2024年 / 40卷

基金：

奥地利科学基金会; 欧盟地平线“2020”;

关键词：

Robots; Pose estimation; Training; Standards; Sensors; Surveys; Reviews; 6-D object pose estimation; monocular; open challenges; perception for manipulation; scene understanding; LARGE-SCALE BENCHMARK; 6D POSE; DATASET;

D O I：

10.1109/TRO.2024.3433870

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Object pose estimation is a core perception task that enables, for example, object manipulation and scene understanding. The widely available, inexpensive, and high-resolution RGB sensors and CNNs that allow for fast inference make monocular approaches especially well-suited for robotics applications. We observe that previous surveys establish the state of the art for varying modalities, single- and multiview settings, and datasets and metrics that consider a multitude of applications. We argue, however, that those works' broad scope hinders the identification of open challenges that are specific to monocular approaches and the derivation of promising future challenges for their application in robotics. By providing a unified view on recent publications from both robotics and computer vision, we find that occlusion handling, pose representations, and formalizing and improving category-level pose estimation are still fundamental challenges that are highly relevant for robotics. Moreover, to further improve robotic performance, large object sets, novel objects, refractive materials, and uncertainty estimates are central and largely unsolved open challenges. In order to address them, ontological reasoning, deformability handling, scene-level reasoning, realistic datasets, and the ecological footprint of algorithms need to be improved.

引用

页码：4065 / 4084

页数：20

共 50 条

[11] Category-Level 6-D Object Pose Estimation With Shape Deformation for Robotic Grasp Detection
Yu, Sheng
Zhai, Di-Hua
Guan, Yuyin
Xia, Yuanqing
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1857 - 1871
[12] Active 6-D position-pose estimation of a spatial circle using monocular eye-in-hand system
Ma, Xin
Feng, Junbing
Li, Yibin
Tan, Jindong
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2018, 15 (01):
[13] Robust 6-D Pose Estimation of the UAV Based on Hybrid Features
Li, Chujun
Wu, Yiyang
Sheng, Zhuge
Yang, Xia
Lin, Bin
Xu, Xiangpeng
Zhang, Xiaohu
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
[14] MH6D: Multi-Hypothesis Consistency Learning for Category-Level 6-D Object Pose Estimation
Liu, Jian
Sun, Wei
Liu, Chongpei
Yang, Hui
Zhang, Xing
Mian, Ajmal
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 14
[15] Optimal Model-Based 6-D Object Pose Estimation With Structured-Light Depth Sensors
Landau, Michael J.
Beling, Peter A.
IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2017, 3 (01) : 58 - 73
[16] YOLO-6D-Pose: Enhancing YOLO for Single-Stage Monocular Multi-Object 6D Pose Estimation
Maji, Debapriya
Nagori, Soyeb
Mathew, Manu
Poddar, Deepak
2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1616 - 1625
[17] 6D Object Pose Estimation from Approximate 3D Models for Orbital Robotics
Ulmer, Maximilian
Durner, Maximilian
Sundermeyer, Martin
Stoiber, Manuel
Triebel, Rudolph
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 10749 - 10756
[18] Occlusion-Aware Self-Supervised Monocular 6D Object Pose Estimation
Wang, Gu
Manhardt, Fabian
Liu, Xingyu
Ji, Xiangyang
Tombari, Federico
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1788 - 1803
[19] Prior Geometry Guided Direct Regression Network for Monocular 6D Object Pose Estimation
Liu, Chongpei
Sun, Wei
Zhang, Keyi
Liu, Jian
Zhang, Xing
Fan, Shimeng
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6241 - 6246
[20] W6DNet: Weakly Supervised Domain Adaptation for Monocular Vehicle 6-D Pose Estimation With 3-D Priors and Synthetic Data
Lyu, Yangxintong
Royen, Remco
Munteanu, Adrian
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 13

← 1 2 3 4 5 →