6DoF Object Pose and Focal Length Estimation from Single RGB Images in Uncontrolled Environments

Cited by: 1
Authors
Manawadu, Mayura [1]
Park, Soon-Yong [1]
Affiliations
[1] Kyungpook Natl Univ, Grad Sch Elect & Elect Engn, Daegu 41566, South Korea
Funding
National Research Foundation of Singapore;
Keywords
6DoF; pose estimation; focal length; uncontrolled RGB images; XR; RECOGNITION;
DOI
10.3390/s24175474
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Subject classification code
070302 ; 081704 ;
Abstract
Accurate 6DoF (degrees of freedom) pose and focal length estimation are important in extended reality (XR) applications, enabling precise object alignment and projection scaling, thereby enhancing user experiences. This study focuses on improving 6DoF pose estimation using single RGB images with unknown camera metadata. Estimating the 6DoF pose and focal length from an uncontrolled RGB image obtained from the internet is challenging because such an image often lacks crucial metadata. Existing methods such as FocalPose and FocalPose++ have made progress in this domain but still face challenges due to the projection scale ambiguity between the translation of an object along the z-axis (tz) and the camera's focal length. To overcome this, we propose a two-stage strategy that decouples the projection scale ambiguity in the estimation of z-axis translation and focal length. In the first stage, tz is set arbitrarily, and we predict all the other pose parameters and the focal length relative to the fixed tz. In the second stage, we predict the true value of tz while scaling the focal length based on the tz update. The proposed two-stage method reduces projection scale ambiguity in RGB images and improves pose estimation accuracy. The iterative update rules, constrained to the first stage, and tailored loss functions, including Huber loss in the second stage, enhance the accuracy of both 6DoF pose and focal length estimation. Experimental results using benchmark datasets show significant improvements in terms of median rotation and translation errors, as well as better projection accuracy compared to the existing state-of-the-art methods. In an evaluation across the Pix3D datasets (chair, sofa, table, and bed), the proposed two-stage method improves projection accuracy by approximately 7.19%. Additionally, the incorporation of Huber loss resulted in a significant reduction in translation and focal length errors of 20.27% and 6.65%, respectively, in comparison to the FocalPose++ method.
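The projection scale ambiguity mentioned in the abstract can be illustrated with a minimal sketch, assuming a plain pinhole camera model. This is not the authors' implementation: the function names `project`, `rescale_focal`, and `huber`, the proportional rescaling rule, and all numeric values are hypothetical choices made for illustration only.

```python
import numpy as np

def project(points, f, t):
    """Pinhole projection of object points (N, 3) after translating by t = (tx, ty, tz)."""
    cam = points + np.asarray(t, dtype=float)
    return f * cam[:, :2] / cam[:, 2:3]

def rescale_focal(f, tz_old, tz_new):
    """Assumed rule: scale the focal length in proportion to the tz update."""
    return f * tz_new / tz_old

def huber(residual, delta=1.0):
    """Standard Huber loss: quadratic for small residuals, linear for large ones."""
    r = np.abs(residual)
    return np.where(r <= delta, 0.5 * r**2, delta * (r - 0.5 * delta))

# A flat object (all points at the same depth) makes the ambiguity exact:
# doubling both tz and the focal length leaves the image unchanged.
flat = np.array([[-0.2, -0.1, 0.0], [0.3, 0.1, 0.0], [0.0, 0.25, 0.0]])
f1, tz1 = 500.0, 2.0           # illustrative values with an arbitrary tz
tz2 = 4.0                      # illustrative updated tz
f2 = rescale_focal(f1, tz1, tz2)

img1 = project(flat, f1, (0.1, 0.2, tz1))
img2 = project(flat, f2, (0.1, 0.2, tz2))
print(np.allclose(img1, img2))

# With depth relief the two projections differ slightly, which is the only
# cue a single image offers for separating tz from the focal length.
relief = flat + np.array([0.0, 0.0, 0.1])
gap = project(relief, f1, (0.1, 0.2, tz1)) - project(relief, f2, (0.1, 0.2, tz2))
print(huber(gap).sum())
```

In this sketch, fixing tz and later rescaling the focal length by the ratio of new to old tz keeps the projected object roughly the same size in the image, which is one way to read the abstract's description of updating the focal length based on the tz update.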
Pages: 25