6DoF Object Pose and Focal Length Estimation from Single RGB Images in Uncontrolled Environments

被引:1
|
作者
Manawadu, Mayura [1 ]
Park, Soon-Yong [1 ]
机构
[1] Kyungpook Natl Univ, Grad Sch Elect & Elect Engn, Daegu 41566, South Korea
基金
新加坡国家研究基金会;
关键词
6DoF; pose estimation; focal length; uncontrolled RGB images; XR; RECOGNITION;
D O I
10.3390/s24175474
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Accurate 6DoF (degrees of freedom) pose and focal length estimation are important in extended reality (XR) applications, enabling precise object alignment and projection scaling, thereby enhancing user experiences. This study focuses on improving 6DoF pose estimation using single RGB images of unknown camera metadata. Estimating the 6DoF pose and focal length from an uncontrolled RGB image, obtained from the internet, is challenging because it often lacks crucial metadata. Existing methods such as FocalPose and Focalpose++ have made progress in this domain but still face challenges due to the projection scale ambiguity between the translation of an object along the z-axis (tz) and the camera's focal length. To overcome this, we propose a two-stage strategy that decouples the projection scaling ambiguity in the estimation of z-axis translation and focal length. In the first stage, tz is set arbitrarily, and we predict all the other pose parameters and focal length relative to the fixed tz. In the second stage, we predict the true value of tz while scaling the focal length based on the tz update. The proposed two-stage method reduces projection scale ambiguity in RGB images and improves pose estimation accuracy. The iterative update rules constrained to the first stage and tailored loss functions including Huber loss in the second stage enhance the accuracy in both 6DoF pose and focal length estimation. Experimental results using benchmark datasets show significant improvements in terms of median rotation and translation errors, as well as better projection accuracy compared to the existing state-of-the-art methods. In an evaluation across the Pix3D datasets (chair, sofa, table, and bed), the proposed two-stage method improves projection accuracy by approximately 7.19%. Additionally, the incorporation of Huber loss resulted in a significant reduction in translation and focal length errors by 20.27% and 6.65%, respectively, in comparison to the Focalpose++ method.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] 6DoF Pose Estimation of Transparent Object from a Single RGB-D Image
    Xu, Chi
    Chen, Jiale
    Yao, Mengyang
    Zhou, Jun
    Zhang, Lijun
    Liu, Yi
    SENSORS, 2020, 20 (23) : 1 - 19
  • [2] End-to-End 6DoF Pose Estimation From Monocular RGB Images
    Zou, Wenbin
    Wu, Di
    Tian, Shishun
    Xiang, Canqun
    Li, Xia
    Zhang, Lu
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2021, 67 (01) : 87 - 96
  • [3] InstancePose: Fast 6DoF Pose Estimation for Multiple Objects from a Single RGB Image
    Aing, Lee
    Lie, Wen-Nung
    Chiang, Jui-Chiu
    Lin, Guo-Shiang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2621 - 2630
  • [4] Object aspect classification and 6DoF pose estimation
    Dede, Muhammet Ali
    Genc, Yakup
    IMAGE AND VISION COMPUTING, 2022, 124
  • [5] Detecting Object Surface Keypoints from a Single RGB Image via Deep Learning Network for 6DoF Pose Estimation
    Aing, Lee
    Lie, Wen-Nung
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 1673 - 1678
  • [6] 6DoF Pose Estimation for Intricately-Shaped Object
    Jiao, Tonghui
    Xia, Yanzhao
    Gao, Xiaosong
    Chen, Yongyu
    Zhao, Qunfei
    2019 3RD INTERNATIONAL SYMPOSIUM ON AUTONOMOUS SYSTEMS (ISAS 2019), 2019, : 199 - 204
  • [7] Spatial feature mapping for 6DoF object pose estimation
    Mei, Jianhan
    Jiang, Xudong
    Ding, Henghui
    PATTERN RECOGNITION, 2022, 131
  • [8] 6D-VNet: End-to-end 6DoF Vehicle Pose Estimation from Monocular RGB Images
    Wu, Di
    Zhuang, Zhaoyong
    Xiang, Canqun
    Zou, Wenbin
    Li, Xia
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1238 - 1247
  • [9] Optimizing RGB-D Fusion for Accurate 6DoF Pose Estimation
    Saadi, Lounes
    Besbes, Bassem
    Kramm, Sebastien
    Bensrhair, Abdelaziz
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 2413 - 2420
  • [10] 6DoF Pose Estimation with Object Cutout based on a Deep Autoencoder
    Liu, Xin
    Zhang, Jichao
    He, Xian
    Song, Xiuqiang
    Qin, Xueying
    ADJUNCT PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR-ADJUNCT 2019), 2019, : 360 - 365