Dense Prediction Transformer for Scale Estimation in Monocular Visual Odometry

Cited by: 3
Authors
Francani, Andre O. [1]
Maximo, Marcos R. O. A. [1]
Affiliations
[1] Aeronaut Inst Technol, Autonomous Computat Syst Lab Lab SCA, Comp Sci Div, Sao Jose Dos Campos, SP, Brazil
Source
2022 LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS), 2022 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR), AND 2022 WORKSHOP ON ROBOTICS IN EDUCATION (WRE) | 2022
Keywords
monocular visual odometry; scale estimation; deep learning; monocular depth estimation; vision transformer;
DOI
10.1109/LARS/SBR/WRE56824.2022.9995735
Chinese Library Classification (CLC) number
TP24 [Robotics];
Discipline classification codes
080202 ; 1405 ;
Abstract
Monocular visual odometry consists of the estimation of the position of an agent through images of a single camera, and it is applied in autonomous vehicles, medical robots, and augmented reality. However, monocular systems suffer from the scale ambiguity problem due to the lack of depth information in 2D frames. This paper contributes by showing an application of the dense prediction transformer model for scale estimation in monocular visual odometry systems. Experimental results show that the scale drift problem of monocular systems can be reduced through the accurate estimation of the depth map by this model, achieving competitive state-of-the-art performance on a visual odometry benchmark.
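The idea in the abstract, recovering metric scale from a predicted depth map, can be illustrated with a minimal sketch. This is not necessarily the authors' procedure: it assumes the odometry front end yields up-to-scale depths for a set of tracked points and that a depth network supplies metric depths at the same pixels, and it takes the median of their ratios as the scale. The function names `estimate_scale` and `rescale_translation` and the synthetic data are hypothetical.

```python
import numpy as np


def estimate_scale(relative_depths, metric_depths, min_depth=1e-6):
    """Median ratio of metric depth to up-to-scale depth (assumed approach)."""
    relative_depths = np.asarray(relative_depths, dtype=float)
    metric_depths = np.asarray(metric_depths, dtype=float)
    # Keep only points with a positive depth in both sources.
    valid = (relative_depths > min_depth) & (metric_depths > min_depth)
    if not np.any(valid):
        raise ValueError("no valid depth pairs")
    # The median tolerates a minority of bad depth predictions.
    return float(np.median(metric_depths[valid] / relative_depths[valid]))


def rescale_translation(unit_translation, scale):
    """Apply the estimated scale to an up-to-scale translation vector."""
    return scale * np.asarray(unit_translation, dtype=float)


# Synthetic demonstration: depths that differ by a known factor,
# with a few corrupted predictions acting as outliers.
rng = np.random.default_rng(0)
true_scale = 2.5
relative = rng.uniform(1.0, 10.0, size=200)
metric = true_scale * relative
metric[:10] *= 5.0  # simulated outliers in the predicted depth map

scale = estimate_scale(relative, metric)
translation = rescale_translation([0.0, 0.0, 1.0], scale)
print(scale, translation)
```

The median is used here instead of the mean so that a small fraction of wrong depth predictions does not bias the scale; other robust estimators would serve the same purpose.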
Pages: 312-317
Number of pages: 6