Scale-Aware Visual-Inertial Depth Estimation and Odometry Using Monocular Self-Supervised Learning

Cited by: 4
Authors
Lee, Chungkeun [1]
Kim, Changhyeon [2]
Kim, Pyojin [3]
Lee, Hyeonbeom [4]
Kim, H. Jin [5]
Affiliations
[1] Seoul Natl Univ, Inst Adv Aerosp Technol, Seoul 08826, South Korea
[2] Seoul Natl Univ, Automation & Syst Res Inst, Seoul 08826, South Korea
[3] Sookmyung Womens Univ, Dept Mech Syst Engn, Seoul 04312, South Korea
[4] Kyungpook Natl Univ, Sch Elect & Elect Engn, Daegu 37224, South Korea
[5] Seoul Natl Univ, Dept Mech & Aerosp Engn, Seoul 08826, South Korea
Funding
National Research Foundation of Singapore;
Keywords
Odometry; Deep learning; Loss measurement; Depth measurement; Cameras; Self-supervised learning; Coordinate measuring machines; monocular depth estimation; self-supervised learning; visual-inertial odometry;
DOI
10.1109/ACCESS.2023.3252884
Chinese Library Classification
TP [Automation and computer technology];
Discipline classification code
0812;
Abstract
For real-world applications with a single monocular camera, scale ambiguity is an important issue. Self-supervised data-driven approaches cannot resolve this ambiguity without additional data containing scale information, so state-of-the-art deep-learning-based methods learn the scale from additional sensor measurements. In this regard, the inertial measurement unit (IMU) is a popular sensor on various mobile platforms because it is lightweight and inexpensive. However, unlike supervised learning, which can learn the scale from ground-truth information, learning the scale from an IMU is challenging in a self-supervised setting. We propose a scale-aware monocular visual-inertial depth estimation and odometry method with end-to-end training. To learn the scale from IMU measurements with end-to-end training in the monocular self-supervised setup, we propose a new loss function, the preintegration loss, which trains scale-aware ego-motion by comparing the ego-motion integrated from IMU measurements with the predicted ego-motion. Because gravity and bias must be compensated to obtain ego-motion by integrating IMU measurements, we design a network that predicts the gravity and the bias in addition to the ego-motion and the depth map. The proposed method is compared with state-of-the-art methods on the popular outdoor driving dataset KITTI and on an author-collected indoor driving dataset. On the KITTI dataset, the proposed method shows competitive performance compared with state-of-the-art monocular depth estimation and odometry methods: a root-mean-square error of 5.435 m on the KITTI Eigen split and an absolute trajectory error of 22.46 m and 0.2975 degrees on KITTI odometry sequence 09. Unlike other up-to-scale monocular methods, the proposed method can estimate metric-scaled depth and camera poses. Additional experiments on the author-collected indoor driving dataset qualitatively confirm the accuracy of the metric depth and metric pose estimates.
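
The idea behind the preintegration loss described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes simple Euler integration, compares only the translation part of the ego-motion, takes the initial velocity `v0` as a given input, and uses an L1 penalty. All function and parameter names (`so3_exp`, `preintegrate`, `preintegration_loss`, `bias_g`, `bias_a`, `gravity`) are hypothetical, and a real training setup would use a differentiable framework rather than plain NumPy.

```python
import numpy as np

def so3_exp(phi):
    """Rodrigues formula: rotation vector -> 3x3 rotation matrix."""
    theta = np.linalg.norm(phi)
    K = np.array([[0.0, -phi[2], phi[1]],
                  [phi[2], 0.0, -phi[0]],
                  [-phi[1], phi[0], 0.0]])
    if theta < 1e-8:
        return np.eye(3) + K  # first-order approximation for tiny angles
    return (np.eye(3) + np.sin(theta) / theta * K
            + (1.0 - np.cos(theta)) / theta**2 * (K @ K))

def preintegrate(gyro, accel, dt, bias_g, bias_a):
    """Integrate bias-corrected IMU samples in the first frame's coordinates.

    Returns the preintegrated rotation, velocity and position increments.
    Gravity and initial velocity are deliberately left out at this stage.
    """
    dR, dv, dp = np.eye(3), np.zeros(3), np.zeros(3)
    for w, a in zip(gyro, accel):
        a_c = dR @ (a - bias_a)                # bias-corrected, rotated to frame i
        dp = dp + dv * dt + 0.5 * a_c * dt**2
        dv = dv + a_c * dt
        dR = dR @ so3_exp((w - bias_g) * dt)
    return dR, dv, dp

def preintegration_loss(t_pred, gyro, accel, dt, bias_g, bias_a, v0, gravity):
    """L1 gap between the predicted and the IMU-integrated metric translation.

    v0 and gravity are expressed in the first camera/IMU frame (assumption).
    """
    _, _, dp = preintegrate(gyro, accel, dt, bias_g, bias_a)
    T = dt * len(accel)
    t_imu = v0 * T + 0.5 * gravity * T**2 + dp  # compensate gravity and velocity
    return float(np.abs(t_pred - t_imu).sum())

# Toy check: 1 s of constant 1 m/s^2 forward acceleration, no rotation.
n, dt = 100, 0.01
gyro = np.zeros((n, 3))
accel = np.tile([1.0, 0.0, 9.81], (n, 1))      # accelerometer reads a - g
zeros = np.zeros(3)
g = np.array([0.0, 0.0, -9.81])
loss_metric = preintegration_loss(np.array([0.5, 0.0, 0.0]),
                                  gyro, accel, dt, zeros, zeros, zeros, g)
loss_scaled = preintegration_loss(np.array([1.0, 0.0, 0.0]),
                                  gyro, accel, dt, zeros, zeros, zeros, g)
```

In the toy check, a prediction with the correct metric scale gives a loss near zero, while the same direction at twice the scale is penalized. A term of this kind is what lets IMU data constrain scale in a self-supervised setup, and it also shows why the gravity and bias values must be available before the integration is meaningful.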
Pages: 24087-24102
Page count: 16