Unsupervised Learning of Depth and Pose Based on Monocular Camera and Inertial Measurement Unit (IMU)

被引:3
|
作者
Wang, Yanbo [1 ]
Yang, Hanwen [2 ]
Cai, Jianwei [3 ]
Wang, Guangming [4 ,5 ]
Wang, Jingchuan [4 ,5 ]
Huang, Yi [6 ]
机构
[1] Shanghai Jiao Tong Univ, Univ Michigan Shanghai Jiao Tong Univ Joint Inst, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200240, Peoples R China
[4] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
[5] Minist Educ China, Key Lab Syst Control & Informat Proc, Beijing, Peoples R China
[6] Shanghai Weitong Vis Technol Co Ltd, Shanghai, Peoples R China
关键词
D O I
10.1109/ICRA48891.2023.10160277
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The main content of the research in this paper is the estimation of depth and pose based on monocular vision and Inertial Measurement Unit (IMU). The usual depth estimation network and pose estimation network require depth ground truth or pose ground truth as a supervised signal for training, while the depth ground truth and pose ground truth are hard to obtain, and monocular vision based depth estimation cannot predict absolute depth. In this paper, with the help of IMU, which is inexpensive and widely used, we can obtain angular velocity and acceleration information. Two new supervision signals are proposed and the calculation expressions are given. Among them, the model trained with acceleration constraint shows a good ability to estimate the absolute depth during the test. It can be considered that the model can estimate the absolute depth. We also derive the method of estimating the scale factor during the test from the acceleration constraint, and also achieve good results as the acceleration constraint does. In addition, this paper also studies the method of using IMU information as pose network input and as selecting conditions. Moreover, it analyzes and discusses the experimental results. At the same time, we also evaluate the effect of the pose estimation of the relevant models. This article starts by reviewing the achievements and deficiencies of the work in this field, combines the use of IMU, puts forward three new methods such as a new loss function, and conducts a test analysis and discussion of relevant indicators on the KITTI data set.
引用
收藏
页码:10010 / 10017
页数:8
相关论文
共 50 条
  • [21] A Wireless Micro Inertial Measurement Unit (IMU)
    Hoeflinger, Fabian
    Mueller, Joerg
    Zhang, Rui
    Reindl, Leonhard M.
    Burgard, Wolfram
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2013, 62 (09) : 2583 - 2595
  • [22] Joint calibration of an inertial measurement unit and coordinate transformation parameters using a monocular camera
    Zachariah, Dave
    Jansson, Magnus
    2010 INTERNATIONAL CONFERENCE ON INDOOR POSITIONING AND INDOOR NAVIGATION, 2010,
  • [23] UnLearnerMC: Unsupervised learning of dense depth and camera pose using mask and cooperative loss
    Zhang, Junning
    Su, Qunxing
    Liu, Pengyuan
    Xu, Chao
    Wang, Zhengjun
    KNOWLEDGE-BASED SYSTEMS, 2020, 192 (192)
  • [24] Unsupervised Learning of Accurate Camera Pose and Depth From Video Sequences With Kalman Filter
    Wang, Yan
    Xu, Yu-Fan
    IEEE ACCESS, 2019, 7 : 32796 - 32804
  • [25] Inertial aiding of inverse depth SLAM using a monocular camera
    Pinies, Pedro
    Lupton, Todd
    Sukkarieh, Salah
    Tardos, Juan D.
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-10, 2007, : 2797 - +
  • [26] Unsupervised Learning of Monocular Depth from Videos
    Gao Haosheng
    Teng Wang
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3945 - 3950
  • [27] 3D human pose estimation with single image and inertial measurement unit (IMU) sequence
    Liu, Liujun
    Yang, Jiewen
    Lin, Ye
    Zhang, Peixuan
    Zhang, Lihua
    PATTERN RECOGNITION, 2024, 149
  • [28] On the use of IMU (inertial measurement unit) sensors in geomorphology
    Maniatis, Georgios
    EARTH SURFACE PROCESSES AND LANDFORMS, 2021, 46 (11) : 2136 - 2140
  • [29] 3D Hierarchical Refinement and Augmentation for Unsupervised Learning of Depth and Pose From Monocular Video
    Wang, Guangming
    Zhong, Jiquan
    Zhao, Shijie
    Wu, Wenhua
    Liu, Zhe
    Wang, Hesheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1776 - 1786
  • [30] Unsupervised Monocular Depth and Pose Estimation Using Multiple Masks Based on Photometric and Geometric Consistency
    Kong, Huifang
    Liu, Tiankuo
    Hu, Jie
    Fang, Yao
    Sun, Jixing
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3558 - 3563