Unsupervised Learning of Depth and Pose Based on Monocular Camera and Inertial Measurement Unit (IMU)

被引:3
作者
Wang, Yanbo [1 ]
Yang, Hanwen [2 ]
Cai, Jianwei [3 ]
Wang, Guangming [4 ,5 ]
Wang, Jingchuan [4 ,5 ]
Huang, Yi [6 ]
机构
[1] Shanghai Jiao Tong Univ, Univ Michigan Shanghai Jiao Tong Univ Joint Inst, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200240, Peoples R China
[4] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
[5] Minist Educ China, Key Lab Syst Control & Informat Proc, Beijing, Peoples R China
[6] Shanghai Weitong Vis Technol Co Ltd, Shanghai, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023) | 2023年
关键词
D O I
10.1109/ICRA48891.2023.10160277
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The main content of the research in this paper is the estimation of depth and pose based on monocular vision and Inertial Measurement Unit (IMU). The usual depth estimation network and pose estimation network require depth ground truth or pose ground truth as a supervised signal for training, while the depth ground truth and pose ground truth are hard to obtain, and monocular vision based depth estimation cannot predict absolute depth. In this paper, with the help of IMU, which is inexpensive and widely used, we can obtain angular velocity and acceleration information. Two new supervision signals are proposed and the calculation expressions are given. Among them, the model trained with acceleration constraint shows a good ability to estimate the absolute depth during the test. It can be considered that the model can estimate the absolute depth. We also derive the method of estimating the scale factor during the test from the acceleration constraint, and also achieve good results as the acceleration constraint does. In addition, this paper also studies the method of using IMU information as pose network input and as selecting conditions. Moreover, it analyzes and discusses the experimental results. At the same time, we also evaluate the effect of the pose estimation of the relevant models. This article starts by reviewing the achievements and deficiencies of the work in this field, combines the use of IMU, puts forward three new methods such as a new loss function, and conducts a test analysis and discussion of relevant indicators on the KITTI data set.
引用
收藏
页码:10010 / 10017
页数:8
相关论文
共 50 条
  • [31] Unsupervised Monocular Depth and Pose Estimation Using Multiple Masks Based on Photometric and Geometric Consistency
    Kong, Huifang
    Liu, Tiankuo
    Hu, Jie
    Fang, Yao
    Sun, Jixing
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3558 - 3563
  • [32] Unsupervised cycle optimization learning for single-view depth and camera pose with Kalman filter
    Gu, Tianhao
    Wang, Zhe
    Chi, Ziqiu
    Zhu, Yiwen
    Du, Wenli
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [33] Assessment of Breathing Parameters Using an Inertial Measurement Unit (IMU)-Based System
    Cesareo, Ambra
    Previtali, Ylenia
    Biffi, Emilia
    Aliverti, Andrea
    SENSORS, 2019, 19 (01)
  • [34] Comparison of Camera based and Inertial Measurement Unit based Motion Analysis
    Jang, Seongho
    Park, Si-bog
    Moon, Sang-bog
    Kim, Jae Min
    Lee, Shi-uk
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON SENSOR NETWORKS (SENSORNETS), 2018, : 161 - 167
  • [35] Unsupervised Learning of Depth and Pose Estimation based on Continuous Frame Window
    Shang, Suning
    Wang, Huaimin
    Zhang, Pengfei
    Ding, Bo
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [36] Unsupervised learning for monocular depth and motion with real scale
    Liu, Sibo
    Fang, Lijin
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 630 - 635
  • [37] Learning monocular depth estimation with unsupervised trinocular assumptions
    Poggi, Matteo
    Tosi, Fabio
    Mattoccia, Stefano
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 324 - 333
  • [38] An Adaptive Unsupervised Learning Framework for Monocular Depth Estimation
    Yang, Delong
    Zhong, Xunyu
    Lin, Lixiong
    Peng, Xiafu
    IEEE ACCESS, 2019, 7 : 148142 - 148151
  • [39] Learning-Based Depth and Pose Estimation for Monocular Endoscope with Loss Generalization
    Widya, Aji Resindra
    Monno, Yusuke
    Okutomi, Masatoshi
    Suzuki, Sho
    Gotoda, Takuji
    Miki, Kenji
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 3547 - 3552
  • [40] Inertial measurement unit-camera calibration based on incomplete inertial sensor information
    Hong Liu
    Yu-long Zhou
    Zhao-peng Gu
    Journal of Zhejiang University SCIENCE C, 2014, 15 : 999 - 1008