Unsupervised Monocular Depth and Camera Pose Estimation with Multiple Masks and Geometric Consistency Constraints

被引:1
作者
Zhang, Xudong [1 ]
Zhao, Baigan [2 ]
Yao, Jiannan [2 ]
Wu, Guoqing [1 ]
机构
[1] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[2] Nantong Univ, Sch Mech Engn, Nantong 226019, Peoples R China
基金
中国国家自然科学基金;
关键词
depth estimation; camera pose; visual odometry; unsupervised learning; VISUAL ODOMETRY;
D O I
10.3390/s23115329
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper presents a novel unsupervised learning framework for estimating scene depth and camera pose from video sequences, fundamental to many high-level tasks such as 3D reconstruction, visual navigation, and augmented reality. Although existing unsupervised methods have achieved promising results, their performance suffers in challenging scenes such as those with dynamic objects and occluded regions. As a result, multiple mask technologies and geometric consistency constraints are adopted in this research to mitigate their negative impacts. Firstly, multiple mask technologies are used to identify numerous outliers in the scene, which are excluded from the loss computation. In addition, the identified outliers are employed as a supervised signal to train a mask estimation network. The estimated mask is then utilized to preprocess the input to the pose estimation network, mitigating the potential adverse effects of challenging scenes on pose estimation. Furthermore, we propose geometric consistency constraints to reduce the sensitivity of illumination changes, which act as additional supervised signals to train the network. Experimental results on the KITTI dataset demonstrate that our proposed strategies can effectively enhance the model's performance, outperforming other unsupervised methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Unsupervised Monocular Depth and Pose Estimation Using Multiple Masks Based on Photometric and Geometric Consistency
    Kong, Huifang
    Liu, Tiankuo
    Hu, Jie
    Fang, Yao
    Sun, Jixing
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 3558 - 3563
  • [2] Occlusion-Aware Unsupervised Learning of Monocular Depth, Optical Flow and Camera Pose with Geometric Constraints
    Teng, Qianru
    Chen, Yimin
    Huang, Chen
    FUTURE INTERNET, 2018, 10 (10)
  • [3] Unsupervised monocular visual odometry with decoupled camera pose estimation
    Lin, Lili
    Wang, Weisheng
    Luo, Wan
    Song, Lesheng
    Zhou, Wenhui
    DIGITAL SIGNAL PROCESSING, 2021, 114
  • [4] Unsupervised Estimation of Monocular Depth and VO in Dynamic Environments via Hybrid Masks
    Sun, Qiyu
    Tang, Yang
    Zhang, Chongzhen
    Zhao, Chaoqiang
    Qian, Feng
    Kurths, Jurgen
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (05) : 2023 - 2033
  • [5] Unsupervised Monocular Estimation of Depth and Visual Odometry Using Attention and Depth-Pose Consistency Loss
    Song, Xiaogang
    Hu, Haoyue
    Liang, Li
    Shi, Weiwei
    Xie, Guo
    Lu, Xiaofeng
    Hei, Xinhong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3517 - 3529
  • [6] Unsupervised Learning of Depth Estimation and Camera Pose With Multi-Scale GANs
    Xu, Yufan
    Wang, Yan
    Huang, Rui
    Lei, Zeyu
    Yang, Junyao
    Li, Zijian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 17039 - 17047
  • [7] An Adaptive Unsupervised Learning Framework for Monocular Depth Estimation
    Yang, Delong
    Zhong, Xunyu
    Lin, Lixiong
    Peng, Xiafu
    IEEE ACCESS, 2019, 7 : 148142 - 148151
  • [8] Unsupervised Learning of Monocular Depth and Ego-Motion with Optical Flow Features and Multiple Constraints
    Zhao, Baigan
    Huang, Yingping
    Ci, Wenyan
    Hu, Xing
    SENSORS, 2022, 22 (04)
  • [9] Monocular Depth Estimation Based on Unsupervised Learning
    Liu, Wan
    Sun, Yan
    Wang, XuCheng
    Yang, Lin
    Zheng, Zhenrong
    OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY VI, 2019, 11187
  • [10] Masked GAN for Unsupervised Depth and Pose Prediction With Scale Consistency
    Zhao, Chaoqiang
    Yen, Gary G.
    Sun, Qiyu
    Zhang, Chongzhen
    Tang, Yang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (12) : 5392 - 5403