Unsupervised Monocular Depth and Camera Pose Estimation with Multiple Masks and Geometric Consistency Constraints

被引:1
作者
Zhang, Xudong [1 ]
Zhao, Baigan [2 ]
Yao, Jiannan [2 ]
Wu, Guoqing [1 ]
机构
[1] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[2] Nantong Univ, Sch Mech Engn, Nantong 226019, Peoples R China
基金
中国国家自然科学基金;
关键词
depth estimation; camera pose; visual odometry; unsupervised learning; VISUAL ODOMETRY;
D O I
10.3390/s23115329
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper presents a novel unsupervised learning framework for estimating scene depth and camera pose from video sequences, fundamental to many high-level tasks such as 3D reconstruction, visual navigation, and augmented reality. Although existing unsupervised methods have achieved promising results, their performance suffers in challenging scenes such as those with dynamic objects and occluded regions. As a result, multiple mask technologies and geometric consistency constraints are adopted in this research to mitigate their negative impacts. Firstly, multiple mask technologies are used to identify numerous outliers in the scene, which are excluded from the loss computation. In addition, the identified outliers are employed as a supervised signal to train a mask estimation network. The estimated mask is then utilized to preprocess the input to the pose estimation network, mitigating the potential adverse effects of challenging scenes on pose estimation. Furthermore, we propose geometric consistency constraints to reduce the sensitivity of illumination changes, which act as additional supervised signals to train the network. Experimental results on the KITTI dataset demonstrate that our proposed strategies can effectively enhance the model's performance, outperforming other unsupervised methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] STRUCTURE GENERATION AND GUIDANCE NETWORK FOR UNSUPERVISED MONOCULAR DEPTH ESTIMATION
    Wang, Chaoqun
    Chen, Xuejin
    Min, Shaobo
    Wu, Feng
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1264 - 1269
  • [32] Unsupervised Monocular Depth Estimation From Light Field Image
    Zhou, Wenhui
    Zhou, Enci
    Liu, Gaomin
    Lin, Lili
    Lumsdaine, Andrew
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 1606 - 1617
  • [33] Unsupervised Monocular Depth Estimation Based on Scale Clue Enhancement
    Qu, Yi
    Chen, Ying
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (09): : 3217 - 3227
  • [34] Unsupervised Monocular Depth Estimation via Recursive Stereo Distillation
    Ye, Xinchen
    Fan, Xin
    Zhang, Mingliang
    Xu, Rui
    Zhong, Wei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4492 - 4504
  • [35] Efficient Unsupervised Monocular Depth Estimation with Inter-Frame Depth Interpolation
    Zhang, Min
    Li, Jianhua
    IMAGE AND GRAPHICS (ICIG 2021), PT III, 2021, 12890 : 729 - 741
  • [36] Unsupervised Domain Adaptation Depth Estimation Based on Self-attention Mechanism and Edge Consistency Constraints
    Guo, Peng
    Pan, Shuguo
    Hu, Peng
    Pei, Ling
    Yu, Baoguo
    NEURAL PROCESSING LETTERS, 2024, 56 (04)
  • [37] EndoSLAM dataset and an unsupervised monocular visual odometry and depth estimation approach for endoscopic videos
    Ozyoruk, Kutsev Bengisu
    Gokceler, Guliz Irem
    Bobrow, Taylor L.
    Coskun, Gulfize
    Incetan, Kagan
    Almalioglu, Yasin
    Mahmood, Faisal
    Curto, Eva
    Perdigoto, Luis
    Oliveira, Marina
    Sahin, Hasan
    Araujo, Helder
    Alexandrino, Henrique
    Durr, Nicholas J.
    Gilbert, Hunter B.
    Turan, Mehmet
    MEDICAL IMAGE ANALYSIS, 2021, 71
  • [38] DCL-depth: monocular depth estimation network based on iam and depth consistency loss
    Han C.
    Lv C.
    Kou Q.
    Jiang H.
    Cheng D.
    Multimedia Tools and Applications, 2025, 84 (8) : 4773 - 4787
  • [39] MBUDepthNet: Real-Time Unsupervised Monocular Depth Estimation Method for Outdoor Scenes
    Bian, Zhekai
    Wang, Xia
    Liu, Qiwei
    Lv, Shuaijun
    Wei, Ranfeng
    IEEE ACCESS, 2024, 12 : 63598 - 63609
  • [40] Advanced Monocular Outdoor Pose Estimation in Autonomous Systems: Leveraging Optical Flow, Depth Estimation, and Semantic Segmentation with Dynamic Object Removal
    Ghasemieh, Alireza
    Kashef, Rasha
    Sensors, 2024, 24 (24)