MuDeepNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose Using Multi-view Consistency Loss

被引:9
|
作者
Zhang, Jun-Ning [1 ]
Su, Qun-Xing [2 ]
Liu, Peng-Yuan [1 ]
Ge, Hong-Yu [1 ]
Zhang, Ze-Feng [3 ]
机构
[1] Army Engn Univ, Shijiazhuang Campus, Shijiazhuang 050003, Hebei, Peoples R China
[2] Army Command Coll, Nanjing 210016, Jiangsu, Peoples R China
[3] 32180 Troops, Beijing 710032, Peoples R China
关键词
Deep learning; depth consistency loss; depth estimation; optical flow; optical flow consistency loss; visual odometry (VO);
D O I
10.1007/s12555-018-0926-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We take formulate structure from motion as a learning problem, and propose an end-to-end learning framework to calculate the image depth, optical flow, and the camera motion. This framework is composed of multiple encoder-decoder networks. The key part of the network structure is the FlowNet, which can improve the accuracy of the estimated camera ego-motion and depth. As with recent studies, we use an end-to-end learning approach with multi-view synthesis as a variety of supervision, and proposes multi-view consistency losses to constrain both depth and camera ego-motion, requiring only monocular video sequences for training. Compared to the recently popular depth-estimation-networks using a single image, our network learns to use motion parallax correction depth. Although MuDeepNet training requires the use of two adjacent frames to obtain motion parallax, it is tested by using a single image. Thus, MuDeepNet is a monocular system. The experiments on KITTI dataset show our MuDeepNet outperforms other methods.
引用
收藏
页码:2586 / 2596
页数:11
相关论文
共 49 条
  • [1] MuDeepNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose Using Multi-view Consistency Loss
    Jun-Ning Zhang
    Qun-Xing Su
    Peng-Yuan Liu
    Hong-Yu Ge
    Ze-Feng Zhang
    International Journal of Control, Automation and Systems, 2019, 17 : 2586 - 2596
  • [2] GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose
    Yin, Zhichao
    Shi, Jianping
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1983 - 1992
  • [3] UnLearnerMC: Unsupervised learning of dense depth and camera pose using mask and cooperative loss
    Zhang, Junning
    Su, Qunxing
    Liu, Pengyuan
    Xu, Chao
    Wang, Zhengjun
    KNOWLEDGE-BASED SYSTEMS, 2020, 192 (192)
  • [4] Multi-view Consistency as Supervisory Signal for Learning Shape and Pose Prediction
    Tulsiani, Shubham
    Efros, Alexei A.
    Malik, Jitendra
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2897 - 2905
  • [5] Deep learning based camera pose estimation in multi-view environment
    Charco, Jorge L.
    Vintimilla, Boris X.
    Sappa, Angel D.
    2018 14TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS), 2018, : 224 - 228
  • [6] Occlusion-Aware Unsupervised Learning of Monocular Depth, Optical Flow and Camera Pose with Geometric Constraints
    Teng, Qianru
    Chen, Yimin
    Huang, Chen
    FUTURE INTERNET, 2018, 10 (10)
  • [7] Unsupervised Learning of Depth Estimation and Camera Pose With Multi-Scale GANs
    Xu, Yufan
    Wang, Yan
    Huang, Rui
    Lei, Zeyu
    Yang, Junyao
    Li, Zijian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 17039 - 17047
  • [8] Unsupervised cycle optimization learning for single-view depth and camera pose with Kalman filter
    Gu, Tianhao
    Wang, Zhe
    Chi, Ziqiu
    Zhu, Yiwen
    Du, Wenli
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [9] Unsupervised Monocular Estimation of Depth and Visual Odometry Using Attention and Depth-Pose Consistency Loss
    Song, Xiaogang
    Hu, Haoyue
    Liang, Li
    Shi, Weiwei
    Xie, Guo
    Lu, Xiaofeng
    Hei, Xinhong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3517 - 3529
  • [10] Pose Determination from Multi-View Image using Deep Learning
    Sun, Shantong
    Liu, Rongke
    Pan, Yu
    Du, Qiuchen
    Sun, Shuqiao
    Su, Han
    2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2019, : 1494 - 1498