FG-Depth: Flow-Guided Unsupervised Monocular Depth Estimation

被引:1
作者
Zhu, Junyu [1 ]
Liu, Lina [1 ]
Liu, Yong [1 ]
Li, Wanlong [2 ]
Wen, Feng [2 ]
Zhang, Hongbo [2 ]
机构
[1] Zhejiang Univ, Inst Cyber Syst & Control, Hangzhou, Peoples R China
[2] Huawei Technol, Noahs Ark Lab, Beijing, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA | 2023年
关键词
D O I
10.1109/ICRA48891.2023.10160534
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The great potential of unsupervised monocular depth estimation has been demonstrated by many works due to low annotation cost and impressive accuracy comparable to supervised methods. To further improve the performance, recent works mainly focus on designing more complex network structures and exploiting extra supervised information, e.g., semantic segmentation. These methods optimize the models by exploiting the reconstructed relationship between the target and reference images in varying degrees. However, previous methods prove that this image reconstruction optimization is prone to get trapped in local minima. In this paper, our core idea is to guide the optimization with prior knowledge from pretrained Flow-Net. And we show that the bottleneck of unsupervised monocular depth estimation can be broken with our simple but effective framework named FG-Depth. In particular, we propose (i) a flow distillation loss to replace the typical photometric loss that limits the capacity of the model and (ii) a prior flow based mask to remove invalid pixels that bring the noise in training loss. Extensive experiments demonstrate the effectiveness of each component, and our approach achieves state-of-the-art results on both KITTI and NYU-Depth-v2 datasets.
引用
收藏
页码:4924 / 4930
页数:7
相关论文
共 42 条
  • [1] [Anonymous], 2018, ARXIV181003654
  • [2] Bhat S. F., 2020, CVPR
  • [3] Bin Cheng, 2020, Computer Vision - ECCV 2020 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12375), P52, DOI 10.1007/978-3-030-58577-8_4
  • [4] Chang Shu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12364), P572, DOI 10.1007/978-3-030-58529-7_34
  • [5] Cho K., 2014, ARXIV, DOI 10.3115/v1/w14-4012
  • [6] Dan X., 2018, IEEE
  • [7] Dosovitskiy A., 2021, ICLR
  • [8] FlowNet: Learning Optical Flow with Convolutional Networks
    Dosovitskiy, Alexey
    Fischer, Philipp
    Ilg, Eddy
    Haeusser, Philip
    Hazirbas, Caner
    Golkov, Vladimir
    van der Smagt, Patrick
    Cremers, Daniel
    Brox, Thomas
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2758 - 2766
  • [9] Eigen D., 2014, NIPS, DOI DOI 10.5555/2969033.2969091
  • [10] Deep Ordinal Regression Network for Monocular Depth Estimation
    Fu, Huan
    Gong, Mingming
    Wang, Chaohui
    Batmanghelich, Kayhan
    Tao, Dacheng
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2002 - 2011