DoubleTake: Geometry Guided Depth Estimation

被引:0
作者
Sayed, Mohamed [1 ]
Aleotti, Filippo [1 ]
Watson, Jamie [1 ,2 ]
Ureshi, Zawar [1 ]
Garcia-Hernando, Guillermo [1 ]
Brostow, Gabriel [1 ,2 ]
Vicente, Sara [1 ]
Firman, Michael [1 ]
机构
[1] Niantic, San Francisco, CA 94111 USA
[2] UCL, London, England
来源
COMPUTER VISION - ECCV 2024, PT LXXIII | 2025年 / 15131卷
关键词
STEREO;
D O I
10.1007/978-3-031-73464-9_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating depth from a sequence of posed RGB images is a fundamental computer vision task, with applications in augmented reality, path planning etc.. Prior work typically makes use of previous frames in a multi view stereo framework, relying on matching textures in a local neighborhood. In contrast, our model leverages historical predictions by giving the latest 3D geometry data as an extra input to our network. This self-generated geometric hint can encode information from areas of the scene not covered by the keyframes and it is more regularized when compared to individual predicted depth maps for previous frames. We introduce a Hint MLP which combines cost volume features with a hint of the prior geometry, rendered as a depth map from the current camera location, together with a measure of the confidence in the prior geometry. We demonstrate that our method, which can run at interactive speeds, achieves state-of-the-art estimates of depth and 3D scene reconstruction in both offline and incremental evaluation scenarios.
引用
收藏
页码:121 / 138
页数:18
相关论文
共 79 条
[1]  
Apple, 2023, ARKit
[2]   NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis [J].
Ben Mildenhall ;
Srinivasan, Pratul P. ;
Tancik, Matthew ;
Barron, Jonathan T. ;
Ramamoorthi, Ravi ;
Ng, Ren .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :405-421
[3]  
Bozic A, 2021, ADV NEUR IN
[4]   RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo [J].
Cai, Changjiang ;
Ji, Pan ;
Yan, Qingan ;
Xu, Yi .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :919-928
[5]  
Casser V, 2019, AAAI CONF ARTIF INTE, P8001
[6]   Pyramid Stereo Matching Network [J].
Chang, Jia-Ren ;
Chen, Yong-Sheng .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5410-5418
[7]  
Chen H., 2022, ICRA
[8]   Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution [J].
Chen, Yunpeng ;
Fan, Haoqi ;
Xu, Bing ;
Yan, Zhicheng ;
Kalantidis, Yannis ;
Rohrbach, Marcus ;
Yan, Shuicheng ;
Feng, Jiashi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3434-3443
[9]  
Cheng X., 2019, IEEE T PATTERN ANAL
[10]  
Cheng Z., 2023, WACV