Bidirectional Hybrid LSTM Based Recurrent Neural Network for Multi-View Stereo

被引:10
作者
Wei, Zizhuang [1 ]
Zhu, Qingtian [1 ]
Min, Chen [1 ]
Chen, Yisong [1 ]
Wang, Guoping [1 ]
机构
[1] Peking Univ, Dept EECS, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Costs; Feature extraction; Three-dimensional displays; Runtime; Point cloud compression; Image reconstruction; Recurrent neural networks; 3D reconstruction; deep learning; multi-view stereo; recurrent neural network; point clouds;
D O I
10.1109/TVCG.2022.3165860
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, deep learning based multi-view stereo (MVS) networks have demonstrated their excellent performance on various benchmarks. In this paper, we present an effective and efficient recurrent neural network (RNN) for accurate and complete dense point cloud reconstruction. Instead of regularizing the cost volume via conventional 3D CNN or unidirectional RNN like previous attempts, we adopt a bidirectional hybrid Long Short-Term Memory (LSTM) based structure for cost volume regularization. The proposed bidirectional recurrent regularization is able to perceive full-space context information comparable to 3D CNNs while saving runtime memory. For post-processing, we introduce a visibility based approach for depth map refinement to obtain more accurate dense point clouds. Extensive experiments on DTU, Tanks and Temples and ETH3D datasets demonstrate that our method outperforms previous state-of-the-art MVS methods and exhibits high memory efficiency at runtime.
引用
收藏
页码:3062 / 3073
页数:12
相关论文
共 41 条
[1]   Large-Scale Data for Multiple-View Stereopsis [J].
Aanaes, Henrik ;
Jensen, Rasmus Ramsbol ;
Vogiatzis, George ;
Tola, Engin ;
Dahl, Anders Bjorholm .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 120 (02) :153-168
[2]   PatchMatch: A Randomized Correspondence Algorithm for Structural Image Editing [J].
Barnes, Connelly ;
Shechtman, Eli ;
Finkelstein, Adam ;
Goldman, Dan B. .
ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (03)
[3]   Point-Based Multi-View Stereo Network [J].
Chen, Rui ;
Han, Songfang ;
Xu, Jing ;
Su, Hao .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1538-1547
[4]   Deep Stereo using Adaptive Thin Volume Representation with Uncertainty Awareness [J].
Cheng, Shuo ;
Xu, Zexiang ;
Zhu, Shilin ;
Li, Zhuwen ;
Li, Li Erran ;
Ramamoorthi, Ravi ;
Su, Hao .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2521-2531
[5]   MVE-An image-based reconstruction environment [J].
Fuhrmann, Simon ;
Langguth, Fabian ;
Moehrle, Nils ;
Waechter, Michael ;
Goesele, Michael .
COMPUTERS & GRAPHICS-UK, 2015, 53 :44-53
[6]   Accurate, Dense, and Robust Multiview Stereopsis [J].
Furukawa, Yasutaka ;
Ponce, Jean .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (08) :1362-1376
[7]   Massively Parallel Multiview Stereopsis by Surface Normal Diffusion [J].
Galliani, Silvano ;
Lasinger, Katrin ;
Schindler, Konrad .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :873-881
[8]   Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching [J].
Gu, Xiaodong ;
Fan, Zhiwen ;
Zhu, Siyu ;
Dai, Zuozhuo ;
Tan, Feitong ;
Tan, Ping .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2492-2501
[9]   DeepMVS: Learning Multi-view Stereopsis [J].
Huang, Po-Han ;
Matzen, Kevin ;
Kopf, Johannes ;
Ahuja, Narendra ;
Huang, Jia-Bin .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2821-2830
[10]  
Ioffe S, 2015, PR MACH LEARN RES, V37, P448