Deep learning based multi-view stereo matching and 3D scene reconstruction from oblique aerial images

Cited by: 23
Authors
Liu, Jin [1 ]
Gao, Jian [1 ]
Ji, Shunping [1 ]
Zeng, Chang [1 ]
Zhang, Shaoyi [1 ]
Gong, Jianya [1 ]
Affiliations
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
3D scene reconstruction; Multi-view stereo; Oblique aerial images; Deep learning; Dense image matching;
DOI
10.1016/j.isprsjprs.2023.08.015
Chinese Library Classification
P9 [Physical Geography];
Discipline classification code
0705 ; 070501 ;
Abstract
In this paper, we propose a practical three-dimensional (3D) real-scene reconstruction framework named Deep3D, which is paired with a deep learning based multi-view stereo (MVS) matching model named the adaptive multi-view aggregation matching (Ada-MVS) model, to obtain a 3D textured mesh model from multi-view oblique aerial images. Deep3D is the first deep learning based framework for 3D scene reconstruction, in which aerial triangulation and view selection are first performed on the input images, and the depth map of each image is then inferred using the pretrained Ada-MVS model. All the inferred depth maps are then fused into a dense point cloud after filtering the outliers. Finally, the 3D textured mesh is extracted from the dense 3D points as the final product. In the Ada-MVS model, a novel adaptive inter-view aggregation module is specially proposed to address the inconsistent information among oblique views and to fuse the multi-view costs into a robust cost volume. A lightweight recurrent regularization module is also designed for high-efficiency processing of high-capacity aerial images with large depth variations. Moreover, as oblique aerial image datasets are currently lacking, we built a large-scale synthetic multi-view oblique aerial image dataset (WHU-OMVS dataset) for deep learning based model training and methodology evaluation for the task of 3D scene reconstruction. The experimental results show that, firstly, the proposed Ada-MVS model has obvious advantages when used with high-capacity oblique aerial images, compared with several relevant learning-based MVS methods. Secondly, through a comprehensive comparison with popular commercial software packages and open-source solutions, it is shown that the proposed Deep3D framework outperforms all the other solutions in terms of reconstruction quality, and outperforms all the open-source solutions and some of the software packages in terms of efficiency on the WHU-OMVS dataset. Thirdly, the Deep3D framework shows a stable generalization ability and excellent performance when applied to other oblique or nadir aerial images, without any further fine-tuning. The dataset and code will be available at http://gpcv.whu.edu.cn/data.
Pages: 42 - 60
Page count: 19
Related papers
50 in total
  • [21] Multi-view stereo for weakly textured indoor 3D reconstruction
    Wang, Tao
    Gan, Vincent J. L.
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (10) : 1469 - 1489
  • [22] Pruning multi-view stereo net for efficient 3D reconstruction
    Xiang, Xiang
    Wang, Zhiyuan
    Lao, Shanshan
    Zhang, Baochang
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 168 (168) : 17 - 27
  • [23] Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction
    Orsingher, Marco
    Zani, Paolo
    Medici, Paolo
    Bertozzi, Massimo
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 190 - 196
  • [24] An attention-based and deep sparse priori cascade multi-view stereo network for 3D reconstruction
    Wang, Yadong
    Ran, Teng
    Liang, Yuan
    Zheng, Guoquan
    COMPUTERS & GRAPHICS-UK, 2023, 116 : 383 - 392
  • [25] Fast Window Based Stereo Matching for 3D Scene Reconstruction
    Chowdhury, Mohammad Mozammel
    Bhuiyah, Mohammad Al-Amin
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2013, 10 (03) : 209 - 214
  • [26] 3D Concept Learning and Reasoning from Multi-View Images
    Hong, Yining
    Lin, Chunru
    Du, Yilun
    Chen, Zhenfang
    Tenenbaum, Joshua B.
    Gan, Chuang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9202 - 9212
  • [27] 3D reconstruction and depth estimation method for local anomalies of rail surface based on multi-view stereo matching
    Hu, Pengyu
    Zhong, Qianwen
    Zheng, Shubin
    Chen, Xieqi
    Peng, Lele
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (01)
  • [28] Towards Dense 3D Reconstruction for Mixed Reality in Healthcare: Classical Multi-View Stereo vs Deep Learning
    Prokopetc, Kristina
    Dupont, Romain
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2061 - 2069
  • [29] A Hybrid Multi-View 3D Reconstruction Method Based on Scene Graph Partition
    Xue J.-S.
    Yi H.
    Wu Z.-H.
    Chen X.-N.
    Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (04) : 782 - 795
  • [30] Multi-View Images 3D Reconstruction based on Spatial Geometric Constraint
    Liu, Haibo
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 1217 - 1220