Deep learning based multi-view stereo matching and 3D scene reconstruction from oblique aerial images

被引:23
|
作者
Liu, Jin [1 ]
Gao, Jian [1 ]
Ji, Shunping [1 ]
Zeng, Chang [1 ]
Zhang, Shaoyi [1 ]
Gong, Jianya [1 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
3D scene reconstruction; Multi-view stereo; Oblique aerial images; Deep learning; Dense image matching;
D O I
10.1016/j.isprsjprs.2023.08.015
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
In this paper, we propose a practical three-dimensional (3D) real-scene reconstruction framework named Deep3D, which is paired with a deep learning based multi-view stereo (MVS) matching model named the adaptive multi-view aggregation matching (Ada-MVS) model, to obtain a 3D textured mesh model from multi view oblique aerial images. Deep3D is the first deep learning based framework for 3D scene reconstruction, in which aerial triangulation and view selection are first performed on the input images, and the depth map of each image is then inferred using the pretrained Ada-MVS model. All the inferred depth maps are then fused into a dense point cloud after filtering the outliers. Finally, the 3D textured mesh is extracted from the dense 3D points as the final product. In the Ada-MVS model, a novel adaptive inter-view aggregation module is specially proposed to address the inconsistent information among oblique views and to fuse the multi-view costs into a robust cost volume. A lightweight recurrent regularization module is also designed for high-efficiency processing of high-capacity aerial images with large depth variations. Moreover, as oblique aerial image datasets are currently lacking, we built a large-scale synthetic multi-view oblique aerial image dataset (WHU-OMVS dataset) for deep learning based model training and methodology evaluation for the task of 3D scene reconstruction. The experimental results show that, firstly, the proposed Ada-MVS model has obvious advantages when used with high capacity oblique aerial images, compared with several relevant learning-based MVS methods. Secondly, through a comprehensive comparison with popular commercial software packages and open-source solutions, it is shown that the proposed Deep3D framework outperforms all the other solutions in terms of reconstruction quality, and outperforms all the open-source solutions and some of the software packages in terms of efficiency on the WHU-OMVS dataset. Thirdly, the Deep3D framework shows a stable generalization ability and excellent performance when applied to other oblique or nadir aerial images, without any further fine-tuning. The dataset and code will be available at http://gpcv.whu.edu.cn/data.
引用
收藏
页码:42 / 60
页数:19
相关论文
共 50 条
  • [1] Automatic 3D building reconstruction from multi-view aerial images with deep learning
    Yu, Dawen
    Ji, Shunping
    Liu, Jin
    Wei, Shiqing
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 171 : 155 - 170
  • [2] A general deep learning based framework for 3D reconstruction from multi-view stereo satellite images
    Gao, Jian
    Liu, Jin
    Ji, Shunping
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 195 : 446 - 461
  • [3] An Extension of PatchMatch Stereo for 3D Reconstruction from Multi-View Images
    Hiradate, Mutsuki
    Ito, Koichi
    Aoki, Takafumi
    Watanabe, Takafumi
    Unten, Hiroki
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 61 - 65
  • [4] Deep Learning for 3D Scene Reconstruction and Segmentation from Stereo Images
    Kniaz, Vladimir V.
    Knyaz, Vladimir A.
    Ippolitov, Evgeny, V
    Novikov, Mikhail M.
    Grodzistky, Lev
    Moshkantsev, Petr
    MULTIMODAL SENSING AND ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS II, 2021, 11785
  • [5] Improvement on Matching Breakage of Multi-View Stereo 3D Reconstruction
    Lin, Hung-Lin
    Lin, Tsung-Yi
    Li, Yi-Xuan
    Tseng, Yu-Sheng
    Li, Xin-Yi
    Cal, Qlan-Wen
    Chen, Zheng
    Shi, Yi-Rou
    PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON ADVANCED MATERIALS FOR SCIENCE AND ENGINEERING (IEEE-ICAMSE 2016), 2016, : 423 - 425
  • [6] Underwater 3D reconstruction based on multi-view stereo
    Gu, Feifei
    Zhao, Juan
    Xu, Pei
    Huang, Shulan
    Zhang, Gaopeng
    Song, Zhan
    OCEAN OPTICS AND INFORMATION TECHNOLOGY, 2018, 10850
  • [7] Multi-View Stereo 3D Edge Reconstruction
    Bignoli, Andrea
    Romanoni, Andrea
    Matteucci, Matteo
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 867 - 875
  • [8] 3D Face Reconstruction based on Multi-View Stereo Algorithm
    Peng, Keju
    Guan, Tao
    Xu, Chao
    Zhou, Dongxiang
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS IEEE-ROBIO 2014, 2014, : 2310 - 2314
  • [9] PlaneMVS: 3D Plane Reconstruction from Multi-View Stereo
    Liu, Jiachen
    Ji, Pan
    Bansal, Nitin
    Cai, Changjiang
    Yan, Qingan
    Huang, Xiaolei
    Xu, Yi
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8655 - 8665
  • [10] 3D building model reconstruction from multi-view aerial images and LiDAR data
    Cheng, Liang
    Gong, Jianya
    Li, Manchun
    Liu, Yongxue
    Song, Xiaogang
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2009, 38 (06): : 494 - 501