A general deep learning based framework for 3D reconstruction from multi-view stereo satellite images

Cited by: 25
Authors
Gao, Jian [1]
Liu, Jin [1]
Ji, Shunping [1]
Affiliations
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multi-view stereo; Optical satellite images; Deep learning; Dense matching; 3D reconstruction; IKONOS; MODEL
DOI
10.1016/j.isprsjprs.2022.12.012
Chinese Library Classification
P9 [Physical geography]
Discipline classification code
0705; 070501
Abstract
In this paper, we propose a general deep learning based framework, named Sat-MVSF, to perform three-dimensional (3D) reconstruction of the Earth's surface from multi-view optical satellite images. The framework is a complete processing pipeline, including pre-processing, a multi-view stereo (MVS) network for satellite imagery (Sat-MVSNet), and post-processing. The pre-processing handles the geometric and radiometric configuration of the multi-view images and their cropping. The cropped multi-view patches are then fed into Sat-MVSNet, which includes deep feature extraction, rational polynomial camera (RPC) warping, pyramid cost volume construction, regularization, and regression, to obtain the height maps. Erroneous matches are then filtered out and a digital surface model (DSM) is generated in the post-processing. Considering the complexity and diversity of real-world scenes, we also introduce a self-refinement strategy that does not require any ground-truth labels to enhance the performance and robustness of the Sat-MVSF framework. We comprehensively compare the proposed framework with popular commercial software and open-source methods, to demonstrate the potential of the proposed deep learning framework. On the WHU-TLC dataset, where the images are captured with a three-line camera (TLC), the proposed framework outperforms all the other solutions in terms of reconstruction fineness, and also outperforms most of the other methods in terms of efficiency. On the challenging MVS3D dataset, where the images are captured by the WorldView-3 satellite at different times and seasons, the proposed framework also exceeds the existing methods when using the model pretrained on aerial images and the introduced self-refinement strategy, demonstrating a high generalization ability.
We also note that the lack of training samples hinders research in this field, and the availability of more high-quality open-source training data will greatly accelerate the research into deep learning based MVS satellite image reconstruction. The code will be available at https://gpcv.whu.edu.cn/data.
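As an illustration of the final "regression" stage named in the abstract, the sketch below shows one common way a height value can be regressed from a per-pixel cost vector: a soft-argmin, i.e. the expected height under a softmax over negated costs. The abstract does not state which regression Sat-MVSNet uses, so this choice is an assumption borrowed from typical learning-based MVS pipelines, and the names `regress_height` and `regress_height_map` are hypothetical.

```python
import math


def regress_height(costs, heights):
    """Soft-argmin over height hypotheses for one pixel (illustrative only).

    costs[i] is the matching cost of hypothesis heights[i]; lower is better.
    """
    if not costs or len(costs) != len(heights):
        raise ValueError("costs and heights must be non-empty and equal in length")
    # Subtract the minimum cost so the best hypothesis gets weight exp(0) = 1,
    # which keeps the exponentials numerically stable.
    best = min(costs)
    weights = [math.exp(-(c - best)) for c in costs]
    total = sum(weights)
    # Expected height under the softmax(-cost) distribution.
    return sum(w * h for w, h in zip(weights, heights)) / total


def regress_height_map(cost_volume, heights):
    """Apply regress_height to every pixel.

    cost_volume is assumed to be a nested list indexed as [row][col][hypothesis].
    """
    return [[regress_height(pixel, heights) for pixel in row] for row in cost_volume]


if __name__ == "__main__":
    hypotheses = [10.0, 20.0, 30.0]           # candidate heights in metres (made up)
    volume = [[[0.0, 8.0, 8.0], [5.0, 0.0, 5.0]]]  # 1 x 2 pixels, 3 hypotheses each
    print(regress_height_map(volume, hypotheses))
```

Because the output is a weighted average rather than a hard arg-min, it can fall between the sampled hypotheses, which is why coarse-to-fine (pyramid) schemes can refine a narrow height range at each level.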
Pages: 446-461
Page count: 16
Related papers
50 in total
  • [21] Pruning multi-view stereo net for efficient 3D reconstruction
    Xiang, Xiang
    Wang, Zhiyuan
    Lao, Shanshan
    Zhang, Baochang
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 168 (168) : 17 - 27
  • [22] Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction
    Orsingher, Marco
    Zani, Paolo
    Medici, Paolo
    Bertozzi, Massimo
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 190 - 196
  • [23] An attention-based and deep sparse priori cascade multi-view stereo network for 3D reconstruction
    Wang, Yadong
    Ran, Teng
    Liang, Yuan
    Zheng, Guoquan
    COMPUTERS & GRAPHICS-UK, 2023, 116 : 383 - 392
  • [24] 3D SEMANTIC SEGMENTATION FROM MULTI-VIEW OPTICAL SATELLITE IMAGES
    d'Angelo, Pablo
    Cerra, Daniele
    Azimi, Seyed Majid
    Merkle, Nina
    Tian, Jiaojiao
    Auer, Stefan
    Pato, Miguel
    de los Reyes, Raquel
    Zhuo, Xiangyu
    Bittner, Ksenia
    Krauss, Thomas
    Reinartz, Peter
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 5053 - 5056
  • [25] 3D Concept Learning and Reasoning from Multi-View Images
    Hong, Yining
    Lin, Chunru
    Du, Yilun
    Chen, Zhenfang
    Tenenbaum, Joshua B.
    Gan, Chuang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9202 - 9212
  • [26] Towards Dense 3D Reconstruction for Mixed Reality in Healthcare: Classical Multi-View Stereo vs Deep Learning
    Prokopetc, Kristina
    Dupont, Romain
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2061 - 2069
  • [27] Multi-View Images 3D Reconstruction based on Spatial Geometric Constraint
    Liu, Haibo
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 1217 - 1220
  • [28] 3D Clothed Human Reconstruction from Sparse Multi-View Images
    Hong, Jin Gyu
    Noh, Seung Young
    Lee, Hee Kyung
    Cheong, Won Sik
    Chang, Ju Yong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 677 - 687
  • [29] Deep Learning for 3D Scene Reconstruction and Segmentation from Stereo Images
    Kniaz, Vladimir V.
    Knyaz, Vladimir A.
    Ippolitov, Evgeny V.
    Novikov, Mikhail M.
    Grodzistky, Lev
    Moshkantsev, Petr
    MULTIMODAL SENSING AND ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS II, 2021, 11785
  • [30] View Planning for Multi-View Stereo 3D Reconstruction Using an Autonomous Multicopter
    Schmid, Korbinian
    Hirschmüller, Heiko
    Dömel, Andreas
    Grixa, Iris
    Suppa, Michael
    Hirzinger, Gerd
    Journal of Intelligent & Robotic Systems, 2012, 65 : 309 - 323