A general deep learning based framework for 3D reconstruction from multi-view stereo satellite images

被引:25
|
作者
Gao, Jian [1 ]
Liu, Jin [1 ]
Ji, Shunping [1 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi -view stereo; Optical satellite images; Deep learning; Dense matching; 3D reconstruction; IKONOS; MODEL;
D O I
10.1016/j.isprsjprs.2022.12.012
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
In this paper, we propose a general deep learning based framework, named Sat-MVSF, to perform threedimensional (3D) reconstruction of the Earth's surface from multi-view optical satellite images. The framework is a complete processing pipeline, including pre-processing, a multi-view stereo (MVS) network for satellite imagery (Sat-MVSNet), and post-processing. The pre-processing handles the geometric and radiometric configuration of the multi-view images and their cropping. The cropped multi-view patches are then fed into SatMVSNet, which includes deep feature extraction, rational polynomial camera (RPC) warping, pyramid cost volume construction, regularization, and regression, to obtain the height maps. The error matches are then filtered out and a digital surface model (DSM) is generated in the post-processing. Considering the complexity and diversity of real-world scenes, we also introduce a self-refinement strategy that does not require any groundtruth labels to enhance the performance and robustness of the Sat-MVSF framework. We comprehensively compare the proposed framework with popular commercial software and open-source methods, to demonstrate the potential of the proposed deep learning framework. On the WHU-TLC dataset, where the images are captured with a three-line camera (TLC), the proposed framework outperforms all the other solutions in terms of reconstruction fineness, and also outperforms most of the other methods in terms of efficiency. On the challenging MVS3D dataset, where the images are captured by the WorldView-3 satellite at different times and seasons, the proposed framework also exceeds the existing methods when using the model pretrained on aerial images and the introduced self-refinement strategy, demonstrating a high generalization ability. We also note that the lack of training samples hinders research in this field, and the availability of more high-quality open-source training data will greatly accelerate the research into deep learning based MVS satellite image reconstruction. The code will be available at https://gpcv.whu.edu.cn/data.
引用
收藏
页码:446 / 461
页数:16
相关论文
共 50 条
  • [11] Review of multi-view stereo reconstruction methods based on deep learning
    Yan H.
    Xu F.
    Huang L.
    Liu C.
    Lin C.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2023, 31 (16): : 2444 - 2464
  • [12] Multi-view 3D reconstruction based on deep learning: A survey and comparison of methods
    Wu, Juhao
    Wyman, Omar
    Tang, Yadong
    Pasini, Damiano
    Wang, Wenlong
    Neurocomputing, 2024, 582
  • [13] Multi-view 3D reconstruction based on deep learning: A survey and comparison of methods
    Wu, Juhao
    Wyman, Omar
    Tang, Yadong
    Pasini, Damiano
    Wang, Wenlong
    NEUROCOMPUTING, 2024, 582
  • [14] A Scaled Monocular 3D Reconstruction Based on Structure from Motion and Multi-View Stereo
    Zhan, Zhiwen
    Yang, Fan
    Jiang, Jixin
    Du, Jialin
    Li, Fanxing
    Sun, Si
    Wei, Yan
    ELECTRONICS, 2024, 13 (19)
  • [15] A Framework for 3D Model Acquisition from Multi-View Images
    Duan, Chunmei
    PROCEEDINGS OF 2013 CHINESE INTELLIGENT AUTOMATION CONFERENCE: INTELLIGENT INFORMATION PROCESSING, 2013, 256 : 395 - 402
  • [16] 3D SURFACE RECONSTRUCTION FROM MULTI-VIEW AND MULTI-DATE GOOGLE EARTH SATELLITE IMAGES WITH 3D HOMOGRAPHY-BASED PROJECTIVE RECONSTRUCTION
    Lee, M. J.
    Park, S. Y.
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 43-B2 : 135 - 140
  • [17] Incremental Multi-view 3D Reconstruction Starting from Two Images Taken by a Stereo Pair of Cameras
    El Hazzat, Soulaiman
    Saaidi, Abderrahim
    Karam, Antoine
    Satori, Khalid
    3D RESEARCH, 2015, 6 (01)
  • [18] Combining Photometric Normals and Multi-View Stereo for 3D Reconstruction
    Grochulla, Martin
    Thormaehlen, Thorsten
    CVMP 2015: PROCEEDINGS OF THE 12TH EUROPEAN CONFERENCE ON VISUAL MEDIA PRODUCTION, 2015,
  • [19] Improvement on Matching Breakage of Multi-View Stereo 3D Reconstruction
    Lin, Hung-Lin
    Lin, Tsung-Yi
    Li, Yi-Xuan
    Tseng, Yu-Sheng
    Li, Xin-Yi
    Cal, Qlan-Wen
    Chen, Zheng
    Shi, Yi-Rou
    PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON ADVANCED MATERIALS FOR SCIENCE AND ENGINEERING (IEEE-ICAMSE 2016), 2016, : 423 - 425
  • [20] Multi-view stereo for weakly textured indoor 3D reconstruction
    Wang, Tao
    Gan, Vincent J. L.
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (10) : 1469 - 1489