A general deep learning based framework for 3D reconstruction from multi-view stereo satellite images

被引:25
|
作者
Gao, Jian [1 ]
Liu, Jin [1 ]
Ji, Shunping [1 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi -view stereo; Optical satellite images; Deep learning; Dense matching; 3D reconstruction; IKONOS; MODEL;
D O I
10.1016/j.isprsjprs.2022.12.012
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
In this paper, we propose a general deep learning based framework, named Sat-MVSF, to perform threedimensional (3D) reconstruction of the Earth's surface from multi-view optical satellite images. The framework is a complete processing pipeline, including pre-processing, a multi-view stereo (MVS) network for satellite imagery (Sat-MVSNet), and post-processing. The pre-processing handles the geometric and radiometric configuration of the multi-view images and their cropping. The cropped multi-view patches are then fed into SatMVSNet, which includes deep feature extraction, rational polynomial camera (RPC) warping, pyramid cost volume construction, regularization, and regression, to obtain the height maps. The error matches are then filtered out and a digital surface model (DSM) is generated in the post-processing. Considering the complexity and diversity of real-world scenes, we also introduce a self-refinement strategy that does not require any groundtruth labels to enhance the performance and robustness of the Sat-MVSF framework. We comprehensively compare the proposed framework with popular commercial software and open-source methods, to demonstrate the potential of the proposed deep learning framework. On the WHU-TLC dataset, where the images are captured with a three-line camera (TLC), the proposed framework outperforms all the other solutions in terms of reconstruction fineness, and also outperforms most of the other methods in terms of efficiency. On the challenging MVS3D dataset, where the images are captured by the WorldView-3 satellite at different times and seasons, the proposed framework also exceeds the existing methods when using the model pretrained on aerial images and the introduced self-refinement strategy, demonstrating a high generalization ability. We also note that the lack of training samples hinders research in this field, and the availability of more high-quality open-source training data will greatly accelerate the research into deep learning based MVS satellite image reconstruction. The code will be available at https://gpcv.whu.edu.cn/data.
引用
收藏
页码:446 / 461
页数:16
相关论文
共 50 条
  • [31] View Planning for Multi-View Stereo 3D Reconstruction Using an Autonomous Multicopter
    Schmid, Korbinian
    Hirschmueller, Heiko
    Doemel, Andreas
    Grixa, Iris
    Suppa, Michael
    Hirzinger, Gerd
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2012, 65 (1-4) : 309 - 323
  • [32] REPRESENTATION LEARNING OF VERTEX HEATMAPS FOR 3D HUMAN MESH RECONSTRUCTION FROM MULTI-VIEW IMAGES
    Chun, Sungho
    Park, Sungbum
    Chang, Ju Yong
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 670 - 674
  • [33] GEMVS: a novel approach for automatic 3D reconstruction from uncalibrated multi-view Google Earth images using multi-view stereo and projective to metric 3D homography transformation
    Park, Soon-Yong
    Seo, DongUk
    Lee, Min-Jae
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (09) : 3005 - 3030
  • [34] DSM Reconstruction from Uncalibrated Multi-View Satellite Stereo Images by RPC Estimation and Integration
    Seo, Dong-Uk
    Park, Soon-Yong
    REMOTE SENSING, 2024, 16 (20)
  • [35] DETransMVSnet: Research on Terahertz 3D Reconstruction of Multi-View Stereo Network With Deep Equilibrium Transformers
    Bai, Fan
    Li, Lun
    Wang, Wencheng
    Wu, Xiaojin
    IEEE ACCESS, 2023, 11 : 146042 - 146053
  • [36] INVESTIGATING SPHERICAL EPIPOLAR RECTIFICATION FOR MULTI-VIEW STEREO 3D RECONSTRUCTION
    Elhashash, M.
    Qin, R.
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 47 - 52
  • [37] User-guided 3D reconstruction using multi-view stereo
    Rasmuson, Sverker
    Sintorn, Erik
    Assarsson, Ulf
    I3D 2020: ACM SIGGRAPH SYMPOSIUM ON INTERACTIVE 3D GRAPHICS AND GAMES, 2020,
  • [38] Accurate stereo 3D point cloud generation suitable for multi-view stereo reconstruction
    Kordelas, Georgios A.
    Daras, Petros
    Klavdianos, Patrycia
    Izquierdo, Ebroul
    Zhang, Qianni
    2014 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING CONFERENCE, 2014, : 307 - 310
  • [39] A real sense 3D face reconstruction system based on multi-view stereo vision
    Li, Ke
    Zeng, Dong
    Zhang, Jun
    Lin, Rui
    Gao, Luobin
    Liao, Xiaoli
    Journal of Information and Computational Science, 2015, 12 (10): : 3739 - 3753
  • [40] AN AUTOMATIC 3D RECONSTRUCTION METHOD BASED ON MULTI-VIEW STEREO VISION FOR THE MOGAO GROTTOES
    Xiong, Jie
    Zhong, Sidong
    Zheng, Lin
    INDOOR-OUTDOOR SEAMLESS MODELLING, MAPPING AND NAVIGATION, 2015, 44 (W5): : 171 - 176