A general deep learning based framework for 3D reconstruction from multi-view stereo satellite images

被引:25
|
作者
Gao, Jian [1 ]
Liu, Jin [1 ]
Ji, Shunping [1 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi -view stereo; Optical satellite images; Deep learning; Dense matching; 3D reconstruction; IKONOS; MODEL;
D O I
10.1016/j.isprsjprs.2022.12.012
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
In this paper, we propose a general deep learning based framework, named Sat-MVSF, to perform threedimensional (3D) reconstruction of the Earth's surface from multi-view optical satellite images. The framework is a complete processing pipeline, including pre-processing, a multi-view stereo (MVS) network for satellite imagery (Sat-MVSNet), and post-processing. The pre-processing handles the geometric and radiometric configuration of the multi-view images and their cropping. The cropped multi-view patches are then fed into SatMVSNet, which includes deep feature extraction, rational polynomial camera (RPC) warping, pyramid cost volume construction, regularization, and regression, to obtain the height maps. The error matches are then filtered out and a digital surface model (DSM) is generated in the post-processing. Considering the complexity and diversity of real-world scenes, we also introduce a self-refinement strategy that does not require any groundtruth labels to enhance the performance and robustness of the Sat-MVSF framework. We comprehensively compare the proposed framework with popular commercial software and open-source methods, to demonstrate the potential of the proposed deep learning framework. On the WHU-TLC dataset, where the images are captured with a three-line camera (TLC), the proposed framework outperforms all the other solutions in terms of reconstruction fineness, and also outperforms most of the other methods in terms of efficiency. On the challenging MVS3D dataset, where the images are captured by the WorldView-3 satellite at different times and seasons, the proposed framework also exceeds the existing methods when using the model pretrained on aerial images and the introduced self-refinement strategy, demonstrating a high generalization ability. We also note that the lack of training samples hinders research in this field, and the availability of more high-quality open-source training data will greatly accelerate the research into deep learning based MVS satellite image reconstruction. The code will be available at https://gpcv.whu.edu.cn/data.
引用
收藏
页码:446 / 461
页数:16
相关论文
共 50 条
  • [11] Prior depth-based multi-view stereo network for online 3D model reconstruction
    Song, Soohwan
    Truong, Khang Giang
    Kim, Daekyum
    Jo, Sungho
    PATTERN RECOGNITION, 2023, 136
  • [12] 3D Clothed Human Reconstruction from Sparse Multi-View Images
    Hong, Jin Gyu
    Noh, Seung Young
    Lee, Hee Kyung
    Cheong, Won Sik
    Chang, Ju Yong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 677 - 687
  • [13] Deep Learning for 3D Scene Reconstruction and Segmentation from Stereo Images
    Kniaz, Vladimir V.
    Knyaz, Vladimir A.
    Ippolitov, Evgeny, V
    Novikov, Mikhail M.
    Grodzistky, Lev
    Moshkantsev, Petr
    MULTIMODAL SENSING AND ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS II, 2021, 11785
  • [14] View Planning for Multi-View Stereo 3D Reconstruction Using an Autonomous Multicopter
    Korbinian Schmid
    Heiko Hirschmüller
    Andreas Dömel
    Iris Grixa
    Michael Suppa
    Gerd Hirzinger
    Journal of Intelligent & Robotic Systems, 2012, 65 : 309 - 323
  • [15] View Planning for Multi-View Stereo 3D Reconstruction Using an Autonomous Multicopter
    Schmid, Korbinian
    Hirschmueller, Heiko
    Doemel, Andreas
    Grixa, Iris
    Suppa, Michael
    Hirzinger, Gerd
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2012, 65 (1-4) : 309 - 323
  • [16] Multi-view stereo in the Deep Learning Era: A comprehensive revfiew
    Wang, Xiang
    Wang, Chen
    Liu, Bing
    Zhou, Xiaoqing
    Zhang, Liang
    Zheng, Jin
    Bai, Xiao
    DISPLAYS, 2021, 70
  • [17] GEMVS: a novel approach for automatic 3D reconstruction from uncalibrated multi-view Google Earth images using multi-view stereo and projective to metric 3D homography transformation
    Park, Soon-Yong
    Seo, DongUk
    Lee, Min-Jae
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (09) : 3005 - 3030
  • [18] DSM Reconstruction from Uncalibrated Multi-View Satellite Stereo Images by RPC Estimation and Integration
    Seo, Dong-Uk
    Park, Soon-Yong
    REMOTE SENSING, 2024, 16 (20)
  • [19] INVESTIGATING SPHERICAL EPIPOLAR RECTIFICATION FOR MULTI-VIEW STEREO 3D RECONSTRUCTION
    Elhashash, M.
    Qin, R.
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 47 - 52
  • [20] Deep learning-based 3D reconstruction from multiple images: A survey
    Wang, Chuhua
    Reza, Md Alimoor
    Vats, Vibhas
    Ju, Yingnan
    Thakurdesai, Nikhil
    Wang, Yuchen
    Crandall, David J.
    Jung, Soon-heung
    Seo, Jeongil
    NEUROCOMPUTING, 2024, 597