A general deep learning based framework for 3D reconstruction from multi-view stereo satellite images

Cited by: 25
Authors
Gao, Jian [1]
Liu, Jin [1]
Ji, Shunping [1]
Affiliations
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multi-view stereo; Optical satellite images; Deep learning; Dense matching; 3D reconstruction; IKONOS; MODEL
DOI
10.1016/j.isprsjprs.2022.12.012
Chinese Library Classification number
P9 [Physical geography]
Discipline classification code
0705; 070501
Abstract
In this paper, we propose a general deep learning based framework, named Sat-MVSF, to perform three-dimensional (3D) reconstruction of the Earth's surface from multi-view optical satellite images. The framework is a complete processing pipeline, including pre-processing, a multi-view stereo (MVS) network for satellite imagery (Sat-MVSNet), and post-processing. The pre-processing handles the geometric and radiometric configuration of the multi-view images and their cropping. The cropped multi-view patches are then fed into Sat-MVSNet, which includes deep feature extraction, rational polynomial camera (RPC) warping, pyramid cost volume construction, regularization, and regression, to obtain the height maps. In the post-processing, erroneous matches are filtered out and a digital surface model (DSM) is generated. Considering the complexity and diversity of real-world scenes, we also introduce a self-refinement strategy that does not require any ground-truth labels to enhance the performance and robustness of the Sat-MVSF framework. We comprehensively compare the proposed framework with popular commercial software and open-source methods to demonstrate the potential of the proposed deep learning framework. On the WHU-TLC dataset, where the images are captured with a three-line camera (TLC), the proposed framework outperforms all the other solutions in terms of reconstruction fineness, and also outperforms most of the other methods in terms of efficiency. On the challenging MVS3D dataset, where the images are captured by the WorldView-3 satellite at different times and seasons, the proposed framework also exceeds the existing methods when using the model pretrained on aerial images and the introduced self-refinement strategy, demonstrating a high generalization ability.
We also note that the lack of training samples hinders research in this field, and the availability of more high-quality open-source training data will greatly accelerate the research into deep learning based MVS satellite image reconstruction. The code will be available at https://gpcv.whu.edu.cn/data.
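The abstract names a regression step that turns a regularized cost volume into a height map, but it does not say how that step is computed. As a rough illustration only, the sketch below assumes the soft-argmin style of regression that is common in learned MVS networks: for one pixel, the costs over a set of height hypotheses are converted to probabilities and the expected height is returned. The function names `softmax` and `regress_height`, and the sample heights, are hypothetical and are not taken from the Sat-MVSF code.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def regress_height(costs, heights):
    """Expected height for one pixel (assumed soft-argmin regression).

    costs   -- matching cost per height hypothesis (lower is better)
    heights -- the height hypotheses, in metres, same length as costs
    """
    if len(costs) != len(heights):
        raise ValueError("costs and heights must have the same length")
    # Lower cost should mean higher probability, so negate before softmax.
    probs = softmax([-c for c in costs])
    return sum(p * h for p, h in zip(probs, heights))


if __name__ == "__main__":
    # Hypothetical example: three height hypotheses, the middle one matches best.
    print(regress_height([5.0, 0.5, 5.0], [10.0, 20.0, 30.0]))
```

Because the result is a weighted average rather than a hard choice of the best hypothesis, it can fall between the sampled heights, which is one reason this kind of regression is popular in depth and height estimation.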
Pages: 446-461
Number of pages: 16
Related papers
50 in total
  • [41] Individual Tree Detection and Crown Delineation with 3D Information from Multi-view Satellite Images
    Xiao, Changlin
    Qin, Rongjun
    Xie, Xiao
    Huang, Xu
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2019, 85 (01): : 55 - 63
  • [42] Deployment of a deep-learning based multi-view stereo approach for measurement of ship shell plates
    He, Pengpeng
    Hu, Delin
    Hu, Yong
    OCEAN ENGINEERING, 2022, 260
  • [43] Accurate Multi-View Stereo 3D Reconstruction for Cost-Effective Plant Phenotyping
    Lou, Lu
    Liu, Yonghuai
    Han, Jiwan
    Doonan, John H.
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT II, 2014, 8815 : 349 - 356
  • [44] Modified U-Net based 3D reconstruction model to estimate volume from multi-view images of a solid object
    Dalai, Radhamadhab
    Senapati, Kishore Kumar
    Dalai, Nibedita
    IMAGING SCIENCE JOURNAL, 2023, 71 (02): : 110 - 127
  • [45] Multi-view self-supervised learning for 3D facial texture reconstruction from single image
    Zeng, Xiaoxing
    Hu, Ruyun
    Shi, Wu
    Qiao, Yu
    IMAGE AND VISION COMPUTING, 2021, 115
  • [46] 3D Reconstruction of Aircraft Structures via 2D Multi-view Images
    Zhang, Tianyou
    Fan, Runze
    Zhang, Yu
    Feng, Guangkun
    Wei, Zhenzhong
    TENTH INTERNATIONAL SYMPOSIUM ON PRECISION MECHANICAL MEASUREMENTS, 2021, 12059
  • [47] AN IMAGE-BASED TECHNIQUE FOR 3D BUILDING RECONSTRUCTION USING MULTI-VIEW UAV IMAGES
    Alidoost, F.
    Arefi, H.
    INTERNATIONAL CONFERENCE ON SENSORS & MODELS IN REMOTE SENSING & PHOTOGRAMMETRY, 2015, 41 (W5): : 43 - 46
  • [48] MFNet: Multi-level fusion aware feature pyramid based multi-view stereo network for 3D reconstruction
    Youcheng Cai
    Lin Li
    Dong Wang
    Xiaoping Liu
    Applied Intelligence, 2023, 53 : 4289 - 4301
  • [49] MFNet: Multi-level fusion aware feature pyramid based multi-view stereo network for 3D reconstruction
    Cai, Youcheng
    Li, Lin
    Wang, Dong
    Liu, Xiaoping
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4289 - 4301
  • [50] Charting the Landscape of Multi-view Stereo: An In-Depth Exploration of Deep Learning Techniques
    Zhou, Zhe
    Liu, Xiaozhang
    Tang, Xiangyan
    BIG DATA AND SECURITY, ICBDS 2023, PT I, 2024, 2099 : 152 - 165