A general deep learning based framework for 3D reconstruction from multi-view stereo satellite images

被引:25
|
作者
Gao, Jian [1 ]
Liu, Jin [1 ]
Ji, Shunping [1 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi -view stereo; Optical satellite images; Deep learning; Dense matching; 3D reconstruction; IKONOS; MODEL;
D O I
10.1016/j.isprsjprs.2022.12.012
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
In this paper, we propose a general deep learning based framework, named Sat-MVSF, to perform threedimensional (3D) reconstruction of the Earth's surface from multi-view optical satellite images. The framework is a complete processing pipeline, including pre-processing, a multi-view stereo (MVS) network for satellite imagery (Sat-MVSNet), and post-processing. The pre-processing handles the geometric and radiometric configuration of the multi-view images and their cropping. The cropped multi-view patches are then fed into SatMVSNet, which includes deep feature extraction, rational polynomial camera (RPC) warping, pyramid cost volume construction, regularization, and regression, to obtain the height maps. The error matches are then filtered out and a digital surface model (DSM) is generated in the post-processing. Considering the complexity and diversity of real-world scenes, we also introduce a self-refinement strategy that does not require any groundtruth labels to enhance the performance and robustness of the Sat-MVSF framework. We comprehensively compare the proposed framework with popular commercial software and open-source methods, to demonstrate the potential of the proposed deep learning framework. On the WHU-TLC dataset, where the images are captured with a three-line camera (TLC), the proposed framework outperforms all the other solutions in terms of reconstruction fineness, and also outperforms most of the other methods in terms of efficiency. On the challenging MVS3D dataset, where the images are captured by the WorldView-3 satellite at different times and seasons, the proposed framework also exceeds the existing methods when using the model pretrained on aerial images and the introduced self-refinement strategy, demonstrating a high generalization ability. We also note that the lack of training samples hinders research in this field, and the availability of more high-quality open-source training data will greatly accelerate the research into deep learning based MVS satellite image reconstruction. The code will be available at https://gpcv.whu.edu.cn/data.
引用
收藏
页码:446 / 461
页数:16
相关论文
共 50 条
  • [31] EMVS: Event-Based Multi-View Stereo—3D Reconstruction with an Event Camera in Real-Time
    Henri Rebecq
    Guillermo Gallego
    Elias Mueggler
    Davide Scaramuzza
    International Journal of Computer Vision, 2018, 126 : 1394 - 1414
  • [32] Adaptive Interaction-Based Multi-view 3D Object Reconstruction
    Miao, Jun
    Zheng, Yilin
    Yan, Jie
    Li, Lei
    Chu, Jun
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT II, 2023, 14255 : 51 - 64
  • [33] SA-MVSNet: Self-attention-based multi-view stereo network for 3D reconstruction of images with weak texture
    Yang, Ronghao
    Miao, Wang
    Zhang, Zhenxin
    Liu, Zhenlong
    Li, Mubai
    Lin, Bin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
  • [34] Review of multi-view 3D object recognition methods based on deep learning
    Qi, Shaohua
    Ning, Xin
    Yang, Guowei
    Zhang, Liping
    Long, Peng
    Cai, Weiwei
    Li, Weijun
    DISPLAYS, 2021, 69
  • [35] Cortical Volumetry using 3D Reconstruction of Metacarpal Bone from Multi-view Images
    Jayakar, Avinash D.
    Sambath, Gautham
    Areeckal, Anu Shaju
    David, Sumam S.
    2018 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2018, : 79 - 83
  • [36] HighRes-MVSNet: A Fast Multi-View Stereo Network for Dense 3D Reconstruction From High-Resolution Images
    Weilharter, Rafael
    Fraundorfer, Friedrich
    IEEE ACCESS, 2021, 9 : 11306 - 11315
  • [37] Multi-view 3D face reconstruction with deep recurrent neural networks
    Dou, Pengfei
    Kakadiaris, Ioannis A.
    IMAGE AND VISION COMPUTING, 2018, 80 : 80 - 91
  • [38] Unsupervised 3D reconstruction method based on multi-view propagation
    Luo J.
    Yuan D.
    Zhang L.
    Qu Y.
    Su S.
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2024, 42 (01): : 129 - 137
  • [39] MVS-T: A Coarse-to-Fine Multi-View Stereo Network with Transformer for Low-Resolution Images 3D Reconstruction
    Jia, Ruiming
    Chen, Xin
    Cui, Jiali
    Hu, Zhenghui
    SENSORS, 2022, 22 (19)
  • [40] Technical Consideration towards Robust 3D Reconstruction with Multi-View Active Stereo Sensors
    Jang, Mingyu
    Lee, Seongmin
    Kang, Jiwoo
    Lee, Sanghoon
    SENSORS, 2022, 22 (11)