A general deep learning based framework for 3D reconstruction from multi-view stereo satellite images

Cited by: 25

Authors:

Gao, Jian [1]

Liu, Jin [1]

Ji, Shunping [1]

Affiliations:

[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Peoples R China

Source:

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2023 / Volume 195

Funding:

National Natural Science Foundation of China;

Keywords:

Multi-view stereo; Optical satellite images; Deep learning; Dense matching; 3D reconstruction; IKONOS; MODEL;

DOI:

10.1016/j.isprsjprs.2022.12.012

Chinese Library Classification (CLC) number:

P9 [Physical geography];

Discipline classification code:

0705 ; 070501 ;

Abstract:

In this paper, we propose a general deep learning based framework, named Sat-MVSF, to perform three-dimensional (3D) reconstruction of the Earth's surface from multi-view optical satellite images. The framework is a complete processing pipeline, including pre-processing, a multi-view stereo (MVS) network for satellite imagery (Sat-MVSNet), and post-processing. The pre-processing handles the geometric and radiometric configuration of the multi-view images and their cropping. The cropped multi-view patches are then fed into Sat-MVSNet, which includes deep feature extraction, rational polynomial camera (RPC) warping, pyramid cost volume construction, regularization, and regression, to obtain the height maps. In the post-processing, erroneous matches are filtered out and a digital surface model (DSM) is generated. Considering the complexity and diversity of real-world scenes, we also introduce a self-refinement strategy that does not require any ground-truth labels to enhance the performance and robustness of the Sat-MVSF framework. We comprehensively compare the proposed framework with popular commercial software and open-source methods to demonstrate the potential of the proposed deep learning framework. On the WHU-TLC dataset, where the images are captured with a three-line camera (TLC), the proposed framework outperforms all the other solutions in terms of reconstruction fineness, and also outperforms most of the other methods in terms of efficiency. On the challenging MVS3D dataset, where the images are captured by the WorldView-3 satellite at different times and seasons, the proposed framework also exceeds the existing methods when using the model pretrained on aerial images and the introduced self-refinement strategy, demonstrating a high generalization ability.
We also note that the lack of training samples hinders research in this field, and the availability of more high-quality open-source training data will greatly accelerate the research into deep learning based MVS satellite image reconstruction. The code will be available at https://gpcv.whu.edu.cn/data.
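
The abstract names a regression step that turns the regularized cost volume into height maps but does not say how it is done. One common choice in MVSNet-style networks is soft-argmin regression: the per-pixel costs over the height hypotheses are converted to probabilities with a softmax, and the output height is the probability-weighted mean. The following is a minimal sketch of that idea for a single pixel, not the authors' implementation; the function name `soft_argmin_height` and the sample values are assumptions made for illustration.

```python
import math


def soft_argmin_height(costs, heights):
    """Expected height for one pixel (illustrative helper, not from the paper).

    costs   -- matching cost per height hypothesis (lower means better match)
    heights -- the height hypotheses, same length as costs
    """
    if not costs or len(costs) != len(heights):
        raise ValueError("costs and heights must be non-empty and equally long")

    # Softmax over negated costs; subtracting the minimum cost keeps exp() stable.
    lowest = min(costs)
    weights = [math.exp(-(c - lowest)) for c in costs]
    total = sum(weights)

    # Probability-weighted mean of the height hypotheses.
    return sum(w / total * h for w, h in zip(weights, heights))


heights = [10.0, 20.0, 30.0]                       # hypothetical height samples
flat = soft_argmin_height([1.0, 1.0, 1.0], heights)      # no hypothesis preferred
peaked = soft_argmin_height([0.0, 50.0, 50.0], heights)  # first hypothesis wins
```

With equal costs the result is the plain mean of the hypotheses, and with one clearly lowest cost it approaches that hypothesis, which is why this form gives sub-interval height estimates while staying differentiable.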

Pages: 446-461

Number of pages: 16

50 records in total

[1] Deep learning based multi-view stereo matching and 3D scene reconstruction from oblique aerial images
Liu, Jin
Gao, Jian
Ji, Shunping
Zeng, Chang
Zhang, Shaoyi
Gong, Jianya
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 204 : 42 - 60
[2] Underwater 3D reconstruction based on multi-view stereo
Gu, Feifei
Zhao, Juan
Xu, Pei
Huang, Shulan
Zhang, Gaopeng
Song, Zhan
OCEAN OPTICS AND INFORMATION TECHNOLOGY, 2018, 10850
[3] Review of multi-view stereo reconstruction methods based on deep learning
Yan H.
Xu F.
Huang L.
Liu C.
Lin C.
Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2023, 31 (16) : 2444 - 2464
[4] An attention-based and deep sparse priori cascade multi-view stereo network for 3D reconstruction
Wang, Yadong
Ran, Teng
Liang, Yuan
Zheng, Guoquan
COMPUTERS & GRAPHICS-UK, 2023, 116 : 383 - 392
[5] Pruning multi-view stereo net for efficient 3D reconstruction
Xiang, Xiang
Wang, Zhiyuan
Lao, Shanshan
Zhang, Baochang
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 168 : 17 - 27
[6] Attention aware cost volume pyramid based multi-view stereo network for 3D reconstruction
Yu, Anzhu
Guo, Wenyue
Liu, Bing
Chen, Xin
Wang, Xin
Cao, Xuefeng
Jiang, Bingchuan
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 175 : 448 - 460
[7] Multi-view 3D reconstruction based on deep learning: A survey and comparison of methods
Wu, Juhao
Wyman, Omar
Tang, Yadong
Pasini, Damiano
Wang, Wenlong
NEUROCOMPUTING, 2024, 582
[8] A Framework for 3D Model Acquisition from Multi-View Images
Duan, Chunmei
PROCEEDINGS OF 2013 CHINESE INTELLIGENT AUTOMATION CONFERENCE: INTELLIGENT INFORMATION PROCESSING, 2013, 256 : 395 - 402
[9] Multi-view stereo algorithms based on deep learning: a survey
Huang, Hongbo
Yan, Xiaoxu
Zheng, Yaolin
He, Jiayu
Xu, Longfei
Qin, Dechun
Multimedia Tools and Applications, 2025, 84 (06) : 2877 - 2908
[10] Engineering Monitoring and Change Detection for Multi-View Stereo 3D Reconstruction Technology
Chang T.-R.
Lee L.-H.
Journal of the Chinese Institute of Civil and Hydraulic Engineering, 2019, 31 (04) : 337 - 350