A general deep learning based framework for 3D reconstruction from multi-view stereo satellite images

被引：25

作者：

Gao, Jian ^{[1
]}

Liu, Jin ^{[1
]}

Ji, Shunping ^{[1
]}

机构：

[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Peoples R China

来源：

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2023年 / 195卷

基金：

中国国家自然科学基金;

关键词：

Multi -view stereo; Optical satellite images; Deep learning; Dense matching; 3D reconstruction; IKONOS; MODEL;

D O I：

10.1016/j.isprsjprs.2022.12.012

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

In this paper, we propose a general deep learning based framework, named Sat-MVSF, to perform threedimensional (3D) reconstruction of the Earth's surface from multi-view optical satellite images. The framework is a complete processing pipeline, including pre-processing, a multi-view stereo (MVS) network for satellite imagery (Sat-MVSNet), and post-processing. The pre-processing handles the geometric and radiometric configuration of the multi-view images and their cropping. The cropped multi-view patches are then fed into SatMVSNet, which includes deep feature extraction, rational polynomial camera (RPC) warping, pyramid cost volume construction, regularization, and regression, to obtain the height maps. The error matches are then filtered out and a digital surface model (DSM) is generated in the post-processing. Considering the complexity and diversity of real-world scenes, we also introduce a self-refinement strategy that does not require any groundtruth labels to enhance the performance and robustness of the Sat-MVSF framework. We comprehensively compare the proposed framework with popular commercial software and open-source methods, to demonstrate the potential of the proposed deep learning framework. On the WHU-TLC dataset, where the images are captured with a three-line camera (TLC), the proposed framework outperforms all the other solutions in terms of reconstruction fineness, and also outperforms most of the other methods in terms of efficiency. On the challenging MVS3D dataset, where the images are captured by the WorldView-3 satellite at different times and seasons, the proposed framework also exceeds the existing methods when using the model pretrained on aerial images and the introduced self-refinement strategy, demonstrating a high generalization ability. We also note that the lack of training samples hinders research in this field, and the availability of more high-quality open-source training data will greatly accelerate the research into deep learning based MVS satellite image reconstruction. The code will be available at https://gpcv.whu.edu.cn/data.

引用

页码：446 / 461

页数：16

共 50 条

[11] Prior depth-based multi-view stereo network for online 3D model reconstruction
Song, Soohwan
Truong, Khang Giang
Kim, Daekyum
Jo, Sungho
PATTERN RECOGNITION, 2023, 136
[12] 3D Clothed Human Reconstruction from Sparse Multi-View Images
Hong, Jin Gyu
Noh, Seung Young
Lee, Hee Kyung
Cheong, Won Sik
Chang, Ju Yong
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 677 - 687
[13] Deep Learning for 3D Scene Reconstruction and Segmentation from Stereo Images
Kniaz, Vladimir V.
Knyaz, Vladimir A.
Ippolitov, Evgeny, V
Novikov, Mikhail M.
Grodzistky, Lev
Moshkantsev, Petr
MULTIMODAL SENSING AND ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS II, 2021, 11785
[14] View Planning for Multi-View Stereo 3D Reconstruction Using an Autonomous Multicopter
Korbinian Schmid
Heiko Hirschmüller
Andreas Dömel
Iris Grixa
Michael Suppa
Gerd Hirzinger
Journal of Intelligent & Robotic Systems, 2012, 65 : 309 - 323
[15] View Planning for Multi-View Stereo 3D Reconstruction Using an Autonomous Multicopter
Schmid, Korbinian
Hirschmueller, Heiko
Doemel, Andreas
Grixa, Iris
Suppa, Michael
Hirzinger, Gerd
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2012, 65 (1-4) : 309 - 323
[16] Multi-view stereo in the Deep Learning Era: A comprehensive revfiew
Wang, Xiang
Wang, Chen
Liu, Bing
Zhou, Xiaoqing
Zhang, Liang
Zheng, Jin
Bai, Xiao
DISPLAYS, 2021, 70
[17] GEMVS: a novel approach for automatic 3D reconstruction from uncalibrated multi-view Google Earth images using multi-view stereo and projective to metric 3D homography transformation
Park, Soon-Yong
Seo, DongUk
Lee, Min-Jae
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (09) : 3005 - 3030
[18] DSM Reconstruction from Uncalibrated Multi-View Satellite Stereo Images by RPC Estimation and Integration
Seo, Dong-Uk
Park, Soon-Yong
REMOTE SENSING, 2024, 16 (20)
[19] INVESTIGATING SPHERICAL EPIPOLAR RECTIFICATION FOR MULTI-VIEW STEREO 3D RECONSTRUCTION
Elhashash, M.
Qin, R.
XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 47 - 52
[20] Deep learning-based 3D reconstruction from multiple images: A survey
Wang, Chuhua
Reza, Md Alimoor
Vats, Vibhas
Ju, Yingnan
Thakurdesai, Nikhil
Wang, Yuchen
Crandall, David J.
Jung, Soon-heung
Seo, Jeongil
NEUROCOMPUTING, 2024, 597

← 1 2 3 4 5 →