Elevation Estimation-Driven Building 3-D Reconstruction From Single-View Remote Sensing Imagery

被引：15

作者：

Mao, Yongqiang ^{[1
,2
,3
,4
]}

Chen, Kaiqiang ^{[1
,2
]}

Zhao, Liangjin ^{[1
,2
]}

Chen, Wei ^{[5
]}

Tang, Deke ^{[5
]}

Liu, Wenjie ^{[1
,2
,3
,4
]}

Wang, Zhirui ^{[1
,2
]}

Diao, Wenhui ^{[1
,2
]}

Sun, Xian ^{[1
,2
,3
,4
]}

Fu, Kun ^{[1
,2
,3
,4
]}

机构：

[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China

[2] Chinese Acad Sci, Key Lab Network Informat Syst Technol NIST, Aerosp Informat Res Inst, Beijing 100190, Peoples R China

[3] Univ Chinese Acad Sci, Beijing 100190, Peoples R China

[4] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China

[5] Geovis Technol Co Ltd, Hefei, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷

关键词：

Buildings; Image reconstruction; Three-dimensional displays; Point cloud compression; Solid modeling; Semantics; Remote sensing; 3-D building reconstruction; DSM estimation; elevation semantic flow (ESF); remote sensing images; 3D RECONSTRUCTION; OBJECT DETECTION; AERIAL IMAGES; POINT CLOUDS; MODELS;

D O I：

10.1109/TGRS.2023.3266477

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Building 3-D reconstruction from remote sensing images has a wide range of applications in smart cities, photogrammetry, and other fields. Methods for automatic 3-D urban building modeling typically employ multiview images as input to algorithms to recover point clouds and 3-D models of buildings. However, such models rely heavily on multiview images of buildings, which are time-intensive and limit the applicability and practicality of the models. To solve these issues, we focus on designing an efficient DSM estimation-driven reconstruction framework (Building3-D), which aims to reconstruct 3-D building models from the input single-view remote sensing image. Existing DSM estimation networks suffer from the imbalance between local and global features, which leads to oversmooth DSM estimates at instance boundaries. To address this issue, we propose a Semantic Flow Field-guided DSM Estimation (SFFDE) network, which utilizes the proposed concept of elevation semantic flow (ESF) to achieve the registration of local and global features. First, in order to make the network semantics globally aware, we propose an elevation semantic globalization (ESG) module to realize the semantic globalization of instances. Furthermore, in order to alleviate the semantic span of global features and original local features, we propose a local-to-global elevation semantic registration (L2G-ESR) module based on ESF. Our Building3-D is rooted in the SFFDE network for building elevation prediction, synchronized with a building extraction network for building masks, and then sequentially performs point cloud reconstruction and surface reconstruction (or CityGML model reconstruction). On this basis, our Building3-D can optionally generate CityGML models or surface mesh models of the buildings. Extensive experiments on ISPRS Vaihingen and DFC2019 datasets on the DSM estimation task show that our SFFDE significantly improves upon state-of-the-art, and d1, d2, and d3 metrics of our SFFDE are improved to 0.595, 0.897, and 0.970. Furthermore, our Building3D achieves impressive results in the 3-D point cloud and 3-D model reconstruction process.

引用

页数：18

共 74 条

[1] 2D Image-To-3D Model: Knowledge-Based 3D Building Reconstruction (3DBR) Using Single Aerial Images and Convolutional Neural Networks (CNNs)
Alidoost, Fatemeh
Arefi, Hossein
Tombari, Federico
[J]. REMOTE SENSING, 2019, 11 (19)
[2] Height estimation from single aerial images using a deep convolutional encoder-decoder network
Amirkolaee, Hamed Amini
Arefi, Hossein
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 149 : 50 - 66
[3] Ba JL., 2016, arXiv
[4] Baatz M., 1999, P 2 INT S OPERATIONA
[5] Batra D, 2012, PROC CVPR IEEE, P2136, DOI 10.1109/CVPR.2012.6247920
[6] AdaBins: Depth Estimation Using Adaptive Bins
Bhat, Shariq Farooq
Alhashim, Ibraheem
Wonka, Peter
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4008 - 4017
[7] Semantic Stereo for Incidental Satellite Images
Bosch, Marc
Foster, Kevin
Christie, Gordon
Wang, Sean
Hager, Gregory D.
Brown, Myron
[J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1524 - 1532
[8] Context-based automatic reconstruction and texturing of 3D urban terrain for quick-response tasks
Bulatov, Dimitri
Haeufel, Gisela
Meidow, Jochen
Pohl, Melanie
Solbrig, Peter
Wernerus, Peter
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 93 : 157 - 170
[9] Carvalho M, 2018, IEEE IMAGE PROC, P2915, DOI 10.1109/ICIP.2018.8451312
[10] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, Liang-Chieh
Zhu, Yukun
Papandreou, George
Schroff, Florian
Adam, Hartwig
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851

← 1 2 3 4 5 6 7 8 →