Elevation Estimation-Driven Building 3-D Reconstruction From Single-View Remote Sensing Imagery

被引:15
作者
Mao, Yongqiang [1 ,2 ,3 ,4 ]
Chen, Kaiqiang [1 ,2 ]
Zhao, Liangjin [1 ,2 ]
Chen, Wei [5 ]
Tang, Deke [5 ]
Liu, Wenjie [1 ,2 ,3 ,4 ]
Wang, Zhirui [1 ,2 ]
Diao, Wenhui [1 ,2 ]
Sun, Xian [1 ,2 ,3 ,4 ]
Fu, Kun [1 ,2 ,3 ,4 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Key Lab Network Informat Syst Technol NIST, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
[4] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[5] Geovis Technol Co Ltd, Hefei, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
关键词
Buildings; Image reconstruction; Three-dimensional displays; Point cloud compression; Solid modeling; Semantics; Remote sensing; 3-D building reconstruction; DSM estimation; elevation semantic flow (ESF); remote sensing images; 3D RECONSTRUCTION; OBJECT DETECTION; AERIAL IMAGES; POINT CLOUDS; MODELS;
D O I
10.1109/TGRS.2023.3266477
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Building 3-D reconstruction from remote sensing images has a wide range of applications in smart cities, photogrammetry, and other fields. Methods for automatic 3-D urban building modeling typically employ multiview images as input to algorithms to recover point clouds and 3-D models of buildings. However, such models rely heavily on multiview images of buildings, which are time-intensive and limit the applicability and practicality of the models. To solve these issues, we focus on designing an efficient DSM estimation-driven reconstruction framework (Building3-D), which aims to reconstruct 3-D building models from the input single-view remote sensing image. Existing DSM estimation networks suffer from the imbalance between local and global features, which leads to oversmooth DSM estimates at instance boundaries. To address this issue, we propose a Semantic Flow Field-guided DSM Estimation (SFFDE) network, which utilizes the proposed concept of elevation semantic flow (ESF) to achieve the registration of local and global features. First, in order to make the network semantics globally aware, we propose an elevation semantic globalization (ESG) module to realize the semantic globalization of instances. Furthermore, in order to alleviate the semantic span of global features and original local features, we propose a local-to-global elevation semantic registration (L2G-ESR) module based on ESF. Our Building3-D is rooted in the SFFDE network for building elevation prediction, synchronized with a building extraction network for building masks, and then sequentially performs point cloud reconstruction and surface reconstruction (or CityGML model reconstruction). On this basis, our Building3-D can optionally generate CityGML models or surface mesh models of the buildings. Extensive experiments on ISPRS Vaihingen and DFC2019 datasets on the DSM estimation task show that our SFFDE significantly improves upon state-of-the-art, and d1, d2, and d3 metrics of our SFFDE are improved to 0.595, 0.897, and 0.970. Furthermore, our Building3D achieves impressive results in the 3-D point cloud and 3-D model reconstruction process.
引用
收藏
页数:18
相关论文
共 74 条
  • [1] 2D Image-To-3D Model: Knowledge-Based 3D Building Reconstruction (3DBR) Using Single Aerial Images and Convolutional Neural Networks (CNNs)
    Alidoost, Fatemeh
    Arefi, Hossein
    Tombari, Federico
    [J]. REMOTE SENSING, 2019, 11 (19)
  • [2] Height estimation from single aerial images using a deep convolutional encoder-decoder network
    Amirkolaee, Hamed Amini
    Arefi, Hossein
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 149 : 50 - 66
  • [3] Ba JL., 2016, arXiv
  • [4] Baatz M., 1999, P 2 INT S OPERATIONA
  • [5] Batra D, 2012, PROC CVPR IEEE, P2136, DOI 10.1109/CVPR.2012.6247920
  • [6] AdaBins: Depth Estimation Using Adaptive Bins
    Bhat, Shariq Farooq
    Alhashim, Ibraheem
    Wonka, Peter
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4008 - 4017
  • [7] Semantic Stereo for Incidental Satellite Images
    Bosch, Marc
    Foster, Kevin
    Christie, Gordon
    Wang, Sean
    Hager, Gregory D.
    Brown, Myron
    [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1524 - 1532
  • [8] Context-based automatic reconstruction and texturing of 3D urban terrain for quick-response tasks
    Bulatov, Dimitri
    Haeufel, Gisela
    Meidow, Jochen
    Pohl, Melanie
    Solbrig, Peter
    Wernerus, Peter
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 93 : 157 - 170
  • [9] Carvalho M, 2018, IEEE IMAGE PROC, P2915, DOI 10.1109/ICIP.2018.8451312
  • [10] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851