Deep learning based multi-view dense matching with joint depth and surface normal estimation

被引:0
作者
Liu, Jin [1 ,2 ]
Ji, Shunping [1 ]
机构
[1] School of Remote Sensing and Information Engineering, Wuhan University, Wuhan
[2] School of Communication Engineering, Hangzhou Dianzi University, Hangzhou
来源
Cehui Xuebao/Acta Geodaetica et Cartographica Sinica | 2025年 / 53卷 / 12期
基金
中国国家自然科学基金;
关键词
3D reconstruction; deep learning; depth estimation; multi-view dense matching; normal estimation;
D O I
10.11947/j.AGCS.2024.20230579
中图分类号
学科分类号
摘要
In recent years, deep learning-based multi-view stereo matching methods have demonstrated significant potential in 3D reconstruction tasks. Ho wever, they still exhibit limitations in recovering fine geometrie details of scènes. In some traditional multi-view stereo matching methods, surface normal of ten serves as a crucial geometrie constraint to assist in finer depth inference. Nevertheless, the surface normal information, which encapsulates the geometrie information of the scène, has not been fully utilized in modern learning-based methods. This paper introduces a deep learning-based joint depth and surface normal estimation method for multi-view dense matching and 3D scène reconstruction task. The proposed method employs a multi-stage pyramid structure to simultaneously infer depth and surface normal from multi-view images and promote their joint optimization. It consists of a feature extraction module, a normal-assisted depth estimation module, a depth-assisted normal estimation module, and a depth-normal joint optimization module. Specifically, the depth estimation module constructs a geometry-aware cost volume by integrating surface normal information for fine depth estimation. The normal estimation module utilizes depth constraints to build a local cost volume for inferring fine-grained normal maps. The joint optimization module further enhances the geometrie consistency between depth and normal estimation. Experimental results on the WHU-OMVS dataset demonstrate that the proposed method performs exceptionally well in both depth and surface normal estimation, outperforming existing methods. Furthermore, the 3D reconstruction results on two different datasets indicate that the proposed method effectively recovers the geometrie structures of both local high-curvature areas and global planar regions, contributing to well-structured and high-quality 3D scène models. © 2025 SinoMaps Press. All rights reserved.
引用
收藏
页码:2391 / 2403
页数:12
相关论文
共 31 条
  • [1] DONG Xiujun, DENG Bo, YUAN Feiyun, Et al., Application of aerial remote sensing in geological hazards: current situation and pros-pects, Geomatics and Information Science of Wuhan University, 48, 12, pp. 1897-1913, (2023)
  • [2] SHA Hongjun, YUAN Xiuxiao, State-of-the-art binocular image dense matching method, Geomatics and Information Science of Wuhan University, 48, 11, pp. 1813-1833, (2023)
  • [3] HONG Danfeng, ZHANG Bing, LI Hao, Et al., Cross-city matters: a multimodal remote sensing benchmark dataset for cross-city se-mantic segmentation using high-resolution domain adaptation networks, Remote Sensing of Environment, 299, (2023)
  • [4] HONG Danfeng, ZHANG Bing, LI Xuyang, Et al., SpectralGPT: spectral remote sensing foundation model[j], IEEE Transactions on Pattern Analysis and Machine Intelligence, 46, 8, pp. 5227-5244, (2024)
  • [5] KIRILLOV A, MINTUN E, RAVI N, Et al., Segment anythmg, Proceedings of 2023 IEEE/CVF International Conference on Computer Vision, pp. 4015-4026, (2023)
  • [6] SUN Xian, WANG Peijin, LU Wanxuan, Et al., RingMo: a remote sensing foundation model with masked image modeling[j], IEEE Transactions on Geoscience and Remote Sensing, 61, (2023)
  • [7] LIU Jin, JI Shunping, Deep learning based dense matching for aerial remote sensing images[j], Acta Geodaetica et Cartographica Sinica, 48, 9, pp. 1141-1150, (2019)
  • [8] JI Shunping, LIU Jin, LU Meng, CNN-based dense image matching for aerial remote sensing images, Photogrammetric Engineering &. Remote Sensing, 85, 6, pp. 415-424, (2019)
  • [9] JI Shunping, LUO Chong, LIU Jin, A review of dense stereo image matching methods based on deep learning[j], Geomatics and Information Science of Wuhan University, 46, 2, pp. 193-202, (2021)
  • [10] GONG Jianya, JI Shunping, Photogrammetry and deep learning, Acta Geodaetica et Cartographica Sinica, 47, 6, pp. 693-704, (2018)