HMSM-Net: Hierarchical multi-scale matching network for disparity estimation of high-resolution satellite stereo images

被引:26
|
作者
He, Sheng [1 ]
Li, Shenhong [1 ]
Jiang, San [2 ]
Jiang, Wanshou [1 ,3 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
[2] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[3] Wuhan Univ, Collaborat Innovat Ctr Geospatial Technol, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
Satellite stereo images; Disparity estimation; Convolutional neural network; Hierarchical multi-scale matching; GaoFen-7; dataset;
D O I
10.1016/j.isprsjprs.2022.04.020
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Disparity estimation of satellite stereo images is an essential and challenging task in photogrammetry and remote sensing. Recent researches have greatly promoted the development of disparity estimation algorithms by using CNN (Convolutional Neural Networks) based deep learning techniques. However, it is still difficult to handle intractable regions that are mainly caused by occlusions, disparity discontinuities, texture-less areas, and re-petitive patterns. Besides, the lack of training datasets for satellite stereo images remains another major issue that blocks the usage of CNN techniques due to the difficulty of obtaining ground-truth disparities. In this paper, we propose an end-to-end disparity learning model, termed hierarchical multi-scale matching network (HMSM-Net), for the disparity estimation of high-resolution satellite stereo images. First, multi-scale cost volumes are con-structed by using pyramidal features that capture spatial information of multiple levels, which learn corre-spondences at multiple scales and enable HMSM-Net to be more robust in intractable regions. Second, stereo matching is executed in a hierarchical coarse-to-fine manner by applying supervision to each scale, which allows a lower scale to act as prior knowledge and guides a higher scale to attain finer matching results. Third, a refinement module that incorporates the intensity and gradient information of the input left image is designed to regress a detailed full-resolution disparity map for local structure preservation. For network training and testing, a dense stereo matching dataset is created and published by using GaoFen-7 satellite stereo images. Finally, the proposed network is evaluated on the Urban Semantic 3D and GaoFen-7 datasets. Experimental results demonstrate that HMSM-Net achieves superior accuracy compared with state-of-the-art methods, and the improvement on intractable regions is noteworthy. Additionally, results and comparisons of different methods on the GaoFen-7 dataset show that it can severs as a challenging benchmark for performance assessment of methods applied to disparity estimation of satellite stereo images. The source codes and evaluation dataset are made publicly available at https://github.com/Sheng029/HMSM-Net.
引用
收藏
页码:314 / 330
页数:17
相关论文
共 50 条
  • [21] MS-HRNet: multi-scale high-resolution network for human pose estimation
    Wang, Yanxia
    Wang, Renjie
    Shi, Hu
    Liu, Dan
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (12): : 17269 - 17291
  • [22] Multi-scale hierarchical sampling change detection using Random Forest for high-resolution satellite imagery
    Bai, Ting
    Sun, Kaimin
    Deng, Shiquan
    Li, Deren
    Li, Wenzhuo
    Chen, Yepei
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2018, 39 (21) : 7523 - 7546
  • [23] Cascaded recurrent networks with masked representation learning for stereo matching of high-resolution satellite images
    Rao, Zhibo
    Li, Xing
    Xiong, Bangshu
    Dai, Yuchao
    Shen, Zhelun
    Li, Hangbiao
    Lou, Yue
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2024, 218 : 151 - 165
  • [24] Multi-scale graph neural network for global stereo matching
    Wang, Xiaofeng
    Yu, Jun
    Sun, Zhiheng
    Sun, Jiameng
    Su, Yingying
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 118
  • [25] MPANET: MULTI-SCALE PYRAMID AGGREGATION NETWORK FOR STEREO MATCHING
    Zhu, Ziyu
    Guo, Wei
    Chen, Wei
    Li, Qiuping
    Zhao, Yong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2773 - 2777
  • [26] Multi-Scale Cost Volumes Cascade Network for Stereo Matching
    Jia, Xiaogang
    Chen, Wei
    Liang, Chen Li Zhengfa
    Wu, Mingfei
    Tan, Yusong
    Huang, Libo
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 8657 - 8663
  • [27] DEEP LEARNING-BASED STEREO MATCHING FOR HIGH-RESOLUTION SATELLITE IMAGES: A COMPARATIVE EVALUATION
    He, X.
    Jiang, S.
    He, S.
    Li, Q.
    Jiang, W.
    Wang, L.
    GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 1635 - 1642
  • [28] Multi-Scale Attention Network for Building Extraction from High-Resolution Remote Sensing Images
    Chang, Jing
    He, Xiaohui
    Li, Panle
    Tian, Ting
    Cheng, Xijie
    Qiao, Mengjia
    Zhou, Tao
    Zhang, Beibei
    Chang, Ziqian
    Fan, Tingwei
    SENSORS, 2024, 24 (03)
  • [29] Geospatial Object Detection in High Resolution Satellite Images Based on Multi-Scale Convolutional Neural Network
    Guo, Wei
    Yang, Wen
    Zhang, Haijian
    Hua, Guang
    REMOTE SENSING, 2018, 10 (01)
  • [30] An efficient photogrammetric stereo matching method for high-resolution images
    Li, Yingsong
    Zheng, Shunyi
    Wang, Xiaonan
    Ma, Hao
    COMPUTERS & GEOSCIENCES, 2016, 97 : 58 - 66