HMSM-Net: Hierarchical multi-scale matching network for disparity estimation of high-resolution satellite stereo images

Cited by: 28
Authors
He, Sheng [1]
Li, Shenhong [1]
Jiang, San [2]
Jiang, Wanshou [1, 3]
Affiliations
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
[2] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[3] Wuhan Univ, Collaborat Innovat Ctr Geospatial Technol, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Satellite stereo images; Disparity estimation; Convolutional neural network; Hierarchical multi-scale matching; GaoFen-7; dataset;
DOI
10.1016/j.isprsjprs.2022.04.020
Chinese Library Classification number
P9 [Physical Geography];
Discipline classification code
0705 ; 070501 ;
Abstract
Disparity estimation of satellite stereo images is an essential and challenging task in photogrammetry and remote sensing. Recent research has greatly advanced disparity estimation algorithms through deep learning techniques based on convolutional neural networks (CNNs). However, it is still difficult to handle intractable regions, which are mainly caused by occlusions, disparity discontinuities, texture-less areas, and repetitive patterns. In addition, because ground-truth disparities are difficult to obtain, the lack of training datasets for satellite stereo images remains another major obstacle to the use of CNN techniques. In this paper, we propose an end-to-end disparity learning model, termed hierarchical multi-scale matching network (HMSM-Net), for the disparity estimation of high-resolution satellite stereo images. First, multi-scale cost volumes are constructed from pyramidal features that capture spatial information at multiple levels; these volumes learn correspondences at multiple scales and make HMSM-Net more robust in intractable regions. Second, stereo matching is executed in a hierarchical coarse-to-fine manner by applying supervision to each scale, which allows a lower scale to act as prior knowledge and guide a higher scale to attain finer matching results. Third, a refinement module that incorporates the intensity and gradient information of the input left image is designed to regress a detailed full-resolution disparity map for local structure preservation. For network training and testing, a dense stereo matching dataset is created and published by using GaoFen-7 satellite stereo images. Finally, the proposed network is evaluated on the Urban Semantic 3D and GaoFen-7 datasets. Experimental results demonstrate that HMSM-Net achieves superior accuracy compared with state-of-the-art methods, and the improvement in intractable regions is noteworthy. Additionally, results and comparisons of different methods on the GaoFen-7 dataset show that it can serve as a challenging benchmark for performance assessment of methods applied to disparity estimation of satellite stereo images. The source code and evaluation dataset are publicly available at https://github.com/Sheng029/HMSM-Net.
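The coarse-to-fine idea in the abstract, where a lower scale acts as a prior for a higher scale, can be illustrated with a small classical sketch. This is not the HMSM-Net implementation: it swaps the learned pyramidal features for average-pooled intensities, the learned matching for an absolute-difference cost volume with winner-take-all selection, and it assumes non-negative disparities. All function names and parameters (`avg_pool`, `cost_volume`, `coarse_to_fine`, `radius`) are hypothetical and chosen only for illustration.

```python
import numpy as np

def avg_pool(img, factor):
    """Average-pool a 2D image by an integer factor (sizes must divide evenly)."""
    h, w = img.shape
    return img.reshape(h // factor, factor, w // factor, factor).mean(axis=(1, 3))

def cost_volume(left, right, n_disp):
    """Absolute-difference cost volume of shape (n_disp, H, W).

    cost[d, y, x] = |left[y, x] - right[y, x - d]|; positions where x - d
    falls outside the image keep an infinite cost.
    """
    h, w = left.shape
    cost = np.full((n_disp, h, w), np.inf)
    for d in range(min(n_disp, w)):
        cost[d, :, d:] = np.abs(left[:, d:] - right[:, :w - d])
    return cost

def coarse_to_fine(left, right, max_disp, factors=(4, 2, 1), radius=1):
    """Winner-take-all matching from the coarsest to the finest scale.

    Each finer scale only searches within `radius` of the upsampled
    disparity found at the previous (coarser) scale. Returns a dict mapping
    each downsampling factor to a disparity map in that scale's pixel units.
    Assumption: image height and width are divisible by max(factors).
    """
    results, prior, prev_f = {}, None, None
    for f in factors:
        l, r = avg_pool(left, f), avg_pool(right, f)
        n_disp = max(1, max_disp // f)
        cost = cost_volume(l, r, n_disp)
        if prior is not None:
            s = prev_f // f
            # Nearest-neighbour upsampling; disparities scale with resolution.
            up = np.repeat(np.repeat(prior, s, axis=0), s, axis=1) * s
            candidates = np.arange(n_disp)[:, None, None]
            outside = np.abs(candidates - up[None]) > radius
            cost = np.where(outside, np.inf, cost)
        prior = np.argmin(cost, axis=0)
        results[f], prev_f = prior, f
    return results

# Synthetic pair: the right image is the left image shifted by 4 pixels.
rng = np.random.default_rng(0)
left_img = rng.random((16, 64))
right_img = np.roll(left_img, -4, axis=1)
disparities = coarse_to_fine(left_img, right_img, max_disp=16)
```

In this toy setting the coarse estimate narrows the search at each finer scale to three candidates instead of the full disparity range, which is the role the abstract assigns to the lower scale. The network described above instead learns its features and costs and applies supervision at every scale.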
Pages: 314-330
Number of pages: 17